LOCUS NC_012037 3653 bp DNA circular BCT 06-JAN-2010 DEFINITION Anaerocellum thermophilum DSM 6725 plasmid pATHE02, complete sequence. ACCESSION NC_012037 VERSION NC_012037.1 GI:222530721 DBLINK Project:29407 KEYWORDS . SOURCE Anaerocellum thermophilum DSM 6725 ORGANISM Anaerocellum thermophilum DSM 6725 Bacteria; Firmicutes; Clostridia; Clostridiales; Anaerocellum group; Anaerocellum. REFERENCE 1 (bases 1 to 3653) AUTHORS Kataeva,I.A., Yang,S.J., Dam,P., Poole,F.L. II, Yin,Y., Zhou,F., Chou,W.C., Xu,Y., Goodwin,L., Sims,D.R., Detter,J.C., Hauser,L.J., Westpheling,J. and Adams,M.W. TITLE Genome sequence of the anaerobic, thermophilic, and cellulolytic bacterium 'Anaerocellum thermophilum' DSM 6725 JOURNAL J. Bacteriol. 191 (11), 3760-3761 (2009) PUBMED 19346307 REFERENCE 2 (bases 1 to 3653) AUTHORS Lucas,S., Copeland,A., Lapidus,A., Glavina del Rio,T., Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Sims,D., Meincke,L., Brettin,T., Detter,J.C., Han,C., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Ovchinnikova,G., Kataeva,I. and Adams,M.W.W. CONSRTM US DOE Joint Genome Institute TITLE Complete sequence of plasmid2 of Anaerocellum thermophilum DSM 6725 JOURNAL Unpublished REFERENCE 3 (bases 1 to 3653) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (04-FEB-2009) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 4 (bases 1 to 3653) AUTHORS Lucas,S., Copeland,A., Lapidus,A., Glavina del Rio,T., Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Sims,D., Meincke,L., Brettin,T., Detter,J.C., Han,C., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Ovchinnikova,G., Kataeva,I. and Adams,M.W.W. CONSRTM US DOE Joint Genome Institute TITLE Direct Submission JOURNAL Submitted (26-JAN-2009) US DOE Joint Genome Institute, 2800 Mitchell Drive B310, Walnut Creek, CA 94598-1698, USA COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. The reference sequence was derived from CP001395. URL -- http://www.jgi.doe.gov JGI Project ID: 4084710 Source DNA available from Michael Adams (adams@bmb.uga.edu) Bacteria available from DSMZ: DSM 6725 Contacts: Michael Adams (adams@bmb.uga.edu) David Bruce (microbe@cuba.jgi-psf.org) Annotation done by JGI-ORNL and JGI-PGF Finishing done by JGI-LANL Finished microbial genomes have been curated to close all gaps with greater than 98% coverage of at least two independent clones. Each base pair has a minimum q (quality) value of 30 and the total error rate is less than one per 50000. The JGI and collaborators endorse the principles for the distribution and use of large scale sequencing data adopted by the larger genome sequencing community and urge users of this data to follow them. it is our intention to publish the work of this project in a timely fashion and we welcome collaborative interaction on the project and analysis. (http://www.genome.gov/page.cfm?pageID=10506376) Meta information: Organism display name: Anaerocellum thermophilum Z-1320, DSM 6725 Culture Collection IDs: DSM 6725 GOLD ID: Gi03121 http://genomesonline.org/GOLD_CARDS/Gi03121.html Sequencing Platforms: Sanger, 454 Phenotypes: Thermoacidophile, Cellulose degrader Diseases: None Habitat: Fresh water, Hot spring Oxygen Requirement: Anaerobe Temperature Range: Thermophile Biotic Relationship: Free living Isolation: Hot spring on the Kamchatka peninsula in Russia. COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..3653 /organism="Anaerocellum thermophilum DSM 6725" /mol_type="genomic DNA" /strain="DSM 6725" /db_xref="taxon:521460" /plasmid="pATHE02" /note="type strain of Anaerocellum thermophilum" gene 1125..2117 /locus_tag="Athe_2777" /db_xref="GeneID:7409742" CDS 1125..2117 /locus_tag="Athe_2777" /inference="similar to AA sequence:KEGG:Cphy_3144" /note="KEGG: cpy:Cphy_3144 hypothetical protein" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_002574610.1" /db_xref="GI:222530722" /db_xref="GeneID:7409742" /translation="MGKPSIIKQVLNEFEKQIRFGESKHEAKREERERCEVTGETWNP ARVEGIFSFSTYREYVKEALEFANWARTEKGCKDLEQARAYVSEYLQSHIDKGYSAWT VKKEAAALAKLYHCRTTDFKVELPARHREEIERSRGYKDHDREFSKERNRDIIIFSKA TGLRRRELERVSSRDIFRGPDGRLYVHVSNGKGGRERDVHVLQKYEREVERIVREREG RDRLFDRVPIRMDVHSYRREYARERYREVEREISRERKLFDRVEDLVRSRLTRLYPDR FREIGERQLTRELTRADGLYHRSDGREFDRLALWEVSNDLGHNRIDVVARHYLD" gene complement(2218..2439) /locus_tag="Athe_2778" /db_xref="GeneID:7409739" CDS complement(2218..2439) /locus_tag="Athe_2778" /inference="similar to AA sequence:KEGG:Csac_2655" /note="KEGG: csc:Csac_2655 hypothetical protein" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_002574611.1" /db_xref="GI:222530723" /db_xref="GeneID:7409739" /translation="MKKLTIEFTREEAMYLLGYFTARAMEGYRFDEFEQGIIKKLADK CNVEFVFENGKILQARYKGNLFYCTTPQE" gene complement(2420..2749) /locus_tag="Athe_2779" /db_xref="GeneID:7409740" CDS complement(2420..2749) /locus_tag="Athe_2779" /inference="ab initio prediction:Prodigal:1.4" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_002574612.1" /db_xref="GI:222530724" /db_xref="GeneID:7409740" /translation="MLTKQELIQRLLEIKQLCFYIEEDEAKQIENIVEDLLSKIRNVK LNQLKRLVHAERKKYPIGTAENLLFHNLYEKLKNVQPDDTERIDRLYKEYLSLVKGVK ESEKANN" gene complement(2743..3192) /locus_tag="Athe_2780" /db_xref="GeneID:7409741" CDS complement(2743..3192) /locus_tag="Athe_2780" /inference="ab initio prediction:Prodigal:1.4" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_002574613.1" /db_xref="GI:222530725" /db_xref="GeneID:7409741" /translation="MKKIGYKKMVDPETGEVQTFILIGHDFEDTDFVKLPFISIKLIM EDKDLAKSAMRILSYIVQHKISFNNYTFALSYEYDIKGNIDMSKKQYHLAIKKLIEKD LLIKIGRGRFMLNPRRIRYGRADQLRKFESEYDKIKKTEKGKGDKEC" ORIGIN 1 aaaaaaatta gagcaggatt tgagcaagag gggcgctgca accccctttg aaaccgcttt 61 cataacccga aaagctcttc acccaccgca tcaatattca tacccgcttg cataattatt 121 catccctgat tgtaccacaa aacccagaaa agtcctggga aaatcatgaa gattcttcca 181 ttttccaatt tccaggaagc gaattgaaat tcccagggat tttttgcctt tttccaggga 241 atgaactgga atttccagcg gtttttccag ggacttaacc acacgaaagg tactttccgc 301 catcaaaggc ttgcaaaccc ataggaaact actggcaggc aaaaccttcc acccagtagc 361 aaatctcttc taccttgttt tgcgagcgtt agcagggagc tgtccgttag ctttgtgtta 421 gcatccttct gttagccttt ggcacgttag tagacttttg ttagttctct cttcttgttt 481 ttatcttcat tgtgattgtt ttccagaaga ctgcttgaca ttttgttatt gtagtgttat 541 gatctatttc agaagcgaaa agtaaaccgc ggtagcgggt tactttttgc gtaatatttg 601 gaattgtcag tttggcacgg gaaatggaat cgggcaggta tagggaaagg caatagccgt 661 ggacggcgtt ccgattccgc tttgtttgtg cgttagcaca tgagtcttgc tgttatcacg 721 ctctgctctg tggtagtaca ccgcaagggg tacaccattt ggagttttgc cgcatctgag 781 agtttagcag tttgcaaaac aaagttcgca gttagaaaag ttaggcaagg ggcgatgacg 841 tgcacgtctg acacgcttga cgtgcaccga gccgagagta gagcgtgatg acatagaaaa 901 agcccagtag caggttgtgc gtaccgtgac atgctactgt ggaagacaag taaatagtca 961 aaaaacggtg cgcttactgt tcgtgggctc atgggaatac cgtgagcatt ctggacaggt 1021 tcaacagcct gaaagggcaa aacccccctt gaaagggttt aaatacacac gtcctttttg 1081 tcgttttttg tcggtttaaa tatttatgca ggggatgata gggaatggga aagccgtcca 1141 taatcaagca agtactcaat gaattcgaaa agcagataag gttcggggaa agtaagcatg 1201 aagcaaagag agaagagcgg gagagatgtg aagttactgg agagacgtgg aatcctgctc 1261 gtgtggaagg catttttagt ttctcaacct acagggagta cgttaaggag gcgttagaat 1321 ttgcgaattg ggctcgtact gaaaaggggt gtaaggattt agaacaagca cgggcctatg 1381 tgtctgaata tttgcagtcg catatagaca aagggtatag cgcgtggact gttaagaaag 1441 aagcagcagc cctggcgaaa ctgtatcatt gtcgtacaac tgactttaaa gtagagcttc 1501 ccgcaagaca cagggaagag attgagagaa gcaggggata caaagaccac gatagggagt 1561 ttagcaaaga gaggaataga gacattatca tcttttcaaa agctactggg ctgagaagaa 1621 gggaattgga aagagtgagt tctcgggata tctttcgtgg gcctgacgga agattatatg 1681 tgcacgtgag caacggcaag ggcggtagag aaagggatgt tcatgttttg cagaaatacg 1741 agagagaggt tgagaggata gtcagagagc gggaaggaag agacaggctg tttgacaggg 1801 tccccataag gatggacgta cacagctata ggagggaata tgcaagagag cgttacagag 1861 aagttgagcg tgagataagt cgtgagagaa agcttttcga cagagttgag gatctcgttc 1921 gtagtaggct tacaaggctc tatcctgaca ggtttagaga aattggcgaa agacaactta 1981 ctcgtgaact cacaagagct gatgggcttt atcatcgcag tgatggtagg gagtttgacc 2041 gcctggcatt gtgggaagtt tcaaacgact tgggacataa tcgaattgac gttgttgcaa 2101 gacactatct ggattaagcg aataaaggct caagaaagtg gataaaaaaa cagggggtat 2161 attgatatat cccccctgtt ttttgtgcgt ctacaggacc ttatttgcgt ttcaaggcta 2221 ttcttgtggg gtagtgcagt aaaaaaggtt gcctttgtat cttgcctgta gaatcttgcc 2281 gttttcaaaa acaaattcaa cgttgcactt atcagcaagt ttcttgatta ttccctgctc 2341 aaattcgtcg aacctgtacc cttccattgc tcttgctgtg aaataaccca gaaggtacat 2401 tgcttcttca cgtgtaaact caattgttag ctttttcact ctcttttacc ccctttacga 2461 gactgagata ttctttgtac aatctgtcaa tacgttctgt atcgtcaggt tgtacgtttt 2521 tgagtttttc gtagagatta tgaaacaaca ggttctctgc tgttccaata gggtatttct 2581 ttctttccgc gtggactagc cttttcagtt ggtttagctt tacgttccgg atcttgctca 2641 gtaggtcttc gacgatattt tcaatctgtt ttgcttcgtc ttcttcgatg tagaagcata 2701 gttgtttgat ttcaagtagc ctctgaataa gttcctgctt tgttaacatt ccttgtcacc 2761 cttccccttt tcagtctttt ttattttgtc gtattctgat tcgaactttc tcagttggtc 2821 agctctgccg taacgtatgc gtcgaggatt tagcatgaat cgtccacggc ctattttaat 2881 caacaaatct ttctcgatta gctttttgat tgcaagatgg tattgctttt ttgacatgtc 2941 gatgtttcct ttgatgtcgt attcgtatga aagtgcaaac gtgtagttgt taaacgagat 3001 tttgtgttgt acaatgtatg agagaattcg cattgctgat tttgcaaggt ctttatcttc 3061 cattatgagt ttgatagaaa tgaatggaag tttgacaaag tctgtgtctt caaaatcatg 3121 tccgatcagt atgaaagttt gtacttcgcc tgtttcaggg tcgaccattt ttttgtaccc 3181 tatcttcttc atttgaatcc cccttttttt acctttttcc aattctaaag ccattataat 3241 acacttaaag taacttgtca agtaacttca ggtgtaaaaa attacacttc aggtgtaata 3301 aattacactt tccattcagt attttcaagg ctttgtgggt aactttattc ttatctatgt 3361 atatatcgcc tgcgttagca ggcttgaaaa tttccagtta ggataagcag gaacaacggt 3421 cgctgacgct gaacactgac gaaatagctg acgccccaaa gtccacaaca gtgccaaacc 3481 gataacaaaa acatgctaac gcaaacatag actaacgcac gactgacgtc gtgatgtgtg 3541 tgtgggccta cctacacaca aaaagaacta acaacagctg actaacgtct gaagagctct 3601 aacaacactt tgctaacgct gagctaacgg acagctcaac gttaacaccc gct //