LOCUS       NC_014827               7420 bp    DNA     circular BCT 07-JAN-2011
DEFINITION  Ruminococcus albus 7 plasmid pRUMAL04, complete sequence.
ACCESSION   NC_014827
VERSION     NC_014827.1  GI:317133710
KEYWORDS    .
SOURCE      Ruminococcus albus 7
  ORGANISM  Ruminococcus albus 7
            Bacteria; Firmicutes; Clostridia; Clostridiales; Ruminococcaceae;
            Ruminococcus.
REFERENCE   1  (bases 1 to 7420)
  AUTHORS   Lucas,S., Copeland,A., Lapidus,A., Cheng,J.-F., Bruce,D.,
            Goodwin,L., Pitluck,S., Chertkov,O., Detter,J.C., Han,C.,
            Tapia,R., Land,M., Hauser,L., Kyrpides,N., Ivanova,N.,
            Ovchinnikova,G., Weimer,P., Mead,D. and Woyke,T.
  CONSRTM   US DOE Joint Genome Institute
  TITLE     Complete sequence of plasmid4 of Ruminococcus albus 7
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 7420)
  CONSRTM   US DOE Joint Genome Institute NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (29-DEC-2010) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   3  (bases 1 to 7420)
  AUTHORS   Lucas,S., Copeland,A., Lapidus,A., Cheng,J.-F., Bruce,D.,
            Goodwin,L., Pitluck,S., Chertkov,O., Detter,J.C., Han,C.,
            Tapia,R., Land,M., Hauser,L., Kyrpides,N., Ivanova,N.,
            Ovchinnikova,G., Weimer,P., Mead,D. and Woyke,T.
  CONSRTM   US DOE Joint Genome Institute NCBI Genome Project US DOE Joint
            Genome Institute
  TITLE     Direct Submission
  JOURNAL   Submitted (17-DEC-2010) US DOE Joint Genome Institute, 2800
            Mitchell Drive B310, Walnut Creek, CA 94598-1698, USA
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence is identical to CP002407.
            URL -- http://www.jgi.doe.gov
            JGI Project ID: 4085694
            Source DNA and organism available from David Mead
            (dmead@lucigen.com)
            Contacts: David Mead (dmead@lucigen.com)
                      Tanja Woyke (microbe@cuba.jgi-psf.org)
            Annotation done by JGI-ORNL and JGI-PGF
            Finishing done by JGI-LANL
            The JGI and collaborators endorse the principles for the
            distribution and use of large scale sequencing data adopted by
            the larger genome sequencing community and urge users of this
            data to follow them. It is our intention to publish the work of
            this project in a timely fashion and we welcome collaborative
            interaction on the project and analysis.
            (http://www.genome.gov/page.cfm?pageID=10506376).

            ##MIGS-Data-START##
            investigation_type :: bacteria_archaea
            project_name :: Ruminococcus albus 7
            collection_date :: Missing
            lat_lon :: Missing
            depth :: Missing
            alt_elev :: Missing
            country :: Missing
            environment :: Host
            num_replicons :: 5
            ref_biomaterial :: Missing
            biotic_relationship :: Free living
            trophic_level :: Missing
            rel_to_oxygen :: Anaerobe
            isol_growth_condt :: Missing
            sequencing_meth :: WGS
            assembly :: Newbler v. 2.3 (pre-release)
            finishing_strategy :: Finished
            GOLD Stamp ID :: Gi07008
            Gene Calling Method :: JGI-ORNL pipeline
            Cell Shape :: Coccus-shaped
            Motility :: Nonmotile
            Sporulation :: Nonsporulating
            Temperature Range :: Mesophile
            Temperature Optimum :: 40C
            Gram Staining :: Gram+
            Diseases :: None
            ##MIGS-Data-END##

            ##Genome-Assembly-Data-START##
            Finishing Goal :: Finished
            Current Finishing Status :: Finished
            Assembly Method :: Newbler v. 2.3
            Genome Coverage :: 30x
            Sequencing Technology :: 454, Illumina
            ##Genome-Assembly-Data-END##
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..7420
                     /mol_type="genomic DNA"
                     /db_xref="taxon:697329"
                     /strain="7"
                     /plasmid="pRUMAL04"
                     /organism="Ruminococcus albus 7"
     gene            14..376
                     /locus_tag="Rumal_4008"
                     /db_xref="GeneID:10093823"
     CDS             14..376
                     /locus_tag="Rumal_4008"
                     /protein_id="YP_004090315.1"
                     /transl_table=11
                     /note="PFAM: protein of unknown function DUF891; KEGG:
                     afn:Acfer_1073 protein of unknown function DUF891"
                     /db_xref="GI:317133711"
                     /db_xref="InterPro:IPR009241"
                     /db_xref="GeneID:10093823"
                     /codon_start=1
                     /inference="protein motif:PFAM:PF05973"
                     /translation="MQEFEIIFYEKADGTEPAKDFILSLDTKMRAKMFRTVELLQKNG
                     NRLREPESKPIEDGIMELRAKVGSDISRVLYFFVVGHKVVLTNGFIKKTQKTPRSEIE
                     RAKQYRADYLSRKENEND"
                     /product="protein of unknown function DUF891"
     gene            369..641
                     /locus_tag="Rumal_4009"
                     /db_xref="GeneID:10093815"
     CDS             369..641
                     /locus_tag="Rumal_4009"
                     /protein_id="YP_004090316.1"
                     /transl_table=11
                     /note="KEGG: lba:Lebu_0841 transcriptional regulator, XRE
                     family; PFAM: helix-turn-helix domain protein; SMART:
                     helix-turn-helix domain protein"
                     /db_xref="GI:317133712"
                     /db_xref="GO:0043565"
                     /db_xref="InterPro:IPR001387"
                     /db_xref="GeneID:10093815"
                     /codon_start=1
                     /inference="protein motif:PFAM:PF01381"
                     /translation="MTNFNELLAEQMKDDEFRREYEALEPEFTIMQAMIDARNSSGLT
                     QKQLSDRSGIAQGDISKLENGNANPSIRTLQRLANAMGKKLRIEFL"
                     /product="helix-turn-helix domain protein"
     gene            complement(707..991)
                     /locus_tag="Rumal_4010"
                     /db_xref="GeneID:10093816"
     CDS             complement(707..991)
                     /locus_tag="Rumal_4010"
                     /protein_id="YP_004090317.1"
                     /transl_table=11
                     /note="KEGG: ere:EUBREC_3497 hypothetical protein"
                     /db_xref="GI:317133713"
                     /db_xref="GeneID:10093816"
                     /codon_start=1
                     /inference="similar to AA sequence:KEGG:EUBREC_3497"
                     /translation="MSKSKTYEERIAELQEKQKQLKAQEKKLKAQKSADERKKRTRHL
                     IEIGGAVYSVLGREYADGDIERLVAFLKGQNQRGGFFNKAMNDFPDNKPE"
                     /product="hypothetical protein"
     gene            complement(978..1391)
                     /locus_tag="Rumal_4011"
                     /db_xref="GeneID:10093817"
     CDS             complement(978..1391)
                     /locus_tag="Rumal_4011"
                     /protein_id="YP_004090318.1"
                     /transl_table=11
                     /note="KEGG: cpo:COPRO5265_1220 excinuclease ABC, C
                     subunit; PFAM: Excinuclease ABC C subunit domain protein;
                     SMART: Excinuclease ABC C subunit domain protein"
                     /db_xref="GI:317133714"
                     /db_xref="GO:0004518"
                     /db_xref="InterPro:IPR000305"
                     /db_xref="GeneID:10093817"
                     /codon_start=1
                     /inference="protein motif:PFAM:PF01541"
                     /translation="MLRDFQLEALKEWYRIKALGLDDIIYEPNCILYDRSPYIYLFYN
                     NNNIVIYVGHTQNIANRMSKHIEESPFWEEVSYIMCAETPNLNSLADYERYYIQKYKP
                     KYNKRGFKKPPYLPDMVALDFIEYKGKEVLNYVKK"
                     /product="Excinuclease ABC C subunit domain protein"
     gene            1621..3667
                     /locus_tag="Rumal_4012"
                     /db_xref="GeneID:10093818"
                     /pseudo
     gene            4147..5556
                     /locus_tag="Rumal_4013"
                     /db_xref="GeneID:10093819"
     CDS             4147..5556
                     /locus_tag="Rumal_4013"
                     /protein_id="YP_004090319.1"
                     /transl_table=11
                     /note="KEGG: smf:Smon_1511 hypothetical protein"
                     /db_xref="GI:317133715"
                     /db_xref="GeneID:10093819"
                     /codon_start=1
                     /inference="similar to AA sequence:KEGG:Smon_1511"
                     /translation="MSDNVLNNFRECYELLSNVYDQVNGEDFYRYIFPDNENSGELSS
                     DFSHPNAIYLYHDEQDEGAKRRLRRRIMLNDTWASDYMDYVEGNGMTLCSGLTYHSRS
                     NKLQYAQTMNALIFDLDGVGRSELEHVLERTTGSGDVYRSIPKPTFIVLSGRGLHLYY
                     VFDEPIALYPNIKLQLKSLKHDLTFRIWEYKGTSQVESIQYQSIGQGFRMVGSTNSKY
                     GNTVTAFKIGGKVSLDFLNSYAIKPENRVDVNRPYRPSKIDRATAAELYPEWYQKVVV
                     EGNKRPNKWNIGYRKKKQVKDYALYEWWKRRVTEIEGGHRYFYLMCLVIYACKCDVPK
                     KQLKVDIQACFEVLRAYKHNNELTQDDVDSAMECYSKDYYNFTIADIEILTDVRIERN
                     KRNYQKQEWHLEDIRSKKANMKRRGQPFRNAEGRPSKQQIVLEWQQLHPMGRKVDCIR
                     DTGLSKPTVLKWWRKEFEA"
                     /product="hypothetical protein"
     gene            5553..6104
                     /locus_tag="Rumal_4014"
                     /db_xref="GeneID:10093820"
     CDS             5553..6104
                     /locus_tag="Rumal_4014"
                     /protein_id="YP_004090320.1"
                     /transl_table=11
                     /note="KEGG: llc:LACR_B8 hypothetical protein"
                     /db_xref="GI:317133716"
                     /db_xref="GeneID:10093820"
                     /codon_start=1
                     /inference="similar to AA sequence:KEGG:LACR_B8"
                     /translation="MSKTIKQIADELGVSKDRVKYLVKKLPSGWVEKRGNITYINADG
                     ERNIYMLEGKKWGKSDEITHIETELDRVISTHLPTEEQKKDMEIERLKARVAELERQL
                     EYERANSEEKQALLKAWNDEQIQQFNRIIEDNKKLLQLIDQEQQLHLRSIESKDKGLA
                     IEEKQPEPKRHWWQRKRKEPTEE"
                     /product="hypothetical protein"
     gene
6101..6307 /locus_tag="Rumal_4015" /db_xref="GeneID:10093821" CDS 6101..6307 /locus_tag="Rumal_4015" /protein_id="YP_004090321.1" /transl_table=11 /note="KEGG: shm:Shewmr7_4042 hypothetical protein" /db_xref="GI:317133717" /db_xref="GeneID:10093821" /codon_start=1 /inference="similar to AA sequence:KEGG:Shewmr7_4042" /translation="MNEPTKKKKGGARAGAGAKPKYGCKTVTMRVPADMVDEIRAFIC KKKHIEPKPEPEQELEGQVSFSDI" /product="hypothetical protein" gene 6320..6718 /locus_tag="Rumal_4016" /db_xref="GeneID:10093822" CDS 6320..6718 /locus_tag="Rumal_4016" /protein_id="YP_004090322.1" /transl_table=11 /note="KEGG: spj:MGAS2096_Spy1123 hypothetical protein" /db_xref="GI:317133718" /db_xref="GeneID:10093822" /codon_start=1 /inference="similar to AA sequence:KEGG:MGAS2096_Spy1123" /translation="MDIKFYNEEHQKAFYSICKRMKHLDCYHFSLAYLLSLDKVLREH TDEVFDFKEDCIKREGLHKGFQTGTSMKTTRLAFNLWNGCYDDGETYTNKDGYETELP SSYYSPDQIFCCKDYAPYYWQAIRIRFELN" /product="hypothetical protein" ORIGIN 1 gtggagaatg tatatgcagg aatttgaaat aatattttat gaaaaggctg atgggacaga 61 gccggcaaag gattttatat taagtcttga tacaaagatg agagctaaaa tgttcagaac 121 agtagagctt cttcaaaaaa acggcaacag attaagagaa ccggaatcaa agcctataga 181 agacggtata atggagttaa gagcaaaagt aggctcggat atatctaggg tcttgtattt 241 ctttgtagtc ggacataaag ttgttttgac aaatggcttt ataaagaaaa cacaaaaaac 301 accaaggtca gagattgaaa gagccaaaca gtacagagct gattatctaa gcagaaagga 361 gaatgaaaat gactaatttc aatgaacttc tcgcagagca gatgaaagat gatgaatttc 421 gtagagaata tgaagcactt gaaccagaat ttacaattat gcaggctatg atagatgctc 481 gtaattcaag cggacttacg cagaaacagc tttcagatcg ttcgggtata gcacagggtg 541 atataagcaa gcttgaaaac ggaaatgcta atccgtcaat aagaacttta caacggcttg 601 caaatgctat gggcaaaaaa cttaggatag agtttttata ataacaatta taaagacgga 661 ctaccgcagc agcggacagt ccgtcttttt cttttttgtc agcccattat tcgggcttat 721 tgtcgggaaa atcattcata gccttgttaa agaagccgcc cctctggttc tgacctttga 781 ggaacgcaac aagacgttcg atatcaccat ctgcatactc tctgccgagt acgctgtata 841 ccgcaccgcc gatctctatc aagtgccgtg tgcgtttttt tctttcgtcc 
gctgacttct 901 gtgctttcag ctttttctct tgtgctttga gctgtttctg cttttcttgc agctcagcta 961 ttcgttcttc atatgtctta ctttttgaca taatttaata cctctttccc tttgtattca 1021 ataaaatcca gcgcaaccat atcaggcaaa tagggtggct ttttaaaacc tcttttgtta 1081 tatttaggct tgtatttctg tatgtaatat ctctcataat ctgcaagcga atttagattg 1141 ggtgtttctg cgcacataat ataagacact tcttcccaaa atggactttc ttcaatgtgc 1201 tttgacattc tatttgctat gttttgagtg tgtccaacgt agatcactat attattgtta 1261 ttatagaaca agtatatgta aggacttcta tcatataaaa tacaattagg ctcataaata 1321 atgtcatcaa gcccaagtgc ctttattcga taccattctt ttaaagcttc taattgaaaa 1381 tctctaagca tatacatcac cagtataatt ttatcatata tacattgatt tgtcaagtag 1441 gtattgacaa actaagcgaa atttgatata atcatttctt aagagcgcag cgaatagaaa 1501 tgcgcactta tacaccacat tcgctacgct cacatggtgt atttaagtgc gctctccgag 1561 ggcaagagcg ttcccctctt gacttccctt tttatttttc agagaaagga gtgttgccaa 1621 atggcaatct atcattgcag tattaagata ggcagcagag caaacggaca aagcgcaata 1681 gccgcagcag cgtatcgagc aggtgacaat ctgaaagata aagaaactgg tttagtgtcc 1741 gactactccc gaaagggcgg cgttgtgttt tccgaaatct cgttgtgcca aaacgctcct 1801 gccgaatatg ccgacagggc tactctttgg aacgcagttc acgaaatcga gaaagccaaa 1861 aactcacagt tgtggcgtga gtttgaggtg gcacttccgc aggagtttag ccgagccgaa 1921 cagatagaca cagttcgtgg atttgtcaaa ggcttgacca agcaaggaat gtgtgtggat 1981 tggagcttgc acgataaaga ggacggaaac ccacacgctc atatcatggc tactatgcgg 2041 agcataaccg agggcggcaa atgggcaccc aagagccgca aggtgtatga ccttgacgag 2101 aacggcgagc gcatttttca aaaagtcgat aagtcggggc gcaagcagta caagaaccac 2161 aaagaagatt acaacgattg gaacaagaaa gagcgtgtcg aggaatggag agctgcgtgg 2221 gctgcttgct gcaatgaaag gcttgccgag cgtgatcgga tagatcaccg cagctataaa 2281 cggcagggca tagagcaaga gccgactatc catgagggat atgcagcaag gaaaatcgca 2341 gccgagggta agccctcgga gcgtgttcgc ataaatgaag aaatccgaga gagaaataat 2401 atgctgaaac agcttgccga gcagctgcac cacatcgaac agcagcttgc cgagcttatc 2461 aaggagaaag gaagtgcagc gatcaatgca ggaaaacaac gaattgcaga cctactcgcc 2521 cgaagaagtc gagcagttga tagcgatgac cgaggacttg cagacggaga acgaacaaca 
2581 gagagccgaa ctgaacgaac tacggcaaca gaagcagacc gccttatcag agaatcagaa 2641 gctgaaagac gagctgcact ccgcaatgtt ggacttgaag aaagtggaag gacagcacaa 2701 ggaaacgaga aaaaagctga acactctgct acacggttac aacccacaga acaaccagac 2761 acagcagcaa tcttcggagc tgcggacagc cttatccaag atacagacac taaccaagaa 2821 gctgtcagag cagcaacggc agacacagac agtactatcc gacaatcaga aactaaggtc 2881 acaagtgcag cagctcaccg ccaagaacga gagcttgcag agcagcgacg agcagctgaa 2941 gaacgccgaa gagctgaaga agcagagcgc aaggcaagag cagcaagcga aagagcgaga 3001 aagacgagct acgatagagg cagatagggc tagacgagaa gcagccgcag caatcgcaag 3061 agccgagcaa aaagaagaag cggcgataat agccaagaac gcagcggaac ataccaagca 3121 gcagcaggaa agtatcatca aagagaaagc agaagcctta aacaggcaat atcaggtgga 3181 atggaacgtt attggtgttg cccttatcgt gtatagctta tttgctaccg tgttcaccgc 3241 cattaagtca aaacgctgta tgtctgatat taccgcagca ggaaagctaa tagttaaaat 3301 cgcaaaggga atagttcatg tgataactgc ccttgcaagc cgtgtcggaa gtgtcggagc 3361 gaaaatccca cagcccatag tgtccacgat cgtgagctat ttgctaatgg cacttattac 3421 agcgatatgt gttgttgtta tccttgcacc cttgtatttg ggcattaaag ctatttcaaa 3481 ctgctatacg ctgcattgtt gggacgaggt aagtccattg gttgccctgg tctgccttgc 3541 agttcttgtg tggggagctg aactcatgcc gttgaacatc gtgctgctga tgatactttc 3601 acacgcagca tacatcgttg tccgttggta catagatggt tggagagagg caagaggact 3661 agcataatgc cctcagaagc caccgtaagc cacacagagc cacgctaatc ttaataatat 3721 agaatcaccc cttacactct tgaaacgcca cacagagccg ctcagaacgt gcgaggggca 3781 tttttacagc ctttttctcg cggagataga aatgtattgg tgaagttggt actataaaca 3841 tttaattgaa agcgtcacaa aatcagtatt gacaaatgga gatttatgtg ctatactgaa 3901 aaaaaggtaa aaaaacttat tgataatcta agccccttat accgtaggta ttctgagggg 3961 gctgacagag taggtaaaaa aacttattga taatctaagc cccttatacc gtaggtattg 4021 agcgtgagcg tgtgtggggc ggttcttctt tcttttggtt cttttctttc ttctccaaga 4081 aaagaagaaa gaaaagaacg ttacccgcag taatgatgtt atatccgcat aagtgaggtg 4141 ttctgcttgt ccgataatgt attgaataac ttccgtgagt gctatgagct gctttccaac 4201 gtgtacgatc aggtcaatgg cgaggacttc tatcgttata tcttccccga caacgagaac 4261 
tctggtgagc tgtccagcga tttttctcac cctaacgcta tttatctgta tcacgatgaa 4321 caggacgagg gtgcaaaaag gcgtttacgc cgcaggataa tgctcaatga tacgtgggct 4381 tctgattata tggactatgt tgaggggaac ggcatgactt tgtgcagcgg tctgacttat 4441 catagccgca gcaacaagct ccagtatgct cagaccatga acgcactcat ttttgacctt 4501 gacggtgtcg ggcgttctga attggagcac gttttggagc gtacaacagg ttcgggcgat 4561 gtttaccgca gtattcctaa gccgactttc attgttttga gtgggcgagg gcttcacttg 4621 tactatgttt ttgatgagcc tatcgccctt tatcccaaca taaagctgca attgaagtcc 4681 ttgaaacacg atctcacatt cagaatatgg gagtataaag gcacttcgca ggtcgagagt 4741 attcagtatc agagcatcgg tcaaggtttc cgtatggtcg gcagcactaa cagcaagtat 4801 gggaataccg tcactgcctt taaaatcggc ggtaaagtca gccttgattt cctcaatagc 4861 tatgctatta agcccgaaaa tcgggttgat gttaaccgtc cgtaccgccc gagcaaaatt 4921 gaccgagcaa ccgcagcgga gctttatccc gaatggtatc agaaagttgt tgttgagggc 4981 aacaaacgcc ctaacaagtg gaatatcggc tatcgtaaga agaagcaagt caaagactat 5041 gctctttatg agtggtggaa acgccgtgtg accgagattg agggcggaca ccgttatttt 5101 taccttatgt gccttgtaat atacgcctgc aagtgtgatg taccgaaaaa acagctcaaa 5161 gtcgatattc aggcgtgttt tgaagtcctt cgtgcttata aacacaataa cgagctgaca 5221 caagatgatg ttgatagtgc tatggagtgc tattcaaagg attattataa tttcacgata 5281 gctgatatcg agatcttgac cgatgttcgt atcgagcgta ataagcggaa ctatcaaaag 5341 caggagtggc atttggaaga tatccgaagt aaaaaggcaa atatgaaacg tagaggtcag 5401 ccttttagaa atgccgaggg cagaccaagt aagcagcaga tcgtacttga atggcagcag 5461 ctgcacccta tggggcgcaa agttgattgt atcagagata ctggtttgtc aaagcctacc 5521 gttctcaaat ggtggcgaaa ggagtttgaa gcatgagcaa aaccataaag cagatcgctg 5581 acgagctggg cgtgagcaaa gaccgagtaa agtatctagt gaaaaaatta cccagtgggt 5641 gggtagaaaa acggggaaat attacctaca taaatgctga tggtgaacgg aatatctata 5701 tgctagaggg aaaaaagtgg ggaaaatctg acgaaattac ccatattgaa accgaacttg 5761 atcgggtaat ttctacccac ttacccaccg aagaacagaa gaaagatatg gaaattgaga 5821 ggttaaaagc tagggttgca gaacttgaac gccaattaga atatgaaagg gctaattccg 5881 aagagaaaca agctcttttg aaagcgtgga atgatgaaca gatacagcaa tttaatcgca 5941 taattgagga 
taataaaaaa ttattacagc ttatagacca agaacagcag ctgcatttac 6001 gttcgattga gagtaaagat aaaggtcttg ctatcgagga aaagcagccc gaacctaagc 6061 ggcattggtg gcagcgtaaa cgaaaggagc cgaccgagga atgaacgagc cgacaaaaaa 6121 gaagaaaggc ggagcgaggg caggagcagg cgcaaagcca aaatacggct gcaagaccgt 6181 gacaatgcgt gttcctgctg atatggtgga cgaaatcagg gcttttatct gcaagaaaaa 6241 acatatcgag ccgaagcctg agcctgaaca agagctagag gggcaagttt cctttagcga 6301 tatataagga ggactaacaa tggatatcaa gttttacaat gaagaacacc aaaaggcatt 6361 ttacagcatt tgtaagcgaa tgaaacacct tgactgctac catttttcgc ttgcgtatct 6421 tctttctctc gataaggtac ttcgtgaaca taccgatgaa gtattcgact tcaaagagga 6481 ctgtataaaa cgtgaggggc tgcacaaagg ctttcagaca ggcacttcga tgaaaacaac 6541 acgccttgcc tttaaccttt ggaatggctg ctatgatgac ggtgaaacct acacgaataa 6601 ggacggctat gaaacggagc tgccgagcag ctactactcg cccgatcaga tcttctgctg 6661 caaggactat gcaccgtact actggcaggc tattcggatc cgttttgaat tgaattagta 6721 tagaaaaaag gcgaaacaat aacgtttcgc cttttgatca ttgttcgata ttactttttt 6781 tgactaaaat ccaatcacaa ggaaatatat gttttttgtc tgctttataa actagtatta 6841 aaacaaatat ccatatatgt ttggttttta ctacgttatt atttcctaaa gagtccctta 6901 tgatcattgc agagggtcca tccttagaat taaatatatc ttgctgttgt atgtgtttta 6961 gaatataatt tttttcttgt aatttcatta ttaaagtttt accgttgagc tggcgtttcc 7021 taagtaagcc tttccgcctg caatagttgt aatgtagacg tttttccaac caacctttac 7081 tttcttggtg tttttcggaa ttgaaaatga ttttgtagag ccatttgcct tgttatatct 7141 tccattaagg tctttgtatt catctattct gcctgaaaaa taacttttgg gttttaatat 7201 tgatgtgtta ctgcagtata ttgttccaga cacattggtt atctgtgttc ctgttgcgct 7261 ccaatagaga tgagctttat ctattatatc acttttttga gtgattgtaa aaacagaaat 7321 aacactttgt atttgtaatt tgtgaatttt tctccaatag cattgacaat ataggattta 7381 tcctatataa ttaattatag gataaatcct ataatttgat //