LOCUS NC_009794 5601 bp DNA circular BCT 14-SEP-2007 DEFINITION Citrobacter koseri ATCC BAA-895 plasmid pCKO2, complete sequence. ACCESSION NC_009794 VERSION NC_009794.1 GI:157149316 DBLINK Project:12716 KEYWORDS . SOURCE Citrobacter koseri ATCC BAA-895 ORGANISM Citrobacter koseri ATCC BAA-895 Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales; Enterobacteriaceae; Citrobacter. REFERENCE 1 (bases 1 to 5601) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (13-SEP-2007) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 2 (bases 1 to 5601) AUTHORS McClelland,M., Sanderson,E.K., Porwollik,S., Spieth,J., Clifton,W.S., Latreille,P., Courtney,L., Pepin,K., Bhonagiri,V., Nash,W., Johnson,M., Thiruvilangam,P. and Wilson,R. CONSRTM The Citrobacter koseri Genome Sequencing Project TITLE Direct Submission JOURNAL Submitted (29-AUG-2007) Genetics, Genome Sequencing Center, 4444 Forest Park Parkway, St. Louis, MO 63108, USA COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. The reference sequence was derived from CP000824. Citrobacter (diversus) koseri--Citrobacter cells are isolated from water, sewage, soils, and food, as well as from the feces of man and other animals, where they may be normal inhabitants. They can be found in urine, sputum, and other clinical specimens. They can sometimes be opportunistic pathogens particularly in immunocompromised patients in hospitals or in infants (Pepperell et al., Antimicrob Agents Chemother. 2002 Nov;46(11):3555-60. and references therein). The strain of Citrobacter koseri being sequenced, strain CDC 4225-83, was isolated in 1983 in Maryland, where it caused neonatal meningitis. It was provided by Caroline Mohr and Melissa Campbell of CDC. The strain is available from the American Type Culture Collection as ATCC BAA-895 or from the Salmonella Genetic Stock Centre as SGSC4696. The genome was sequenced to 8X coverage, using plasmid and fosmid libraries and was finished to an error rate of less than 1 per 10,000 bases. Automated annotation was performed and manual annotation will continue in the labs of Michael McClelland and Kenneth Sanderson. The National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH) has funded this project. Coding sequences below are predicted using GeneMark v3.3 and Glimmer2 v2.13.Intergenic regions not spanned by GeneMark and Glimmer2 were blasted against NCBI's non-redundant (NR) database and predictions generated based on protein alignments. RNA genes were determined usingtRNAscan-SE 1.23 or Rfam v8.0. This sequence was finished as follows unless otherwise noted: all regions were double stranded, sequenced with an alternate chemistries or covered by high quality data (i.e., phred quality >=30); an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one m13 subclone. COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..5601 /organism="Citrobacter koseri ATCC BAA-895" /mol_type="genomic DNA" /strain="ATCC BAA-895" /db_xref="ATCC:BAA-895" /db_xref="taxon:290338" /plasmid="pCKO2" gene 3..179 /locus_tag="CKO_pCKO2p07156" /db_xref="GeneID:5585604" CDS 3..179 /locus_tag="CKO_pCKO2p07156" /note="Psort location: nuclear, score: 23" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_001456634.1" /db_xref="GI:157149317" /db_xref="GeneID:5585604" /translation="MQKPKKKPLVVKKRSWHPAVMANLMRQAQEEDYRREQMAELERS ARSHFADIGGEDGG" gene 169..513 /locus_tag="CKO_pCKO2p07157" /db_xref="GeneID:5585605" CDS 169..513 /locus_tag="CKO_pCKO2p07157" /inference="protein motif:HMMPfam:IPR008687" /inference="similar to AA sequence:INSD:AAS97970.1" /note="Psort location: cytoplasmic, score: 23" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_001456635.1" /db_xref="GI:157149318" /db_xref="InterPro:IPR008687" /db_xref="GeneID:5585605" /translation="MAVKRTRFLGIRVTDGEYQQLLERCNGRQLAVWMRETCLDTRPA RSLRLPSIDPVLLRQLAGMGNNLNQIARKINGGQWSGADRVQVVAALMAIDAGLERLR HAVLEKGADDDR" gene 503..2038 /locus_tag="CKO_pCKO2p07158" /db_xref="GeneID:5585606" CDS 503..2038 /locus_tag="CKO_pCKO2p07158" /inference="protein motif:HMMPfam:IPR005094" /inference="similar to AA sequence:INSD:" /note="KEGG: reh:H16_A2580 3.7e-07 cafA2; ribonuclease G and E; COG: COG0419 ATPase involved in DNA repair; Psort location: nuclear, score: 23" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_001456636.1" /db_xref="GI:157149319" /db_xref="InterPro:IPR005094" /db_xref="GeneID:5585606" /translation="MIVKFHPRGRGGGAGPVDYLLGKDRQRDGAIVLQGKPEEVRELI DASPYAKKYTSGVLSFAEQDLPPGQREKLMASFERVLMPGLDKDQYSVLWVEHRDKGR LELNFLVPNTELLTGRRLQPYYDRADRPRIDAWQTVVNGRLGLHDPNAPENRRALVTP SSLPKTKQEAAEAITRGLLALASSGELKTRQDVTEALESAGFEVVRTTKSSISIADPD GGRNIRLKGAIYEQSFNAGEGLRAEIESAAAEYRRNAESRIQRAREVCQSGTERKREE NQRRHPRPRAGYERGHAVEPPERDAYGRADMADHHDGVRTAVRQPERGVVVSGEPDSV KSGRNRPAERSAVEAEREDLGRDVPGRQQRTFSGAAEGDGRRQDAELDGRERAVEAER AETGEGVSADDRAGKTVAERIRAATAGLLEKAGRVGERLRGMAENVRAYATGERGAER ARDALESAGGTLERATAAFEPVVQRHEMAVAAERAHEQRQHEKELTEARSRQRSYDGP SLG" gene complement(2296..2496) /locus_tag="CKO_pCKO2p07159" /db_xref="GeneID:5585607" CDS complement(2296..2496) /locus_tag="CKO_pCKO2p07159" /inference="similar to AA sequence:INSD:AAZ04438.1" /note="Psort location: nuclear, score: 23; ORF located using Blastx" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_001456637.1" /db_xref="GI:157149320" /db_xref="GeneID:5585607" /translation="MIIRNAVNISSLRAFLLTIERITIMGLKPGPKRIADSTGEPDKR QRDNKKTPGNTDKLKPSKSSKK" gene 2587..2811 /locus_tag="CKO_pCKO2p07160" /db_xref="GeneID:5585608" CDS 2587..2811 /locus_tag="CKO_pCKO2p07160" /inference="protein motif:HMMPfam:IPR002145" /inference="protein motif:superfamily:IPR010985" /inference="similar to AA sequence:INSD:ABN63943.1" /note="COG: COG4710 Predicted DNA-binding protein with an HTH domain; Psort location: nuclear, score: 23" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_001456638.1" /db_xref="GI:157149321" /db_xref="InterPro:IPR002145" /db_xref="InterPro:IPR010985" /db_xref="GeneID:5585608" /translation="MLAIRLSDEIESRLDSLAKQTGRTKTFYAREAILAHLEDLEDYY LSAETAARVRRGDEAVHSSEDVRKSLGLDD" gene 2846..3061 /locus_tag="CKO_pCKO2p07161" /db_xref="GeneID:5585609" CDS 2846..3061 /locus_tag="CKO_pCKO2p07161" /inference="protein motif:HMMPfam:IPR007712" /inference="similar to AA sequence:REFSEQ:YP_001041697.1" /note="COG: COG2026 Cytotoxic translational repressor of toxin-antitoxin stability system; Psort location: cytoplasmic, score: 23" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_001456639.1" /db_xref="GI:157149322" /db_xref="InterPro:IPR007712" /db_xref="GeneID:5585609" /translation="MDKQNARRIVDFMSLRIAVAADPRQSGKPLKGELGEFWRYRVGD YRVLCEIRDDELVILAATIGHRREVYD" gene 3033..3182 /locus_tag="CKO_pCKO2p07162" /db_xref="GeneID:5585610" CDS 3033..3182 /locus_tag="CKO_pCKO2p07162" /note="Psort location: cytoplasmic, score: 23" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_001456640.1" /db_xref="GI:157149323" /db_xref="GeneID:5585610" /translation="MDIAAKFTTEARYAGFFDADNRYYGKYSTNYVFYCLSVLVNQQV VLTIM" gene complement(3131..3481) /locus_tag="CKO_pCKO2p07163" /db_xref="GeneID:5585611" CDS complement(3131..3481) /locus_tag="CKO_pCKO2p07163" /note="Psort location: endoplasmic reticulum, score: 9" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_001456641.1" /db_xref="GI:157149324" /db_xref="GeneID:5585611" /translation="MYWLIAFLAFIFPGRSFADGYTAYYRCNGTQEASLTVKDNKLSM VVGLSKFNRQLDTYRFKEKGIEVVMQRAASDDNSEAIMSFIMPDVNNMTLLVKNNGYI IVKTTCWLTKTDKQ" gene complement(3547..4092) /locus_tag="CKO_pCKO2p07164" /db_xref="GeneID:5585612" CDS complement(3547..4092) /locus_tag="CKO_pCKO2p07164" /note="Psort location: cytoplasmic, score: 23" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_001456642.1" /db_xref="GI:157149325" /db_xref="GeneID:5585612" /translation="MFFKRHLTEKEIYKAALPIVRLQGRVNTSLRFLSKYSDVNKNGL IALHLNLLHFISYNIHGVESLKITQAGINKAVQQSPHADEIKNIAQAIGRKGPEYIEG LIRNPWGKKNDNPLIFIMSIEESIKETLNSGYMLYRVAPITEGEAAAFMDVLKQLLLF EDNFRLFGRDLVYKIEKLKIR" gene complement(4118..4501) /locus_tag="CKO_pCKO2p07165" /db_xref="GeneID:5585613" CDS complement(4118..4501) /locus_tag="CKO_pCKO2p07165" /inference="protein motif:superfamily:IPR010985" /note="Psort location: nuclear, score: 23" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_001456643.1" /db_xref="GI:157149326" /db_xref="InterPro:IPR010985" /db_xref="GeneID:5585613" /translation="MRNGSFVRNKKRNSEGGGVMRERTNIMLDIGLKQSLQRLSNKTK MSLSDLINQRLSGSLSVEERIGEKPEWMRNLDPYLANHFSNIELPLPSPLKEYQEKMD EIDSIINEMMVLKEYFRAKFYDEIK" gene 4513..4668 /locus_tag="CKO_pCKO2p07166" /db_xref="GeneID:5585614" CDS 4513..4668 /locus_tag="CKO_pCKO2p07166" /note="Psort location: extracellular, including cell wall, score: 9" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_001456644.1" /db_xref="GI:157149327" /db_xref="GeneID:5585614" /translation="MKKEGGSGLYMMCSALRASVAGVFPWDGWSSGDNRTDDHLKDLL EILFFCA" gene complement(4684..4785) /locus_tag="CKO_pCKO2p07167" /db_xref="GeneID:5585615" misc_RNA complement(4684..4785) /locus_tag="CKO_pCKO2p07167" /product="RNAI" /inference="nucleotide motif:Rfam:RF00106" /note="Rfam score 58.43" /db_xref="GeneID:5585615" gene complement(4829..5065) /locus_tag="CKO_pCKO2p07169" /db_xref="GeneID:5585616" CDS complement(4829..5065) /locus_tag="CKO_pCKO2p07169" /inference="similar to AA sequence:INSD:" /note="Psort location: mitochondrial, score: 23; ORF located using Blastx" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_001456645.1" /db_xref="GI:157149329" /db_xref="GeneID:5585616" /translation="MFRLFPFREAWRFLIAHAAGISARFRSFAPSWAVCTNPPFSPTA APFPVTIILSPTRTDTQKRHWQQPLVTGYIRERY" gene 4874..5173 /locus_tag="CKO_pCKO2p07168" /db_xref="GeneID:5585617" CDS 4874..5173 /locus_tag="CKO_pCKO2p07168" /inference="similar to AA sequence:INSD:" /note="Psort location: nuclear, score: 23; ORF located using Blastx" /codon_start=1 /transl_table=11 /product="hypothetical protein" /protein_id="YP_001456646.1" /db_xref="GI:157149328" /db_xref="GeneID:5585617" /translation="MAFLRVRPGWTQDDSYRKGRSSRAERGVRAYSPAWSERPKPSRD TSSVSYEKAPRFPKGKKAEQVSGKRQGRNRRAHEGAAGEKSPASLSPVGFRPPLT" ORIGIN 1 taatgcagaa gccgaagaag aagccgctcg tcgtaaaaaa aagatcctgg catccggcag 61 tgatggcaaa tctgatgcgt caggcacagg aggaagatta ccgccgggag cagatggcag 121 agcttgagcg ttcagcccgg tcacattttg cggatatcgg aggagaggat ggcggttaag 181 cgtacccgtt ttttaggtat ccgggtgacc gatggggagt accagcagct gctggagcgc 241 tgcaatggaa ggcaactggc agtgtggatg cgggaaacct gtctcgatac gcgcccggcg 301 cggtcgttgc ggctgccgtc catcgatccg gttttgctgc gccagcttgc cgggatgggg 361 aacaacctca accagatcgc ccggaaaatt aacggcggcc agtggtcagg cgctgaccgg 421 gttcaggtgg tggccgcgct gatggccatc gatgccggac tcgagcggct gcggcatgcc 481 gtgctggaaa agggggcaga cgatgatcgt taagtttcat ccgcgagggc gaggcggcgg 541 tgccgggccg gtggattatc tgctggggaa agaccgccag cgcgacgggg ccatcgtcct 601 gcagggaaag ccggaggagg tccgggagct tatcgacgcc tcgccctacg ccaaaaagta 661 cacctccggc gtcctgtctt ttgctgaaca ggatttaccg cccggccagc gtgaaaagct 721 gatggcgagc ttcgagcggg ttctgatgcc cggactcgat aaagaccagt acagcgtgct 781 gtgggttgaa caccgggaca aggggcggct ggaactgaat tttctggtac cgaacacgga 841 actgctgacc ggcaggcgtc tccagccgta ctacgaccgg gcagaccgtc cgcgcatcga 901 tgcgtggcag accgtggtga acgggcgact ggggctgcac gacccgaacg caccggagaa 961 ccggcgcgcg ctggtgacgc catcttcgtt acccaaaacg aagcaggaag ccgcagaggc 1021 tattacgcgg ggcttactgg ccctcgcctc gtcaggggag cttaaaacgc gtcaggacgt 1081 cactgaggcg ctggaaagcg caggttttga ggtcgtgcgc accacaaaga gcagcatcag 1141 cattgccgac ccggacggag ggcgaaacat ccgacttaag ggagccatct atgaacagtc 1201 ttttaacgct ggcgaaggac ttagagcaga aatcgaaagc gcagcagcag agtaccggcg 1261 aaatgctgaa agccgcattc agcgagcacg agaagtctgt cagagcggaa ctgagcgaaa 1321 gcgggaagag aatcagcgcc gccatccgcg cccacgagca gggtatgagc gcggccatgc 1381 agtcgaaccg cctgagcgtg atgcgtatgg tcgggcggac atggctgacc atcacgatgg 1441 tgtcaggact gctgttcgcc agcctgagcg gggtgttgtg gtatcagggg agcctgatag 1501 cgtcaaatct ggccgaaatc gaccggcaga acgcagcgct gtcgaagctg aacgcgaaga 1561 cctggggcgt gacgtacctg gaagacagca acggacgttt tctggtgctg ccgaagggga 1621 cggccgtaga caggacgcag agctggacgg tcgggaacgg gctgtcgaag cagaacgcgc 1681 tgagactggt gaaggagtaa gtgccgatga ccgagctgga aaaacagttg ctgagcgcat 1741 tagagcagct acagcaggac tactcgaaaa ggctggacga gtgggagagc gccttcgggg 1801 aatggcggaa aatgtcaggg cttatgcaac gggagaacgc ggcgctgagc gagcgcgtga 1861 cgcgcttgag tcagcaggtg gaacgcttga gcgggcaact gcagcgtttg agccggtagt 1921 gcagcgtcat gagatggccg tggccgcaga acgggcgcac gagcaacggc agcacgaaaa 1981 ggaactgacg gaagcgcgtt cccggcagcg aagctatgac ggtccgtcgc tgggctgaaa 2041 tgcaggaaaa cagggttctc atcgggcatt tacgctgaaa ccggatgtca ttcacctggc 2101 tgcatggcta tgcagccagg taaaatttat ctgtcgcgaa cggggcggtt tatcgggtgg 2161 tttgttgccg gttttacctg actgccaccc cgttcgcggc gaacccgtcc ggggcggtgc 2221 gggcaacggg tgttatgttt aatcagcgtg aacggcagat acaaaaaaag ccccggaggg 2281 cttttaacta ctgacttatt tttttgaaga tttgctgggt ttcaatttgt cagtatttcc 2341 cggcgttttt ttattgtcac gctggcgttt atcaggttcc cctgttgaat cggcaattcg 2401 tttcggaccc ggcttaagac ccataatcgt tatcctttca atagttaaga ggaaagccct 2461 caagctagat atattgaccg cgttacggat tatcaagagg ggctaacaag attatttttt 2521 gttcttttag gttgattgtg gtactgttgc tatacatgta tagcaaatgg tgttcgggag 2581 gtttatatgc ttgctatccg gttatctgat gagattgagt cccgtctgga ctcgctggcg 2641 aagcaaaccg gcagaacaaa gacgttttat gcgcgggaag caatactggc gcatctggaa 2701 gacctggagg attattatct ttcagcagaa actgctgcac gcgttcgccg tggtgatgaa 2761 gcagtgcatt cgtctgaaga cgtgaggaag tcacttggtc tggacgatta actattccga 2821 tcgggcgctc aaatcgttac gcaagatgga caaacagaac gcacgacgga ttgtggattt 2881 tatgagttta cgcattgcag ttgctgccga tcctcgccag tcagggaagc cgctcaaagg 2941 tgagctgggc gagttctggc gctatcgggt gggagattat cgcgttctgt gtgagatccg 3001 agatgacgag cttgttatcc ttgccgccac gattggacat cgccgcgaag tttacgactg 3061 aagcccgcta tgcgggcttt tttgatgccg acaacaggta ttatggtaaa tactcgacta 3121 actacgtgtt ctactgtttg tcggttttag ttaaccagca ggttgtctta acgataatgt 3181 agccgttatt tttaactaat aaggtcatat tgttaacgtc aggcataata aaagacatta 3241 tggcctcaga gttatcatca gaagcagctc tctgcattac tacttcgatt cccttttctt 3301 tgaatctata agtatccagt tgcctattaa acttagaaag acctacaacc atacttagtt 3361 tgttatcttt aacagtaaga gatgcttcct gagtaccatt acatcggtag taagcggtgt 3421 aaccgtcagc aaaagatcga ccggggaata taaatgctag gaacgcaatt aaccaataca 3481 tagttgagaa tcttggtatt acacatttca ctatcctctc tttcattttt cgttcaccat 3541 actttgttat ctgattttta atttttctat tttataaacc aagtccctac caaaaaggcg 3601 aaaattatcc tcaaataata ataattgttt aagaacatcc atgaaagcag ctgcttcacc 3661 ctcagttatg ggagctactc tatataacat ataacctgag ttaagggttt cttttatact 3721 ttcttcaatg ctcattataa atattagtgg attatcattt ttcttccccc atgggtttct 3781 aattaatcct tcgatgtact caggtccttt gcgtcctatt gcctgcgcaa tattttttat 3841 ttcatctgcg tgagggcttt gttgtacagc tttgtttatg cctgcttgag tgattttcag 3901 agactctaca ccgtgtatgt tgtatgagat aaaatgtaat aaattaagat gtaatgcaat 3961 taatccattt ttgtttacat ctgagtattt ggataaaaaa cgtaatgaag tgttaactct 4021 cccttgaagc ctaacaattg ggagtgctgc cttgtatatt tccttctccg ttaaatgccg 4081 tttaaaaaac ataattctcc gcctccgtat agcgtgtcta ttttatttcg tcgtaaaact 4141 tagctctgaa atattccttt agaaccatca tttcattaat aattgaatct atctcatcca 4201 ttttttcttg gtattctttt aatggtgaag ggagcggtaa ctcaatattt gaaaagtgat 4261 tggcgagata gggatcaagg tttctcatcc attcaggctt ttcgccaatg cgttcttcaa 4321 ctgaaagtga acctgataat ctctggttaa taagatcact tagactcatt ttggttttat 4381 tcgaaagccg ttgtaaggat tgctttaatc ctatatctag catgatattt gttctttctc 4441 tcattacccc tcccccctcg ctattgcgtt ttttgttgcg cacaaagcta ccattgcgca 4501 ttaagaaacg caatgaaaaa ggagggaggg agtggtttgt acatgatgtg cagtgcgctg 4561 agggcatcag tggcgggtgt ttttccgtgg gatggatgga gctccgggga taacagaact 4621 gacgatcatt taaaggatct tcttgagatc ctttttttct gcgcgtaatc tcttgcactg 4681 taaacgaaaa aaccgcctgg ggaggcggtt tgatcgaagg ttaagtcagt tggggaactg 4741 cttaaccggg taactgggta tacgagaccg cagataccaa aactgttctt tcagtgtagc 4801 cgcagttagg cacccacttc aagacctctc aatatctctc tcgtatatat ccagttacca 4861 atggctgctg ccagtggcgt ttttgcgtgt ccgtccgggt tggactcaag atgatagtta 4921 ccggaaaggg cgcagcagtc gggctgaacg gggggttcgt gcatacagcc cagcttggag 4981 cgaacgacct aaaccgagcc gagataccag cagcgtgagc tatgagaaag cgccacgctt 5041 cccgaaaggg aaaaaggcgg aacaggtatc cggtaagcgg cagggtcgga acaggagagc 5101 gcacgaggga gccgccggag agaaatcacc ggcatcttta agtcctgtcg ggtttcgtcc 5161 tcctctgact tgagcgtcca tttctgtgat gctcgtcagg ggaggcggag cctatggaaa 5221 aacggccacg ccgcatcctt ttctccggca ctttctgatt tgagcgcctt tgctttatcc 5281 cgttatccgt gagccggtca cgcccgccgc agccgaacgg caggagcgca gcgaccgagt 5341 gagcgaggag gcgtcatttc atctgtccgt gcatggtatt gcggacgctc ctgagccaca 5401 tatcacacct gccacatgag gcactttcat gaatcgcgct ctcatgtcag catagtaagc 5461 cagtatacac agcgtaaacg ctgtgactgg ttcagggctg cgccccgaaa cccgctaaca 5521 ccactgacgc gctggcgctt gtccaacgcc ggagcgaccc ggaaaacatt tccgtgccgt 5581 ccggcgtttg cgaaggaggg a //