Definition Bacillus anthracis str. Sterne chromosome, complete genome.
Accession NC_005945
Length 5,228,663

The map label for this gene is ydcI [H]

Identifier: 49183274

GI number: 49183274

Start: 241905

End: 244079

Strand: Direct

Name: ydcI [H]

Synonym: BAS0241

Alternate gene names: 49183274

Gene position: 241905-244079 (Clockwise)

Preceding gene: 49183273

Following gene: 49183275

Centisome position: 4.63
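
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates, and assumes the centisome position is the gene start expressed as a percentage of the chromosome length. Both are assumptions about how this record was generated, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


GENOME_LENGTH = 5_228_663      # "Length" field of this record
START, END = 241_905, 244_079  # "Start" and "End" fields of this record

print(gene_length(START, END))                    # 2175
print(round(centisome(START, GENOME_LENGTH), 2))  # 4.63
```

Under these assumptions the computed values agree with the 2175-base gene sequence and the reported centisome position of 4.63.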

GC content: 36.51
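
The GC content is presumably the percentage of G and C bases in the gene sequence (36.51% of 2175 bases would correspond to 794 G or C bases). A minimal sketch of that calculation follows; `gc_content` is a hypothetical helper, not part of any tool named in this record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks are ignored so that a wrapped
    FASTA body can be passed in directly.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


print(gc_content("ATGC"))  # 50.0
```

To check the record, pass the full 2175-base gene sequence (without its `>2175_bases` header line) and round the result to two decimals.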

Gene sequence:

>2175_bases
TTGTATATGGAAATGGTAGATAATCGACAAGCGTTAATGAAAATGTTAGTGAAGGAATTAGGCTTTACCGAAAAGCAAGT
TCGTCATGTTATTCAATTAACAGAAGAAGGTAACACAGTTCCATTTATTGCTCGTTACCGAAAAGAATGGACAGGCTCTT
TAGATGAAGTGCAAATTCGTACAATTTTAGAGAGATGGCAATATATAATGCAGCTTGAAGATAGGAAAGAAGAGGTTCTT
CGTCTTATTGATGAGAAGGGGAAACTGACAGGAGAATTACGTCAGCAAATTTTGAAAGCTACAAAGTTGCAAGAAGTAGA
AGATTTATATCGTCCATATAAAGAAAAGAGAAGAACGAAAGCAACGATTGCAAAAGAAAAAGGATTAGAGCCATTGGCTG
AATGGTTATTGTTATATACGAAAGAAGATCCGAATCAGAAGGCAATGGAGTTTATAAATGCCGAGAAAGAAGTGCAATCT
GCAGAAGAAGCATTACAAGGTGCTCAAGACATCATTGCAGAAATCGTTTCAGATGAAGCGGCATATCGTAGTTGGATTCG
GAATGTTACTTTTAGAAAAGGTGTTATGTCTTCGGTTGTAAAAGATGAAGAGAAAGATGAAAAGAATATATATGAAATGT
ATTACGATTATGAAGAACCGTTGCAGAAAGTAGTACCGCATCGCGTGTTAGCGATGAATCGCGGTGAGAAAGAAGATATA
TTGAGAGTTTCTGTTGTTCCGCCAGTAGATGAGATAGTAACTTTCTTATATAAGAAAGTAATTCGCGATAACGATTCAAA
AAGTGCACAGTATGTAAAGTTAGCAATTGAGGATGGTTATAAAAGATTAATTCAATCCTCTATTGAAAGAGAGATTCGAA
AAGAATTAACAGAAACAGCTGAGGAACAAGCGATACATATTTTCTCTGAGAACTTACGTAATTTGTTATTACAACCTCCA
ATGAAAGGAAAGGTTGTGTTAGCTGTAGACCCTGCATATAGAACGGGGTGTAAATTAGCTATTGTAGACGATACGGGGAA
AGTTTTATATATTGATGTTATTTATCCGCATCCACCAGTGCGTAAATATGAAGATGCCAAAGCAAAAGTTCTTTCTATTA
TAGATAAATACCAAGTTGAGATGATTGCGATTGGGAATGGTACAGCTTCTAGAGAATCGGAAGAATTTATAGTTGATGTA
TTACAAAATGTGAAACGAGAAGTCTTCTATATTATTGTGAACGAAGCTGGTGCGAGTGTGTATTCGGCTTCTGATTTAGC
CCGTGAGGAATTCCCGGATTTACAGGTTGAAGAAAGAAGTGCGGTTTCTATCGGTAGACGTCTGCAAGATCCACTTGCTG
AACTTGTAAAGATTGATCCTAAATCAGTTGGGGTTGGACAATATCAGCATGATGTATCTCAAAAAAGATTAAATGAATCA
TTAACGTTTGTAGTAGAGACAGCAGTTAACCAAGTCGGTGTGAATGTAAATACAGCTTCAGTTGCATTGTTGCAATATGT
TTCAGGCTTATCAAAAACTGTTGCGAAAAATATTGTAGCAAAGCGTGAAGAAGAAGGGAAATTTACAAAAAGAACAGACT
TAAAGAAAATACCACGTCTAGGTGCGAAGACATATGAACAATGTATAGGTTTCTTGCGTATATTAGAAGGGGCAAATCCG
TTAGACCGTACAGGTATTCATCCGGAACAATATAAAAATGTTGAATTGTTATTAAAGAGTCTAGGGTTATCAAAAGATGA
CGTAGGGCAACCACAATTACAAAAGAGTTTAGAAGAAGTGGAGATTTCTAAGTTGTCGGAAGAAACGGGAATTGGAGAGC
CGACATTAGTCGATATTATAGATGCACTCATTAGTCCAGAGCGAGACATGAGGGATGAGTTGCCTAAACCGCTTCTGAAA
AAAGGGATTTTGAAATTAGAAGATTTAAAACGTGGTATGGAACTGGAAGGAACAGTACGTAACGTTGTTGATTTTGGTGC
TTTTGTTGATGTAGGTGTGAAACAAGATGGGTTAGTACATATTTCTAAACTAAGCAAACAATATGTGAAGCATCCGTTAG
ATGTTGTATCTGTTGGACAAATCGTCAAAGTATGGGTAGATGATATTGATACGAAAAAAGGTCGTGTTGCATTATCTATG
TTGCCAATTGAATAG

Upstream 100 bases:

>100_bases
GAAGCGCTACAAATTAGTTTAGGACTAATAGATTTTTAAATCGGCAGTTTAAGTTGCTCTCTTTGAAGGGCAACTTTTTT
TATTATAGAGGAGGTTACGG

Downstream 100 bases:

>100_bases
TAAGAGAAGAGAGCAGATGAAACTGCTCTTTTTTTATAGGGCAAAACGTGTAATGGATTAGGAACTTTGATGTATTTTGC
TATAATAAAACCAGCATTCA

Product: S1 RNA-binding domain-containing protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 724; Mature: 724

Protein sequence:

>724_residues
MYMEMVDNRQALMKMLVKELGFTEKQVRHVIQLTEEGNTVPFIARYRKEWTGSLDEVQIRTILERWQYIMQLEDRKEEVL
RLIDEKGKLTGELRQQILKATKLQEVEDLYRPYKEKRRTKATIAKEKGLEPLAEWLLLYTKEDPNQKAMEFINAEKEVQS
AEEALQGAQDIIAEIVSDEAAYRSWIRNVTFRKGVMSSVVKDEEKDEKNIYEMYYDYEEPLQKVVPHRVLAMNRGEKEDI
LRVSVVPPVDEIVTFLYKKVIRDNDSKSAQYVKLAIEDGYKRLIQSSIEREIRKELTETAEEQAIHIFSENLRNLLLQPP
MKGKVVLAVDPAYRTGCKLAIVDDTGKVLYIDVIYPHPPVRKYEDAKAKVLSIIDKYQVEMIAIGNGTASRESEEFIVDV
LQNVKREVFYIIVNEAGASVYSASDLAREEFPDLQVEERSAVSIGRRLQDPLAELVKIDPKSVGVGQYQHDVSQKRLNES
LTFVVETAVNQVGVNVNTASVALLQYVSGLSKTVAKNIVAKREEEGKFTKRTDLKKIPRLGAKTYEQCIGFLRILEGANP
LDRTGIHPEQYKNVELLLKSLGLSKDDVGQPQLQKSLEEVEISKLSEETGIGEPTLVDIIDALISPERDMRDELPKPLLK
KGILKLEDLKRGMELEGTVRNVVDFGAFVDVGVKQDGLVHISKLSKQYVKHPLDVVSVGQIVKVWVDDIDTKKGRVALSM
LPIE
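
The protein sequence above can be reproduced from the gene sequence by codon-wise translation. The sketch below uses the standard codon assignments and assumes that a first codon of ATG, GTG or TTG is read as methionine, as is usual for bacterial start codons; the gene in this record begins with TTG. The helper names are illustrative.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard codon table, built in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)
START_CODONS = {"ATG", "GTG", "TTG"}  # assumed set of accepted start codons


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon always encodes Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 33 bases of the gene sequence in this record.
print(translate("TTGTATATGGAAATGGTAGATAATCGACAAGCG"))  # MYMEMVDNRQA
```

The length is also consistent: 2175 bases are 725 codons, and removing the terminal TAG stop codon leaves 724 residues.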

Sequences:

>Translated_724_residues
MYMEMVDNRQALMKMLVKELGFTEKQVRHVIQLTEEGNTVPFIARYRKEWTGSLDEVQIRTILERWQYIMQLEDRKEEVL
RLIDEKGKLTGELRQQILKATKLQEVEDLYRPYKEKRRTKATIAKEKGLEPLAEWLLLYTKEDPNQKAMEFINAEKEVQS
AEEALQGAQDIIAEIVSDEAAYRSWIRNVTFRKGVMSSVVKDEEKDEKNIYEMYYDYEEPLQKVVPHRVLAMNRGEKEDI
LRVSVVPPVDEIVTFLYKKVIRDNDSKSAQYVKLAIEDGYKRLIQSSIEREIRKELTETAEEQAIHIFSENLRNLLLQPP
MKGKVVLAVDPAYRTGCKLAIVDDTGKVLYIDVIYPHPPVRKYEDAKAKVLSIIDKYQVEMIAIGNGTASRESEEFIVDV
LQNVKREVFYIIVNEAGASVYSASDLAREEFPDLQVEERSAVSIGRRLQDPLAELVKIDPKSVGVGQYQHDVSQKRLNES
LTFVVETAVNQVGVNVNTASVALLQYVSGLSKTVAKNIVAKREEEGKFTKRTDLKKIPRLGAKTYEQCIGFLRILEGANP
LDRTGIHPEQYKNVELLLKSLGLSKDDVGQPQLQKSLEEVEISKLSEETGIGEPTLVDIIDALISPERDMRDELPKPLLK
KGILKLEDLKRGMELEGTVRNVVDFGAFVDVGVKQDGLVHISKLSKQYVKHPLDVVSVGQIVKVWVDDIDTKKGRVALSM
LPIE
>Mature_724_residues
MYMEMVDNRQALMKMLVKELGFTEKQVRHVIQLTEEGNTVPFIARYRKEWTGSLDEVQIRTILERWQYIMQLEDRKEEVL
RLIDEKGKLTGELRQQILKATKLQEVEDLYRPYKEKRRTKATIAKEKGLEPLAEWLLLYTKEDPNQKAMEFINAEKEVQS
AEEALQGAQDIIAEIVSDEAAYRSWIRNVTFRKGVMSSVVKDEEKDEKNIYEMYYDYEEPLQKVVPHRVLAMNRGEKEDI
LRVSVVPPVDEIVTFLYKKVIRDNDSKSAQYVKLAIEDGYKRLIQSSIEREIRKELTETAEEQAIHIFSENLRNLLLQPP
MKGKVVLAVDPAYRTGCKLAIVDDTGKVLYIDVIYPHPPVRKYEDAKAKVLSIIDKYQVEMIAIGNGTASRESEEFIVDV
LQNVKREVFYIIVNEAGASVYSASDLAREEFPDLQVEERSAVSIGRRLQDPLAELVKIDPKSVGVGQYQHDVSQKRLNES
LTFVVETAVNQVGVNVNTASVALLQYVSGLSKTVAKNIVAKREEEGKFTKRTDLKKIPRLGAKTYEQCIGFLRILEGANP
LDRTGIHPEQYKNVELLLKSLGLSKDDVGQPQLQKSLEEVEISKLSEETGIGEPTLVDIIDALISPERDMRDELPKPLLK
KGILKLEDLKRGMELEGTVRNVVDFGAFVDVGVKQDGLVHISKLSKQYVKHPLDVVSVGQIVKVWVDDIDTKKGRVALSM
LPIE

Specific function: Unknown

COG id: COG2183

COG function: function code K; Transcriptional accessory protein

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 S1 motif domain [H]

Homologues:

Organism=Homo sapiens, GI221136781, Length=769, Percent_Identity=34.850455136541, Blast_Score=409, Evalue=1e-114,
Organism=Homo sapiens, GI27597090, Length=550, Percent_Identity=22.5454545454545, Blast_Score=76, Evalue=1e-13,
Organism=Escherichia coli, GI87082262, Length=721, Percent_Identity=44.7988904299584, Blast_Score=573, Evalue=1e-164,
Organism=Escherichia coli, GI1787140, Length=75, Percent_Identity=45.3333333333333, Blast_Score=70, Evalue=6e-13,
Organism=Caenorhabditis elegans, GI17511129, Length=762, Percent_Identity=30.7086614173228, Blast_Score=269, Evalue=3e-72,
Organism=Caenorhabditis elegans, GI17552892, Length=315, Percent_Identity=25.0793650793651, Blast_Score=71, Evalue=2e-12,
Organism=Drosophila melanogaster, GI62484314, Length=746, Percent_Identity=33.3780160857909, Blast_Score=384, Evalue=1e-106,
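
Each homologue entry above is a comma-separated list of `key=value` fields, plus a bare GI identifier. A minimal parsing sketch, assuming that layout holds for every entry, might look like this; `parse_homologue` is a hypothetical helper.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue entry into a dictionary with typed values."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare identifier such as GI221136781
    # Convert the numeric fields; names follow the record's own labels.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


entry = ("Organism=Homo sapiens, GI221136781, Length=769, "
         "Percent_Identity=34.850455136541, Blast_Score=409, Evalue=1e-114,")
hit = parse_homologue(entry)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 2))
```

Parsed this way, the entries can be sorted by `Blast_Score` or filtered by `Evalue` without further string handling.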

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR012340
- InterPro:   IPR016027
- InterPro:   IPR003029
- InterPro:   IPR005227
- InterPro:   IPR006641
- InterPro:   IPR022967
- InterPro:   IPR018974
- InterPro:   IPR023097 [H]

Pfam domain/function: PF00575 S1; PF09371 Tex_N [H]

EC number: NA

Molecular weight: Translated: 82527; Mature: 82527

Theoretical pI: Translated: 5.36; Mature: 5.36

Prosite motif: PS50126 S1

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
2.1 %Met     (Translated Protein)
2.3 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
2.1 %Met     (Mature Protein)
2.3 %Cys+Met (Mature Protein)
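
The percentages above are presumably residue counts divided by the protein length of 724. The sketch below shows that calculation with a hypothetical helper, `residue_percent`. Note that the three reported values appear to be rounded independently, so the Cys and Met figures need not sum exactly to the combined figure.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues.upper())
    return 100.0 * hits / len(protein)


# Toy input for illustration only, not the protein in this record.
toy = "MCAAKLMV"
print(residue_percent(toy, "C"))   # 12.5
print(residue_percent(toy, "M"))   # 25.0
print(residue_percent(toy, "CM"))  # 37.5
```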

Secondary structure:

>Translated Secondary Structure
MYMEMVDNRQALMKMLVKELGFTEKQVRHVIQLTEEGNTVPFIARYRKEWTGSLDEVQIR
CCHHHHCCHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCHHHHHHH
TILERWQYIMQLEDRKEEVLRLIDEKGKLTGELRQQILKATKLQEVEDLYRPYKEKRRTK
HHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
ATIAKEKGLEPLAEWLLLYTKEDPNQKAMEFINAEKEVQSAEEALQGAQDIIAEIVSDEA
HHHHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHH
AYRSWIRNVTFRKGVMSSVVKDEEKDEKNIYEMYYDYEEPLQKVVPHRVLAMNRGEKEDI
HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCHHHHHHHHCCHHHHHHCCCCHHCE
LRVSVVPPVDEIVTFLYKKVIRDNDSKSAQYVKLAIEDGYKRLIQSSIEREIRKELTETA
EEEECCCCHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHH
EEQAIHIFSENLRNLLLQPPMKGKVVLAVDPAYRTGCKLAIVDDTGKVLYIDVIYPHPPV
HHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCEEEEEECCCCEEEEEEECCCCCC
RKYEDAKAKVLSIIDKYQVEMIAIGNGTASRESEEFIVDVLQNVKREVFYIIVNEAGASV
CCCHHHHHHHHHHHHHHCEEEEEECCCCCCCCHHHHHHHHHHHHHHHEEEEEEECCCCCH
YSASDLAREEFPDLQVEERSAVSIGRRLQDPLAELVKIDPKSVGVGQYQHDVSQKRLNES
HHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHH
LTFVVETAVNQVGVNVNTASVALLQYVSGLSKTVAKNIVAKREEEGKFTKRTDLKKIPRL
HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHC
GAKTYEQCIGFLRILEGANPLDRTGIHPEQYKNVELLLKSLGLSKDDVGQPQLQKSLEEV
CHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHCCHHHHHHHHCCCCCCCCCHHHHHHHHHH
EISKLSEETGIGEPTLVDIIDALISPERDMRDELPKPLLKKGILKLEDLKRGMELEGTVR
HHHHHHHHCCCCCCHHHHHHHHHHCCCHHHHHHCCHHHHHHHHHHHHHHHCCCCCCHHHH
NVVDFGAFVDVGVKQDGLVHISKLSKQYVKHPLDVVSVGQIVKVWVDDIDTKKGRVALSM
HHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCCCEEEEE
LPIE
CCCC
>Mature Secondary Structure
MYMEMVDNRQALMKMLVKELGFTEKQVRHVIQLTEEGNTVPFIARYRKEWTGSLDEVQIR
CCHHHHCCHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCHHHHHHH
TILERWQYIMQLEDRKEEVLRLIDEKGKLTGELRQQILKATKLQEVEDLYRPYKEKRRTK
HHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
ATIAKEKGLEPLAEWLLLYTKEDPNQKAMEFINAEKEVQSAEEALQGAQDIIAEIVSDEA
HHHHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHH
AYRSWIRNVTFRKGVMSSVVKDEEKDEKNIYEMYYDYEEPLQKVVPHRVLAMNRGEKEDI
HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCHHHHHHHHCCHHHHHHCCCCHHCE
LRVSVVPPVDEIVTFLYKKVIRDNDSKSAQYVKLAIEDGYKRLIQSSIEREIRKELTETA
EEEECCCCHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHH
EEQAIHIFSENLRNLLLQPPMKGKVVLAVDPAYRTGCKLAIVDDTGKVLYIDVIYPHPPV
HHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCEEEEEECCCCEEEEEEECCCCCC
RKYEDAKAKVLSIIDKYQVEMIAIGNGTASRESEEFIVDVLQNVKREVFYIIVNEAGASV
CCCHHHHHHHHHHHHHHCEEEEEECCCCCCCCHHHHHHHHHHHHHHHEEEEEEECCCCCH
YSASDLAREEFPDLQVEERSAVSIGRRLQDPLAELVKIDPKSVGVGQYQHDVSQKRLNES
HHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHH
LTFVVETAVNQVGVNVNTASVALLQYVSGLSKTVAKNIVAKREEEGKFTKRTDLKKIPRL
HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHC
GAKTYEQCIGFLRILEGANPLDRTGIHPEQYKNVELLLKSLGLSKDDVGQPQLQKSLEEV
CHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHCCHHHHHHHHCCCCCCCCCHHHHHHHHHH
EISKLSEETGIGEPTLVDIIDALISPERDMRDELPKPLLKKGILKLEDLKRGMELEGTVR
HHHHHHHHCCCCCCHHHHHHHHHHCCCHHHHHHCCHHHHHHHHHHHHHHHCCCCCCHHHH
NVVDFGAFVDVGVKQDGLVHISKLSKQYVKHPLDVVSVGQIVKVWVDDIDTKKGRVALSM
HHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCCCEEEEE
LPIE
CCCC
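
The secondary structure blocks above alternate a line of residues with a line of per-residue states. A minimal sketch for rejoining the wrapped lines and summarising the states follows. It assumes the conventional reading of H as helix, E as strand and C as coil, which the record itself does not state, and the function names are illustrative.

```python
from collections import Counter


def join_blocks(lines):
    """Rejoin alternating residue/state lines into two full-length strings."""
    residues = "".join(lines[0::2])
    states = "".join(lines[1::2])
    if len(residues) != len(states):
        raise ValueError("residue and state strings differ in length")
    return residues, states


def state_fractions(states):
    """Fraction of positions in each state (H, E, C)."""
    counts = Counter(states)
    return {s: counts[s] / len(states) for s in "HEC"}


# Toy block in the same alternating layout, for illustration only.
toy = ["MYMEMV", "CCHHHH", "LPIE", "CEEC"]
residues, states = join_blocks(toy)
print(residues, states)
print(state_fractions(states))
```

For the real record, pass the lines that follow the `>Translated Secondary Structure` header, excluding the header itself.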

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9384377 [H]