Definition | Bacillus anthracis str. CDC 684, complete genome. |
---|---|
Accession | NC_012581 |
Length | 5,230,115 |
The map label for this gene is 227814937
Identifier: 227814937
GI number: 227814937
Start: 2141291
End: 2142265
Strand: Direct
Name: 227814937
Synonym: BAMEG_2348
Alternate gene names: NA
Gene position: 2141291-2142265 (Clockwise)
Preceding gene: 227814935
Following gene: 227814942
Centisome position: 40.94
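The centisome value can be reproduced from the coordinates in this record. The sketch below assumes (the record does not state the formula) that it is the gene start expressed as a percentage of the genome length. The function name `centisome` is hypothetical.

```python
def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the total genome length."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return 100.0 * start / genome_length

# Values taken from this record: Start 2141291, genome Length 5,230,115.
position = centisome(2141291, 5230115)
print(f"{position:.2f}")  # prints 40.94
```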
GC content: 35.28%
Gene sequence:
>975_bases
ATGGCAAAGCGCTATGAAAATATGGATAATGTTAGCACAAAAAAATCAATTCGCTCTTTTATACGCTGGCGTAAAGAACG
AAAGCAAAACAAAAAAGATTTTTCTTTCTTAGTAGAACAATCACCCGTTAAACAAAGTGCATTTTTGCAAAATAATGTTA
AAAAAACGACTGTCACATGGATTGGGCATTCCACTTTTCTTATCCAAACGAATGGACTTAATATATTAACGGATCCAGTA
TGGGCCAATAAATTAAAATTAGTCCCAAGACTTACAGAACCTGGACTTTCTATAAAAGAACTACCTAAAATTGATATCGT
TCTTCTTTCACATGGCCATTATGATCACTTAGATTTTTCAACACTCCGCCAGCTGAACGATGACGTATTATACTTAGTGC
CCATCGGGTTAAAAAAATTATTTACTCGTAAAAAATTCAACAATGTAGAAGAGTATAAATGGTGGGAAAGTACGACTATT
GATAATGTTTCCTTTCACTTCGTACCTGCTCAGCACTGGACGAGAAGATCATTATTTGATATGAATACATCTCACTGGGG
TGGATGGATTATTAAGAATGACAATATGGAGGAAACCATTTATTTTTGCGGAGATAGTGGTTATTTCCAAGGCTTCAAAG
AAATTGGCAAACGCTTTTCAATTGATATTGCCCTTATGCCAATTGGAGCTTATGAACCAGAATGGTTTATGAAAATATCT
CACGTTTCACCTGAAGAAGCAGTGCAAGCTTATTTAGATTTACATGCTACACACTTTATACCAATGCATTACGGGGCTTT
TGCACTCGCTGATGAAACACCTCGCGAGGCAATAACAAGGCTTCGAAATAATTGGAATTTACGTATGTTACCGTGGGAAC
AACTGCATGTACTCTTTTTAGGTCAAACTTTCACTTATAATAGCGAAACACCTACTAAAAAAGTGAATGAAAAAATTGAA
ACGTTACATGTATAA
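The GC content field can be checked against the gene sequence. A minimal sketch, assuming GC content means the percentage of G and C among unambiguous bases (whitespace and any other symbols are ignored); `gc_content` is a hypothetical helper, not part of this database:

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Keep only unambiguous bases so that line breaks or spaces in a
    # pasted FASTA body do not affect the result.
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)

# First 30 bases of the gene sequence above, as a small illustration.
print(round(gc_content("ATGGCAAAGCGCTATGAAAATATGGATAAT"), 2))
```

Passing the full 975-base sequence in place of the 30-base prefix gives the value to compare with the field above.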
Upstream 100 bases:
>100_bases
TTTTTACAATTACTTAAACATTCATCCATGTTATACACTGTATTCCTTAACAACATTTCGGTACAATATAATTACAACGA
TTTTAAATGAGGTGATGTTC
Downstream 100 bases:
>100_bases
CGTAACTACTTATTTTCTATATCCAAACATATAATATAGAAAATCATAAAGTGAAACTTTAATCAGTAGGTGTTTTCTAT
GAATGAATATTTACAGACTC
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 324; Mature: 323
Protein sequence:
>324_residues
MAKRYENMDNVSTKKSIRSFIRWRKERKQNKKDFSFLVEQSPVKQSAFLQNNVKKTTVTWIGHSTFLIQTNGLNILTDPV
WANKLKLVPRLTEPGLSIKELPKIDIVLLSHGHYDHLDFSTLRQLNDDVLYLVPIGLKKLFTRKKFNNVEEYKWWESTTI
DNVSFHFVPAQHWTRRSLFDMNTSHWGGWIIKNDNMEETIYFCGDSGYFQGFKEIGKRFSIDIALMPIGAYEPEWFMKIS
HVSPEEAVQAYLDLHATHFIPMHYGAFALADETPREAITRLRNNWNLRMLPWEQLHVLFLGQTFTYNSETPTKKVNEKIE
TLHV
Sequences:
>Translated_324_residues
MAKRYENMDNVSTKKSIRSFIRWRKERKQNKKDFSFLVEQSPVKQSAFLQNNVKKTTVTWIGHSTFLIQTNGLNILTDPV
WANKLKLVPRLTEPGLSIKELPKIDIVLLSHGHYDHLDFSTLRQLNDDVLYLVPIGLKKLFTRKKFNNVEEYKWWESTTI
DNVSFHFVPAQHWTRRSLFDMNTSHWGGWIIKNDNMEETIYFCGDSGYFQGFKEIGKRFSIDIALMPIGAYEPEWFMKIS
HVSPEEAVQAYLDLHATHFIPMHYGAFALADETPREAITRLRNNWNLRMLPWEQLHVLFLGQTFTYNSETPTKKVNEKIE
TLHV
>Mature_323_residues
AKRYENMDNVSTKKSIRSFIRWRKERKQNKKDFSFLVEQSPVKQSAFLQNNVKKTTVTWIGHSTFLIQTNGLNILTDPVW
ANKLKLVPRLTEPGLSIKELPKIDIVLLSHGHYDHLDFSTLRQLNDDVLYLVPIGLKKLFTRKKFNNVEEYKWWESTTID
NVSFHFVPAQHWTRRSLFDMNTSHWGGWIIKNDNMEETIYFCGDSGYFQGFKEIGKRFSIDIALMPIGAYEPEWFMKISH
VSPEEAVQAYLDLHATHFIPMHYGAFALADETPREAITRLRNNWNLRMLPWEQLHVLFLGQTFTYNSETPTKKVNEKIET
LHV
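The translated and mature lengths follow from the gene coordinates: End minus Start plus one gives 975 bases, that is 325 codons, the last of which is the stop codon (TAA). The sketch below assumes, as the two sequences above suggest, that the mature form differs from the translated form only by removal of the initiator methionine. Both function names are hypothetical.

```python
def protein_length_from_cds(n_bases: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1

def mature_form(translated: str) -> str:
    """Drop the initiator methionine, if present."""
    return translated[1:] if translated.startswith("M") else translated

gene_length = 2142265 - 2141291 + 1           # 975 bases
print(protein_length_from_cds(gene_length))   # 324
print(mature_form("MAKRYENMDNVS"))            # AKRYENMDNVS
```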
Specific function: Unknown
COG id: COG2220
COG function: function code R; Predicted Zn-dependent hydrolases of the beta-lactamase fold
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: To K. pneumoniae romA [H]
Homologues:
Organism=Homo sapiens, GI170932483, Length=316, Percent_Identity=36.3924050632911, Blast_Score=192, Evalue=5e-49
Organism=Homo sapiens, GI170932481, Length=316, Percent_Identity=36.3924050632911, Blast_Score=192, Evalue=5e-49
Organism=Caenorhabditis elegans, GI17543102, Length=250, Percent_Identity=33.6, Blast_Score=160, Evalue=7e-40
Organism=Caenorhabditis elegans, GI17543100, Length=250, Percent_Identity=33.6, Blast_Score=160, Evalue=9e-40
Organism=Caenorhabditis elegans, GI25148824, Length=250, Percent_Identity=33.6, Blast_Score=160, Evalue=1e-39
Organism=Caenorhabditis elegans, GI17543104, Length=232, Percent_Identity=35.7758620689655, Blast_Score=155, Evalue=2e-38
Organism=Saccharomyces cerevisiae, GI6325154, Length=254, Percent_Identity=36.6141732283465, Blast_Score=149, Evalue=6e-37
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight (Da): Translated: 38113; Mature: 37982
Theoretical pI: Translated: 9.49; Mature: 9.49
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.3 %Cys (Translated Protein)
2.5 %Met (Translated Protein)
2.8 %Cys+Met (Translated Protein)
0.3 %Cys (Mature Protein)
2.2 %Met (Mature Protein)
2.5 %Cys+Met (Mature Protein)
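The Cys/Met percentages are simple residue frequencies. The translated sequence in this record contains 1 Cys and 8 Met in 324 residues, and the mature form has one Met fewer in 323 residues, which matches the figures listed. A minimal sketch of the calculation, assuming one-letter amino acid codes and a hypothetical helper `residue_percent`:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    if not protein:
        raise ValueError("protein sequence is empty")
    wanted = set(residues.upper())
    count = sum(1 for aa in protein.upper() if aa in wanted)
    return 100.0 * count / len(protein)

# Toy peptide for illustration; substitute the full protein sequence
# from this record to recompute the reported values.
peptide = "MAKRYENMDNVSC"
print(round(residue_percent(peptide, "C"), 1))
print(round(residue_percent(peptide, "M"), 1))
print(round(residue_percent(peptide, "CM"), 1))
```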
Secondary structure:
>Translated Secondary Structure
MAKRYENMDNVSTKKSIRSFIRWRKERKQNKKDFSFLVEQSPVKQSAFLQNNVKKTTVTW
CCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCCCHHHHHHHCCCCEEEEEE
IGHSTFLIQTNGLNILTDPVWANKLKLVPRLTEPGLSIKELPKIDIVLLSHGHYDHLDFS
EECEEEEEEECCCEEEECCCCCCCEEECCCCCCCCCCHHHCCCEEEEEEECCCCCCCCHH
TLRQLNDDVLYLVPIGLKKLFTRKKFNNVEEYKWWESTTIDNVSFHFVPAQHWTRRSLFD
HHHHCCCCEEEEEECCHHHHHHHHHCCCHHHHCCCCCCCCCCEEEEEEECCHHHHHHHHC
MNTSHWGGWIIKNDNMEETIYFCGDSGYFQGFKEIGKRFSIDIALMPIGAYEPEWFMKIS
CCCCCCCCEEEECCCCCEEEEEECCCHHHHHHHHHCHHEEEEEEEEECCCCCCHHEEEEE
HVSPEEAVQAYLDLHATHFIPMHYGAFALADETPREAITRLRNNWNLRMLPWEQLHVLFL
CCCHHHHHHHHHHHHHHEEEEECCCEEEECCCCHHHHHHHHHCCCCEEEECHHHEEEEEE
GQTFTYNSETPTKKVNEKIETLHV
EEEEEECCCCCHHHHHHHHHHCCC
>Mature Secondary Structure
AKRYENMDNVSTKKSIRSFIRWRKERKQNKKDFSFLVEQSPVKQSAFLQNNVKKTTVTW
CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCCCHHHHHHHCCCCEEEEEE
IGHSTFLIQTNGLNILTDPVWANKLKLVPRLTEPGLSIKELPKIDIVLLSHGHYDHLDFS
EECEEEEEEECCCEEEECCCCCCCEEECCCCCCCCCCHHHCCCEEEEEEECCCCCCCCHH
TLRQLNDDVLYLVPIGLKKLFTRKKFNNVEEYKWWESTTIDNVSFHFVPAQHWTRRSLFD
HHHHCCCCEEEEEECCHHHHHHHHHCCCHHHHCCCCCCCCCCEEEEEEECCHHHHHHHHC
MNTSHWGGWIIKNDNMEETIYFCGDSGYFQGFKEIGKRFSIDIALMPIGAYEPEWFMKIS
CCCCCCCCEEEECCCCCEEEEEECCCHHHHHHHHHCHHEEEEEEEEECCCCCCHHEEEEE
HVSPEEAVQAYLDLHATHFIPMHYGAFALADETPREAITRLRNNWNLRMLPWEQLHVLFL
CCCHHHHHHHHHHHHHHEEEEECCCEEEECCCCHHHHHHHHHCCCCEEEECHHHEEEEEE
GQTFTYNSETPTKKVNEKIETLHV
EEEEEECCCCCHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 9634230; 12218036 [H]