Definition | Bacillus cereus subsp. cytotoxis NVH 391-98, complete genome. |
---|---|
Accession | NC_009674 |
Length | 4,087,024 |
The map label for this gene is 152976925
Identifier: 152976925
GI number: 152976925
Start: 3293995
End: 3294819
Strand: Reverse
Name: 152976925
Synonym: Bcer98_3227
Alternate gene names: NA
Gene position: 3294819-3293995 (Counterclockwise)
Preceding gene: 152976927
Following gene: 152976924
Centisome position: 80.62
GC content: 46.18
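The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the centisome position is the gene's 5' end expressed as a percentage of the genome length; the record does not state either convention, but both reproduce the listed values. The function names are illustrative only.

```python
# Values copied from the record; the conventions are assumptions (see lead-in).
GENOME_LENGTH = 4_087_024
GENE_START = 3_293_995
GENE_END = 3_294_819


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the way around the chromosome."""
    return 100.0 * position / genome_length


length_bp = gene_length(GENE_START, GENE_END)

# On the reverse strand the 5' end of the gene is the larger coordinate.
five_prime_end = GENE_END
centisome_pos = round(centisome(five_prime_end, GENOME_LENGTH), 2)

print(length_bp)       # 825
print(centisome_pos)   # 80.62
```

A length of 825 bases corresponds to 275 codons, that is, 274 amino acids plus a stop codon, which agrees with the protein length given further down.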
Gene sequence:
>825_bases
ATGTCTCAATGGTACAATCATTTTCATACAAATTGTGGTTGTAATAAAACAACATATATTCATTCGTGCTGTTTTAGCTG
CGATGGGACGGTTCCTAAAATTGGTCCAACCGGTCCAACGGGAATAACAGGAGCAACCGGTCCAACGGGAATAACAGGAG
CAACCGGTCCAACGGGAATAACAGGAGCAACCGGTCCAACGGGAATAACAGGAGCAACCGGTCCAACGGGAATAACAGGA
GCAACCGGTCCAACGGGAATAACAGGAGCAACCGGTCCAACGGGAATAACAGGAGCAACCGGTCCAATGGGAATAACAGG
AGCAACCGGTCCAACGGGAATAACAGGAGCAACCGGCCCGACCGGTTCAACGGGAGCAACAGGAGCAACTGGCCCAACTG
GTCCAACGGGTGTTAGTGTAACGGCAACGTACGCTTTTGCCAATAATACATCTGGTGGGGCTATATCTGTTCTTCTTGGC
GGAACGAATGTACCGCTTCCAAATAATCAAAATATTGGTCCAGGTATTACTGTTTCTGGTGGGAATACCGTATTTACTGT
TGCAAGTGCAGGGAACTACTATATTTCATATACAATTAATATAACAGCTTCGTTATTAGTAAGTTCTCGAATTACTGTTA
ATGGCTCACCGCTTGCGGGAACCATTAATTCCCCACTAGTGGCAACGACTTCATTTAGCGCGACAATTATTACAACGCTT
GCGGCTGGTGACGCGATTAGTTTGCAGCTATTTGGAATATTGGCTGTTGCAACTTTATCAACTACTACCCCTGGAGCCGT
TTTGACGATTATTAGATTGAGTTGA
Upstream 100 bases:
>100_bases
AGTGTTGCGTATCAAAGATTGGTTGTAGACCGAATCAGTTAATGTTTTATTCGGTTACGAAAGTGTATGTTAAAGATAAA
AAATGAAGGAAGGAATCACC
Downstream 100 bases:
>100_bases
TTTGAAATTAGAAATATTAGGTAATGAAGTTCCCTCTATTGTAATAAATGAATAGGGGAACAAGTACATAGAGGAAAAGT
ATTGAATAGTTGCATAAAAG
Product: triple helix repeat-containing collagen
Products: NA
Alternate protein names: Triple Helix Repeat-Containing Collagen; Collagen-Like Protein; BclA Protein; Collagen Triple Helix Repeat Domain-Containing Protein; Copper Amine Oxidase Domain Protein
Number of amino acids: Translated: 274; Mature: 273
Protein sequence:
>274_residues
MSQWYNHFHTNCGCNKTTYIHSCCFSCDGTVPKIGPTGPTGITGATGPTGITGATGPTGITGATGPTGITGATGPTGITG
ATGPTGITGATGPTGITGATGPMGITGATGPTGITGATGPTGSTGATGATGPTGPTGVSVTATYAFANNTSGGAISVLLG
GTNVPLPNNQNIGPGITVSGGNTVFTVASAGNYYISYTINITASLLVSSRITVNGSPLAGTINSPLVATTSFSATIITTL
AAGDAISLQLFGILAVATLSTTTPGAVLTIIRLS
Sequences:
>Translated_274_residues
MSQWYNHFHTNCGCNKTTYIHSCCFSCDGTVPKIGPTGPTGITGATGPTGITGATGPTGITGATGPTGITGATGPTGITG
ATGPTGITGATGPTGITGATGPMGITGATGPTGITGATGPTGSTGATGATGPTGPTGVSVTATYAFANNTSGGAISVLLG
GTNVPLPNNQNIGPGITVSGGNTVFTVASAGNYYISYTINITASLLVSSRITVNGSPLAGTINSPLVATTSFSATIITTL
AAGDAISLQLFGILAVATLSTTTPGAVLTIIRLS
>Mature_273_residues
SQWYNHFHTNCGCNKTTYIHSCCFSCDGTVPKIGPTGPTGITGATGPTGITGATGPTGITGATGPTGITGATGPTGITGA
TGPTGITGATGPTGITGATGPMGITGATGPTGITGATGPTGSTGATGATGPTGPTGVSVTATYAFANNTSGGAISVLLGG
TNVPLPNNQNIGPGITVSGGNTVFTVASAGNYYISYTINITASLLVSSRITVNGSPLAGTINSPLVATTSFSATIITTLA
AGDAISLQLFGILAVATLSTTTPGAVLTIIRLS
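The relationship between the gene sequence and the translated and mature protein sequences can be illustrated with a minimal translation sketch. It assumes the standard codon assignments (the bacterial translation table shares them) and assumes the mature form is simply the translated form with the initiator methionine removed, which is consistent with the 274 and 273 residue counts above. The helper names `translate`, `first_codons`, and `last_codons` are hypothetical; the two DNA fragments are the first 30 and last 12 bases of the gene sequence in this record.

```python
from itertools import product

# Standard codon table built in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon ('*' marks a stop)."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_codons = "ATGTCTCAATGGTACAATCATTTTCATACA"  # bases 1-30 of the gene
last_codons = "AGATTGAGTTGA"                     # bases 814-825 of the gene

n_terminus = translate(first_codons)
c_terminus = translate(last_codons)

print(n_terminus)      # MSQWYNHFHT
print(c_terminus)      # RLS*
print(n_terminus[1:])  # SQWYNHFHT (mature form: initiator Met removed)
```

Because the gene lies on the reverse strand, the listed gene sequence is already the coding strand, so no reverse complement is needed before translating.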
Specific function: Unknown
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism | GI | Length | Percent identity | BLAST score | E-value |
---|---|---|---|---|---|
Homo sapiens | 56847616 | 276 | 30.4347826086957 | 88 | 8e-18 |
Homo sapiens | 65301115 | 256 | 31.640625 | 84 | 1e-16 |
Homo sapiens | 111118976 | 99 | 50.5050505050505 | 76 | 3e-14 |
Homo sapiens | 111118974 | 99 | 50.5050505050505 | 75 | 4e-14 |
Homo sapiens | 98985806 | 103 | 44.6601941747573 | 73 | 3e-13 |
Homo sapiens | 98985810 | 103 | 44.6601941747573 | 73 | 4e-13 |
Homo sapiens | 299523257 | 103 | 44.6601941747573 | 72 | 4e-13 |
Homo sapiens | 183583553 | 103 | 38.8349514563107 | 72 | 6e-13 |
Homo sapiens | 299523253 | 103 | 44.6601941747573 | 71 | 9e-13 |
Homo sapiens | 240255535 | 110 | 40 | 70 | 2e-12 |
Homo sapiens | 55743106 | 110 | 40 | 70 | 2e-12 |
Homo sapiens | 55743098 | 110 | 40 | 70 | 2e-12 |
Homo sapiens | 110735435 | 115 | 43.4782608695652 | 68 | 8e-12 |
Homo sapiens | 115392133 | 103 | 44.6601941747573 | 66 | 4e-11 |
Homo sapiens | 4502951 | 104 | 43.2692307692308 | 65 | 8e-11 |
Caenorhabditis elegans | 17551704 | 105 | 42.8571428571429 | 65 | 6e-11 |
Drosophila melanogaster | 45549584 | 142 | 33.8028169014084 | 72 | 3e-13 |
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 26116; Mature: 25984
Theoretical pI: Translated: 8.21; Mature: 8.21
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.8 %Cys (Translated Protein)
0.7 %Met (Translated Protein)
2.6 %Cys+Met (Translated Protein)
1.8 %Cys (Mature Protein)
0.4 %Met (Mature Protein)
2.2 %Cys+Met (Mature Protein)
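The Cys/Met percentages can be reproduced by counting residues in the protein sequence given in this record. The sketch below is one way to do that; it assumes the mature protein is the translated protein without its first residue, and the function name `residue_percent` is illustrative rather than part of any database tooling.

```python
# Translated protein sequence copied from this record (274 residues).
TRANSLATED = (
    "MSQWYNHFHTNCGCNKTTYIHSCCFSCDGTVPKIGPTGPTGITGATGPTGITGATGPTGITGATGPTGITGATGPTGITG"
    "ATGPTGITGATGPTGITGATGPMGITGATGPTGITGATGPTGSTGATGATGPTGPTGVSVTATYAFANNTSGGAISVLLG"
    "GTNVPLPNNQNIGPGITVSGGNTVFTVASAGNYYISYTINITASLLVSSRITVNGSPLAGTINSPLVATTSFSATIITTL"
    "AAGDAISLQLFGILAVATLSTTTPGAVLTIIRLS"
)
# Assumption: the mature form only lacks the initiator methionine.
MATURE = TRANSLATED[1:]


def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any of `residues`."""
    hits = sum(sequence.count(r) for r in residues)
    return round(100.0 * hits / len(sequence), 1)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(
        label,
        residue_percent(seq, "C"),
        residue_percent(seq, "M"),
        residue_percent(seq, "CM"),
    )
# Translated 1.8 0.7 2.6
# Mature 1.8 0.4 2.2
```

The translated protein contains five cysteines, all in the N-terminal region, and two methionines; removing the initiator methionine accounts for the lower Met figure in the mature protein.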
Secondary structure:
>Translated Secondary Structure
MSQWYNHFHTNCGCNKTTYIHSCCFSCDGTVPKIGPTGPTGITGATGPTGITGATGPTGI
CCHHHHHHCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
TGATGPTGITGATGPTGITGATGPTGITGATGPTGITGATGPMGITGATGPTGITGATGP
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
TGSTGATGATGPTGPTGVSVTATYAFANNTSGGAISVLLGGTNVPLPNNQNIGPGITVSG
CCCCCCCCCCCCCCCCCEEEEEEEEEECCCCCCEEEEEECCCCCCCCCCCCCCCCEEECC
GNTVFTVASAGNYYISYTINITASLLVSSRITVNGSPLAGTINSPLVATTSFSATIITTL
CCEEEEEEECCCEEEEEEEEEEEEEEEEEEEEECCCCCCCCCCCCEEEEECCCEEEEEEE
AAGDAISLQLFGILAVATLSTTTPGAVLTIIRLS
ECCCEEEEHHHHEEHHHHHCCCCCCEEEEEEEEC
>Mature Secondary Structure
SQWYNHFHTNCGCNKTTYIHSCCFSCDGTVPKIGPTGPTGITGATGPTGITGATGPTGI
CHHHHHHCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
TGATGPTGITGATGPTGITGATGPTGITGATGPTGITGATGPMGITGATGPTGITGATGP
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
TGSTGATGATGPTGPTGVSVTATYAFANNTSGGAISVLLGGTNVPLPNNQNIGPGITVSG
CCCCCCCCCCCCCCCCCEEEEEEEEEECCCCCCEEEEEECCCCCCCCCCCCCCCCEEECC
GNTVFTVASAGNYYISYTINITASLLVSSRITVNGSPLAGTINSPLVATTSFSATIITTL
CCEEEEEEECCCEEEEEEEEEEEEEEEEEEEEECCCCCCCCCCCCEEEEECCCEEEEEEE
AAGDAISLQLFGILAVATLSTTTPGAVLTIIRLS
ECCCEEEEHHHHEEHHHHHCCCCCCEEEEEEEEC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: NA