Definition: Bacillus thuringiensis str. Al Hakam chromosome, complete genome.
Accession: NC_008600
Length: 5,257,091

The map label for this gene is 118478463

Identifier: 118478463

GI number: 118478463

Start: 3000477

End: 3003104

Strand: Reverse

Name: 118478463

Synonym: BALH_2835

Alternate gene names: NA

Gene position: 3003104-3000477 (Counterclockwise)

Preceding gene: 118478464

Following gene: 118478462

Centisome position: 57.12
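
The coordinate fields above are related by simple arithmetic. The sketch below assumes that the centisome position is the gene's first base (the End coordinate for a reverse-strand gene) expressed as a percentage of the chromosome length. That definition is an assumption, although it does reproduce the reported value. The function names are illustrative only and do not belong to any database API.

```python
GENOME_LENGTH = 5_257_091  # Length field of NC_008600

def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene spanning start..end (1-based, inclusive)."""
    return end - start + 1

def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's first base as a percentage of the chromosome.

    Assumption: for a reverse-strand gene the first base is the End coordinate.
    """
    first_base = end if strand == "Reverse" else start
    return 100.0 * first_base / genome_length

print(gene_length(3000477, 3003104))
print(round(centisome(3000477, 3003104, "Reverse", GENOME_LENGTH), 2))
```

With the values in this record, the length comes out at 2628 bases, matching the header of the gene sequence, and the centisome position rounds to 57.12.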

GC content: 38.62
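
GC content is the percentage of bases in the gene sequence that are G or C. A minimal sketch of the calculation follows. The helper name is hypothetical, and the example runs on a short fragment only, not on the full 2628-base sequence.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# First ten bases of the gene sequence in this record (4 G and 1 C).
print(gc_content("GTGATGAGCA"))  # 50.0
```

A value of 38.62 over 2628 bases would correspond to about 1015 G or C bases.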

Gene sequence:

>2628_bases
GTGATGAGCATACAGTCAATCATAACGAAAGAAACTTTAAAAAAGAAAGATACAAACATTGAAATACAAGAAAAGAATAT
GAATGATTTAGTTGAATCTGCTAGCCGAGTGATTGCACCACTTTGGCCGATTGCTACATTTGCAGCTCATCACCCTTGGA
TGGGGCTTGAAAAGCAATCATTTGAACAAGTTGCAGATTGGTTAAAAGAAGCTCGTAATGTAGATATATATCCTAGTGCT
TCTATGATTCATTCGGCAAAAGCGAAAGGTGAGATTGAGGAATCATTTTTACAAAGTGGCTTGTCTCGTTGGCTTGATTC
ACAATCTTTTCATATCCCGCGAAAGAAAGTGGAGCAGTTTTGCCAAGCAGCTTTAAAATTAGAGGAATTACCTTCTAGTC
TATTATCATCGCCACAATTAAATAAACTAGCAGAAGAAATGAGTTATATAAATACAGAGAGTATGAAAGACTCTTTTTTG
CAACCTGTAAGTTCATTTATAGAGAATCAGAAGGGTGAAAACCTATCTGATATTCTTAATTATCACATCATTAAATGGTG
TAAATTATATTTGGATGATTCCGGGTCAAGTTGGACGATGCCAAATCGAGAGCAAGGTTTGTATCGTGCGTGGCATCACC
TTATTAAATTTGATCCAGCGCTTAGTAAAAATGAGCGTAGCGTTTTAAAAGATTGGCCAGAGGATGCCGAGATAGCTTTA
ACAAGAGCGTTATCTGAATTAGGAATTTCTGAATCTAACAAGCAAGCTTACCTTGAAGGTCACTTGCTTGCTTTGCCTGG
GTGGGCAGGAATGATACTATGGCGTTCCCAACAGTCAACTCAGGAGCAGGAACTTCTCATACAATATTTAGCAGTTCGAA
TCTCTATGGAGCTGGCCATTGTCAAACCTTATTTACCTATAAAAAATCAAAAGGCTGAGAAAAAAATTGCGATTGTCCCA
CTTATAGCTTCTTGGATCTATTGGGGGAATATTTCAACCCTGAAATGGTCACAAATGTCTGCAGCTGAACAAAGCGAATT
ATTAGCATTTGCTTATCGTTTTGATGAAAATATTCGCAGGAAACTTTGGTTGGAAGCTTGGGAACAGACGCATGCAGAGC
AATTAAAGAAGAAGATTTCTTCTAAACAACGTGCAACAAATGATAAAAAACGTGCACTAGCTCAATTAGCATTCTGTATT
GATGTACGTTCAGAACCATTTCGTCGACATCTAGAAAAATTAGGTCCATTCGAAACGTTTGGAATAGCTGGTTTCTTTGG
GTTACCAATTGCGACTAGTGAACTAGGCAGTAGCGACAGCCATCCTTCTTTGCCAGTTATATTAAAACCGAAACATCAAA
TAAAAGAGCTCACGGATGAAAATGAATTTAAAAATTATCAACAACGTAAGAAGATAGACAGTTCAGTAAGCTATACGTTT
AAAACGATGAAACAAAATGTACTGACAAGTATGCTATTACCTGAGGTAAGTGGCCCTTTACTTGGTTTACAAATGGTAAC
AAGGAGTTTTGTGCCAAGAAGAGTAGGTAGTTTCCTTCGTAATCTTCGTAAAACGATGTTACAAAAACCTGATACAACTT
TCTCACTTAATCATGTTCATGATACAAAATGTGAGATACCTATCGGTTTTACGAAAGAAGAAAAAGTGAACTATGTGCGT
CAAGCTTTAAAGATGGTGGGATTGACAGAAAAATTCGCGCCTTTAGTTGTAATGTGCGGACACAGTAGTCAAAGTACGAA
CAACCCTTATGCTGCAGCGCTTGAGTGCGGTGCTTGTGGCGGAGCAGCAGGGGGATTCAATGCAAAAGTTTTCGCTACTT
TATGTAACCTTCCAGAAGTAAGAGAGGCGTTGTCTGCTGAAGGCATTAAAATCCCTGAGGATACCATTTTTGCAGCAGCT
GAACATAAAACAACAGTGGATGAATTGGAATGGATTTACATCCCAGAACTTTCGGAAACTGCGCAAGAAGCATTTGATAG
TATTGAGTCGGTTATGCCAAATGTGAGCCAGCATGCAAATAGAGAGCGTTTAATGCAATTACCTAATTTTAAAACGGAAA
TAAAAAATCCGAGTAAAGAGGCCCATCGATTTGCAGAAGATTGGAGTGAAATACGTCCGGAATGGGGATTAGCCCGTAAT
GCATCTTTTATTATCGGACAACGAGAATTAACTCAGGATTGTGACTTAGAAGGAAGGGCTTTTCTTCATAATTATGATTG
GAAACAGGATGGAAGTGGAGATATATTAGCTAGCATCATAGCGGGACCAGGAACAGTAGCGCAGTGGATCAATCTACAAT
ATTACGCTTCAACTGTAGCCCCTCATTATTATGGTAGTGGAAACAAAACAACCCAAACCGTAACGGCAGGTCTTGGTGTT
ATGCAAGGAAATGCAAGTGACTTGTTGCCAGGCTTACCTTGGCAATCCGTTATGCAATCAGATCGTGAGACGTATCATTC
ACCACTTCGTTTATTAATTGTAATTCAAGCACCTACTAAATATATAGAACATTTGCTAAATAATGATTTCACATTTCGAG
AAAAAGTTCAAAATGGATGGGTTAGACTTGCCAGTGTTGATCCAGAGGGACGTTGGAAAAACTGGTAA

Upstream 100 bases:

>100_bases
TAATGTATCTTTGGTTCGTTCGAGTAGGTGAGGCGAGACAAAAGTCAGTAGAAAGTCATCCAAGCTATCTGAAACATTAT
GTATCAAAGGGAGGTAATCA

Downstream 100 bases:

>100_bases
CAATCATGTAATGAATACTCAACTATTGTTACTAGTTTAAAAGTTCTATAAATACAATCATGAAAAAGGAGTGCTATATA
TGAATGCAAAGAAGAAAAAG
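
Because the gene lies on the reverse strand, its sequence reads as the reverse complement of the chromosome, and its flanking regions swap sides relative to the chromosome coordinates. The sketch below assumes that "upstream" means 5' of the gene on its own strand. Whether this record's 100-base windows were extracted exactly this way is not stated in the record, and the function names are hypothetical.

```python
def flank_coords(start, end, strand, n=100):
    """Return (upstream, downstream) coordinate ranges, 1-based inclusive.

    Assumption: upstream is 5' of the gene on its own strand, so for a
    reverse-strand gene it lies at higher chromosome coordinates.
    """
    if strand == "Reverse":
        return (end + 1, end + n), (start - n, start - 1)
    return (start - n, start - 1), (end + 1, end + n)

_COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Reverse-complement a DNA string (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]

upstream, downstream = flank_coords(3000477, 3003104, "Reverse")
print(upstream, downstream)
print(reverse_complement("GTGATG"))  # CATCAC
```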

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 875; Mature: 875

Protein sequence:

>875_residues
MMSIQSIITKETLKKKDTNIEIQEKNMNDLVESASRVIAPLWPIATFAAHHPWMGLEKQSFEQVADWLKEARNVDIYPSA
SMIHSAKAKGEIEESFLQSGLSRWLDSQSFHIPRKKVEQFCQAALKLEELPSSLLSSPQLNKLAEEMSYINTESMKDSFL
QPVSSFIENQKGENLSDILNYHIIKWCKLYLDDSGSSWTMPNREQGLYRAWHHLIKFDPALSKNERSVLKDWPEDAEIAL
TRALSELGISESNKQAYLEGHLLALPGWAGMILWRSQQSTQEQELLIQYLAVRISMELAIVKPYLPIKNQKAEKKIAIVP
LIASWIYWGNISTLKWSQMSAAEQSELLAFAYRFDENIRRKLWLEAWEQTHAEQLKKKISSKQRATNDKKRALAQLAFCI
DVRSEPFRRHLEKLGPFETFGIAGFFGLPIATSELGSSDSHPSLPVILKPKHQIKELTDENEFKNYQQRKKIDSSVSYTF
KTMKQNVLTSMLLPEVSGPLLGLQMVTRSFVPRRVGSFLRNLRKTMLQKPDTTFSLNHVHDTKCEIPIGFTKEEKVNYVR
QALKMVGLTEKFAPLVVMCGHSSQSTNNPYAAALECGACGGAAGGFNAKVFATLCNLPEVREALSAEGIKIPEDTIFAAA
EHKTTVDELEWIYIPELSETAQEAFDSIESVMPNVSQHANRERLMQLPNFKTEIKNPSKEAHRFAEDWSEIRPEWGLARN
ASFIIGQRELTQDCDLEGRAFLHNYDWKQDGSGDILASIIAGPGTVAQWINLQYYASTVAPHYYGSGNKTTQTVTAGLGV
MQGNASDLLPGLPWQSVMQSDRETYHSPLRLLIVIQAPTKYIEHLLNNDFTFREKVQNGWVRLASVDPEGRWKNW
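
The protein sequence follows from the gene sequence by codon translation: 2628 bases give 876 codons, and dropping the final stop codon leaves 875 residues. The sketch below uses the standard genetic code and assumes that an initial GTG (or TTG) start codon is read as methionine, as is usual for bacterial genes. The set of start codons and the function name are assumptions made for illustration.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))  # standard genetic code, '*' = stop

START_CODONS = {"ATG", "GTG", "TTG"}  # assumption: common bacterial starts

def translate(seq, first_is_start=True):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = seq.upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")  # start codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

print(translate("GTGATGAGCATACAGTCAATC"))               # first 7 codons
print(translate("AAAAACTGGTAA", first_is_start=False))  # last 4 codons
print(2628 // 3 - 1)                                    # residues without stop
```

The first call yields MMSIQSI and the second KNW, which agree with the start and end of the protein sequence above.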

Sequences:

>Translated_875_residues
MMSIQSIITKETLKKKDTNIEIQEKNMNDLVESASRVIAPLWPIATFAAHHPWMGLEKQSFEQVADWLKEARNVDIYPSA
SMIHSAKAKGEIEESFLQSGLSRWLDSQSFHIPRKKVEQFCQAALKLEELPSSLLSSPQLNKLAEEMSYINTESMKDSFL
QPVSSFIENQKGENLSDILNYHIIKWCKLYLDDSGSSWTMPNREQGLYRAWHHLIKFDPALSKNERSVLKDWPEDAEIAL
TRALSELGISESNKQAYLEGHLLALPGWAGMILWRSQQSTQEQELLIQYLAVRISMELAIVKPYLPIKNQKAEKKIAIVP
LIASWIYWGNISTLKWSQMSAAEQSELLAFAYRFDENIRRKLWLEAWEQTHAEQLKKKISSKQRATNDKKRALAQLAFCI
DVRSEPFRRHLEKLGPFETFGIAGFFGLPIATSELGSSDSHPSLPVILKPKHQIKELTDENEFKNYQQRKKIDSSVSYTF
KTMKQNVLTSMLLPEVSGPLLGLQMVTRSFVPRRVGSFLRNLRKTMLQKPDTTFSLNHVHDTKCEIPIGFTKEEKVNYVR
QALKMVGLTEKFAPLVVMCGHSSQSTNNPYAAALECGACGGAAGGFNAKVFATLCNLPEVREALSAEGIKIPEDTIFAAA
EHKTTVDELEWIYIPELSETAQEAFDSIESVMPNVSQHANRERLMQLPNFKTEIKNPSKEAHRFAEDWSEIRPEWGLARN
ASFIIGQRELTQDCDLEGRAFLHNYDWKQDGSGDILASIIAGPGTVAQWINLQYYASTVAPHYYGSGNKTTQTVTAGLGV
MQGNASDLLPGLPWQSVMQSDRETYHSPLRLLIVIQAPTKYIEHLLNNDFTFREKVQNGWVRLASVDPEGRWKNW
>Mature_875_residues
MMSIQSIITKETLKKKDTNIEIQEKNMNDLVESASRVIAPLWPIATFAAHHPWMGLEKQSFEQVADWLKEARNVDIYPSA
SMIHSAKAKGEIEESFLQSGLSRWLDSQSFHIPRKKVEQFCQAALKLEELPSSLLSSPQLNKLAEEMSYINTESMKDSFL
QPVSSFIENQKGENLSDILNYHIIKWCKLYLDDSGSSWTMPNREQGLYRAWHHLIKFDPALSKNERSVLKDWPEDAEIAL
TRALSELGISESNKQAYLEGHLLALPGWAGMILWRSQQSTQEQELLIQYLAVRISMELAIVKPYLPIKNQKAEKKIAIVP
LIASWIYWGNISTLKWSQMSAAEQSELLAFAYRFDENIRRKLWLEAWEQTHAEQLKKKISSKQRATNDKKRALAQLAFCI
DVRSEPFRRHLEKLGPFETFGIAGFFGLPIATSELGSSDSHPSLPVILKPKHQIKELTDENEFKNYQQRKKIDSSVSYTF
KTMKQNVLTSMLLPEVSGPLLGLQMVTRSFVPRRVGSFLRNLRKTMLQKPDTTFSLNHVHDTKCEIPIGFTKEEKVNYVR
QALKMVGLTEKFAPLVVMCGHSSQSTNNPYAAALECGACGGAAGGFNAKVFATLCNLPEVREALSAEGIKIPEDTIFAAA
EHKTTVDELEWIYIPELSETAQEAFDSIESVMPNVSQHANRERLMQLPNFKTEIKNPSKEAHRFAEDWSEIRPEWGLARN
ASFIIGQRELTQDCDLEGRAFLHNYDWKQDGSGDILASIIAGPGTVAQWINLQYYASTVAPHYYGSGNKTTQTVTAGLGV
MQGNASDLLPGLPWQSVMQSDRETYHSPLRLLIVIQAPTKYIEHLLNNDFTFREKVQNGWVRLASVDPEGRWKNW

Specific function: Unknown

COG id: COG3002

COG function: function code S; Uncharacterized protein conserved in bacteria

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Belongs to the UPF0753 family

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): Y2835_BACAH (A0RFW4)

Other databases:

- EMBL:   CP000485
- RefSeq:   YP_895614.1
- STRING:   A0RFW4
- EnsemblBacteria:   EBBACT00000068184
- GeneID:   4542747
- GenomeReviews:   CP000485_GR
- KEGG:   btl:BALH_2835
- NMPDR:   fig|412694.5.peg.2737
- eggNOG:   COG3002
- GeneTree:   EBGT00070000032270
- HOGENOM:   HBG520117
- OMA:   FAEDWSE
- ProtClustDB:   CLSK884660
- BioCyc:   BTHU412694:BALH_2835-MONOMER
- HAMAP:   MF_01871
- InterPro:   IPR018752

Pfam domain/function: PF10070 DUF2309

EC number: NA

Molecular weight: Translated: 99298; Mature: 99298

Theoretical pI: Translated: 7.01; Mature: 7.01

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
2.4 %Met     (Translated Protein)
3.4 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
3.4 %Cys+Met (Mature Protein)
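
The Cys/Met percentages are residue counts divided by the protein length. A minimal sketch follows, with a hypothetical helper name and rounding to one decimal place assumed from the values shown above. The example uses only the first 20 residues, so its output is not the figure for the whole protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in protein occupied by any of the given residues."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(1 for aa in protein if aa in residues)
    return round(100.0 * count / len(protein), 1)

fragment = "MMSIQSIITKETLKKKDTNI"  # first 20 residues of this protein
print(residue_percent(fragment, "C"))   # 0.0
print(residue_percent(fragment, "M"))   # 10.0
print(residue_percent(fragment, "CM"))  # 10.0
```

Over the full 875 residues, 1.0 % Cys and 2.4 % Met would correspond to roughly 9 cysteines and 21 methionines.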

Secondary structure:

>Translated Secondary Structure
MMSIQSIITKETLKKKDTNIEIQEKNMNDLVESASRVIAPLWPIATFAAHHPWMGLEKQS
CCCHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHH
FEQVADWLKEARNVDIYPSASMIHSAKAKGEIEESFLQSGLSRWLDSQSFHIPRKKVEQF
HHHHHHHHHHHCCCCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHH
CQAALKLEELPSSLLSSPQLNKLAEEMSYINTESMKDSFLQPVSSFIENQKGENLSDILN
HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHH
YHIIKWCKLYLDDSGSSWTMPNREQGLYRAWHHLIKFDPALSKNERSVLKDWPEDAEIAL
HHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHCCCCCHHHHH
TRALSELGISESNKQAYLEGHLLALPGWAGMILWRSQQSTQEQELLIQYLAVRISMELAI
HHHHHHHCCCCCCCCEEECCEEEEECCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHEEE
VKPYLPIKNQKAEKKIAIVPLIASWIYWGNISTLKWSQMSAAEQSELLAFAYRFDENIRR
ECCCCCCCCCCCCCCEEHHHHHHHHHHHCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHH
KLWLEAWEQTHAEQLKKKISSKQRATNDKKRALAQLAFCIDVRSEPFRRHLEKLGPFETF
HHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCHHH
GIAGFFGLPIATSELGSSDSHPSLPVILKPKHQIKELTDENEFKNYQQRKKIDSSVSYTF
CCCHHHCCCCCHHHCCCCCCCCCCCEEECCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHH
KTMKQNVLTSMLLPEVSGPLLGLQMVTRSFVPRRVGSFLRNLRKTMLQKPDTTFSLNHVH
HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCC
DTKCEIPIGFTKEEKVNYVRQALKMVGLTEKFAPLVVMCGHSSQSTNNPYAAALECGACG
CCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHCCCEEEEECCCCCCCCCCCEEEEECCCCC
GAAGGFNAKVFATLCNLPEVREALSAEGIKIPEDTIFAAAEHKTTVDELEWIYIPELSET
CCCCCCCHHHHHHHHCCHHHHHHHCCCCCCCCCHHEEEHHCCCCHHHHCCEEECCCHHHH
AQEAFDSIESVMPNVSQHANRERLMQLPNFKTEIKNPSKEAHRFAEDWSEIRPEWGLARN
HHHHHHHHHHHCCCHHHHCCHHHHHHCCCCHHHCCCCHHHHHHHHHHHHHHCCCCCCCCC
ASFIIGQRELTQDCDLEGRAFLHNYDWKQDGSGDILASIIAGPGTVAQWINLQYYASTVA
CEEEECCHHHCCCCCCCCCEEEECCCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHC
PHYYGSGNKTTQTVTAGLGVMQGNASDLLPGLPWQSVMQSDRETYHSPLRLLIVIQAPTK
CCEECCCCCCHHHHHHCCCEECCCHHHCCCCCCHHHHHHHHHHHHCCCEEEEEEEECCHH
YIEHLLNNDFTFREKVQNGWVRLASVDPEGRWKNW
HHHHHHCCCCHHHHHHHCCEEEEEEECCCCCCCCC
>Mature Secondary Structure
MMSIQSIITKETLKKKDTNIEIQEKNMNDLVESASRVIAPLWPIATFAAHHPWMGLEKQS
CCCHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHH
FEQVADWLKEARNVDIYPSASMIHSAKAKGEIEESFLQSGLSRWLDSQSFHIPRKKVEQF
HHHHHHHHHHHCCCCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHH
CQAALKLEELPSSLLSSPQLNKLAEEMSYINTESMKDSFLQPVSSFIENQKGENLSDILN
HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHH
YHIIKWCKLYLDDSGSSWTMPNREQGLYRAWHHLIKFDPALSKNERSVLKDWPEDAEIAL
HHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHCCCCCHHHHH
TRALSELGISESNKQAYLEGHLLALPGWAGMILWRSQQSTQEQELLIQYLAVRISMELAI
HHHHHHHCCCCCCCCEEECCEEEEECCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHEEE
VKPYLPIKNQKAEKKIAIVPLIASWIYWGNISTLKWSQMSAAEQSELLAFAYRFDENIRR
ECCCCCCCCCCCCCCEEHHHHHHHHHHHCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHH
KLWLEAWEQTHAEQLKKKISSKQRATNDKKRALAQLAFCIDVRSEPFRRHLEKLGPFETF
HHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCHHH
GIAGFFGLPIATSELGSSDSHPSLPVILKPKHQIKELTDENEFKNYQQRKKIDSSVSYTF
CCCHHHCCCCCHHHCCCCCCCCCCCEEECCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHH
KTMKQNVLTSMLLPEVSGPLLGLQMVTRSFVPRRVGSFLRNLRKTMLQKPDTTFSLNHVH
HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCC
DTKCEIPIGFTKEEKVNYVRQALKMVGLTEKFAPLVVMCGHSSQSTNNPYAAALECGACG
CCCCCCCCCCCCHHHHHHHHHHHHHHCCHHHCCCEEEEECCCCCCCCCCCEEEEECCCCC
GAAGGFNAKVFATLCNLPEVREALSAEGIKIPEDTIFAAAEHKTTVDELEWIYIPELSET
CCCCCCCHHHHHHHHCCHHHHHHHCCCCCCCCCHHEEEHHCCCCHHHHCCEEECCCHHHH
AQEAFDSIESVMPNVSQHANRERLMQLPNFKTEIKNPSKEAHRFAEDWSEIRPEWGLARN
HHHHHHHHHHHCCCHHHHCCHHHHHHCCCCHHHCCCCHHHHHHHHHHHHHHCCCCCCCCC
ASFIIGQRELTQDCDLEGRAFLHNYDWKQDGSGDILASIIAGPGTVAQWINLQYYASTVA
CEEEECCHHHCCCCCCCCCEEEECCCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHC
PHYYGSGNKTTQTVTAGLGVMQGNASDLLPGLPWQSVMQSDRETYHSPLRLLIVIQAPTK
CCEECCCCCCHHHHHHCCCEECCCHHHCCCCCCHHHHHHHHHHHHCCCEEEEEEEECCHH
YIEHLLNNDFTFREKVQNGWVRLASVDPEGRWKNW
HHHHHHCCCCHHHHHHHCCEEEEEEECCCCCCCCC
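
Each secondary structure block pairs a line of residues with a line of predicted states. The sketch below summarises such a state string as percentages, assuming the conventional three-state code (H = helix, E = strand, C = coil), which this record does not define explicitly. The function name and the toy input are illustrative.

```python
from collections import Counter

def state_percentages(states: str) -> dict:
    """Percentage of H, E and C characters in a secondary structure string."""
    states = states.upper()
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}

# Toy input, not taken from the record: 4 helix, 2 strand, 2 coil positions.
print(state_percentages("CCHHHHEE"))

# Residues and states can be paired position by position.
print(list(zip("MMSIQ", "CCCHH")))
```

To summarise the whole prediction, the state lines of one block would be concatenated and passed to `state_percentages`.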

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA