Definition: Bacteroides thetaiotaomicron VPI-5482 chromosome, complete genome.
Accession: NC_004663
Length: 6,260,361 bp

The map label for this gene is 29349538

Identifier: 29349538

GI number: 29349538

Start: 5415909

End: 5417906

Strand: Reverse

Name: 29349538

Synonym: BT_4130

Alternate gene names: NA

Gene position: 5417906-5415909 (Counterclockwise)

Preceding gene: 29349539

Following gene: 29349537

Centisome position: 86.54
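
The coordinate fields above can be cross-checked against one another. The sketch below assumes inclusive 1-based coordinates and assumes that the centisome position is the gene's first base (the End coordinate for a reverse-strand gene) expressed as a percentage of the chromosome length. The function names are hypothetical and are not part of the source database.

```python
GENOME_LENGTH = 6_260_361          # chromosome length from the record header
START, END = 5_415_909, 5_417_906  # Start and End fields of this record


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length, to 2 decimals."""
    return round(100 * position / genome_length, 2)


print(gene_length(START, END))        # 1998
print(centisome(END, GENOME_LENGTH))  # 86.54
```

Under these assumptions the results agree with the 1998-base gene sequence and the centisome value reported in this record.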

GC content: 49.95%
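
A GC content figure of this kind is usually the percentage of G and C bases in the gene sequence. The helper below is a minimal sketch under that assumption; `gc_content` is a hypothetical name, and the rounding to two decimals is chosen only to match the precision shown in this record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (2 decimals)."""
    seq = "".join(seq.split()).upper()  # drop line breaks from FASTA-style text
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100 * gc / len(seq), 2)


# Toy input; pass the full gene sequence of the record to check its value.
print(gc_content("ATGC"))  # 50.0
```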

Gene sequence:

>1998_bases
ATGAGCGTAGACACTAATAACGCAGCTTTTCAGGACGCCCTGAATCTTATTCAGTATACCCGCCAGTCGGTTTTTCTGAC
CGGAAAAGCGGGTACAGGTAAATCTACATTCCTACGTTATGTCTGCGAACATACCAAGAAGAAACATGTCGTTCTCGCAC
CTACCGGCATCGCAGCCATCAACGCCGGAGGAAGTACGATGCACAGTTTCTTCAAACTCCCCTTCTATCCGTTACTGCCG
GACGATCCAAATTTAAGTCTCCAGAGAGGACGCATTCACGAGTTCTTCAAGTACACCAAACCGCACCGGAAATTACTGGA
ACAGATTGAACTGGTCATCATAGACGAAATCTCTATGGTACGGGCGGACCTCATTGACGCCATCGACCGCATCTTACGTG
TATATTCACATAATTTACGGGAACCTTTCGGCGGCAAACAACTGTTATTGGTAGGCGACGTTTTCCAGCTGGAACCTGTC
GTGAAGAATGACGAGCGGGAGATTCTGAACCGTGCCTACCCTACTCCATACTTCTTCTCGGCAAGAGTATTCAGCCAGAT
CGATCTGGTATCCATCGAACTCCAGAAAGTATACCGGCAAACTGACTCCGTCTTTGTCAGCGTTCTCGACCATATCCGTA
CCAACACCGCCGGAGCAGCCGACCTGCAACTGCTGAACACCCGATACGGAAGCCATATCGAAGAATCGGAAGCCGATATG
TACATCACGCTTGCCACCCGAAGGGATACGGTTGACTCGATCAATGAGAAGAAACTGGCCGAGCTGGCAGGAGAACCGAT
CACCTTTGAGGGAAGCATTGAAGGGGATTTTCCCGAAAGTAGTCTGCCGACCTCACAGGAACTCGTTCTGAAACCGGGTG
CTCAAATCATCTTTATCAAGAATGATTTCGACCGCCGGTGGGTAAACGGTACCATCGGAGTGATTGCCGGTATCGACGAG
GAAGAAGAAACGATATACGTCATCACCGATGACGGCAAGGAATGCGATGTAAAACGGGAATCATGGCGTAACATCCGCTA
TCGCTACAACGAAAAGACGAAAGAGATTGAAGAGGAAGTACTGGGCAGCTTCACCCAATATCCCATTCGGCTGGCTTGGG
CCATTACCGTCCACAAGAGTCAGGGGTTGACTTTCAGCAGAGTAGTGATTGACTTCACGGGAGGTGTATTTGCCGGCGGA
CAGGCGTATGTAGCACTCAGCCGTTGTACCTCACTGGATGGCATCCAGCTCAAAAAGCCGATCAACCGGGCGGATATCTT
TGTGCGTCCCGAAATCGTAAACTTTGCCGGACGGTTCAACGACCGGCAAGCCATCGACAAGGCTCTGAAACAGGCGCAGG
CCGATGTCCAGTATGCCGCTGCCGCACGTGCATTCGACAAAGGGGATATGGAAGAATGCCTGGAACAATTTTTCCGTGCC
ATTCACTCCCGTTATGACATAGAGAAACCTGTTCCCCGCCGATTAATCCGCCGGAAACTGGGGATTATCAACACCTTGCA
GGAGCAGAACAAAAAGCTCAAAGAGCAAATGCGGGAACAGCAGGAACGTCTGCGGCAATATGCCCACGAATATCTGTTAA
TGGGAAATGAATGTATCACCCAAGCCCACGACGCCCGTGCCGCCATCGCCAATTATGACAAAGCGCTCAGCCTCGACCCT
AATTATATAGACGCCTGGATACGGAAAGGAATCACCCTGTTCAACAGCAAAGAGTATTTTGATGCGGAAAACTGTTTCAA
TACCGCTGTCAGCCTTCATCCTGCAAACTTTAAAGCTGTGTACAACCGTGGAAAACTGCGTCTGAAAATCGATAATACTG
AAGGAGCAATTGCCGACCTGGACAAGGCTACAAGTCTGAAACCCGAGCATGCCGGTGCACACGAGCTTTTCGGAGATGCC
CTGTTGAGAGTGGGGAAAGAAGTGGAAGCCGCCATACAATGGAGAATTGCCGAGGAACTCAGAAAGAAAAAATCATAA

Upstream 100 bases:

>100_bases
CCCCCATTGATGAGGACGGCATCAGCAAGGCTATGAAACATTTCGGGATTATCTAATTCATTTATCCGGTCAGCACATCT
GCTTATCCGTCACTAACTTT

Downstream 100 bases:

>100_bases
ATAAGTTGGTAGTGTATCAGAAGTCTTCTGATTGTGAAAGTTATTGAATATCATCTTACATCAAAAACTTTTTATTAACT
TTGCCTTCGTATGTACGTCA

Product: putative helicase

Products: NA

Alternate protein names: Helicase; TPR Domain Protein; TPR Domain-Containing Protein; ATPase; TPR Repeat-Containing Protein; Exonuclease V Subunit Alpha; Helicase-Related Protein; 5'-3' DNA Helicase; AAA ATPase; Glycosyltransferase; HRDC Domain Protein; Helicase-Family Protein; RecD-Like Exodeoxyribonuclease V Alpha Chain; RRM3/PIF1 Helicase-Like Protein; Helicase Protein; ATP-Dependent ExoDNAse Alpha Subunit; DNA Helicase AAA ATPase; DNA Helicase/AAA ATPase; Tetratricopeptide Repeat-Containing Protein; Helicase RecD/TraA Family

Number of amino acids: Translated: 665; Mature: 664

Protein sequence:

>665_residues
MSVDTNNAAFQDALNLIQYTRQSVFLTGKAGTGKSTFLRYVCEHTKKKHVVLAPTGIAAINAGGSTMHSFFKLPFYPLLP
DDPNLSLQRGRIHEFFKYTKPHRKLLEQIELVIIDEISMVRADLIDAIDRILRVYSHNLREPFGGKQLLLVGDVFQLEPV
VKNDEREILNRAYPTPYFFSARVFSQIDLVSIELQKVYRQTDSVFVSVLDHIRTNTAGAADLQLLNTRYGSHIEESEADM
YITLATRRDTVDSINEKKLAELAGEPITFEGSIEGDFPESSLPTSQELVLKPGAQIIFIKNDFDRRWVNGTIGVIAGIDE
EEETIYVITDDGKECDVKRESWRNIRYRYNEKTKEIEEEVLGSFTQYPIRLAWAITVHKSQGLTFSRVVIDFTGGVFAGG
QAYVALSRCTSLDGIQLKKPINRADIFVRPEIVNFAGRFNDRQAIDKALKQAQADVQYAAAARAFDKGDMEECLEQFFRA
IHSRYDIEKPVPRRLIRRKLGIINTLQEQNKKLKEQMREQQERLRQYAHEYLLMGNECITQAHDARAAIANYDKALSLDP
NYIDAWIRKGITLFNSKEYFDAENCFNTAVSLHPANFKAVYNRGKLRLKIDNTEGAIADLDKATSLKPEHAGAHELFGDA
LLRVGKEVEAAIQWRIAEELRKKKS
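
The protein sequence can be related back to the gene sequence by codon translation: 1998 bases give 666 codons, that is 665 residues plus a stop codon. The sketch below assumes the standard codon assignments and translation from the first base of the sequence; `translate` is a hypothetical helper, not a function of the source database.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 21 and last 12 bases of the gene sequence in this record.
print(translate("ATGAGCGTAGACACTAATAAC"))  # MSVDTNN
print(translate("AAAAAATCATAA"))           # KKS
```

Both fragments match the corresponding ends of the protein sequence listed in this record.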

Sequences:

>Translated_665_residues
MSVDTNNAAFQDALNLIQYTRQSVFLTGKAGTGKSTFLRYVCEHTKKKHVVLAPTGIAAINAGGSTMHSFFKLPFYPLLP
DDPNLSLQRGRIHEFFKYTKPHRKLLEQIELVIIDEISMVRADLIDAIDRILRVYSHNLREPFGGKQLLLVGDVFQLEPV
VKNDEREILNRAYPTPYFFSARVFSQIDLVSIELQKVYRQTDSVFVSVLDHIRTNTAGAADLQLLNTRYGSHIEESEADM
YITLATRRDTVDSINEKKLAELAGEPITFEGSIEGDFPESSLPTSQELVLKPGAQIIFIKNDFDRRWVNGTIGVIAGIDE
EEETIYVITDDGKECDVKRESWRNIRYRYNEKTKEIEEEVLGSFTQYPIRLAWAITVHKSQGLTFSRVVIDFTGGVFAGG
QAYVALSRCTSLDGIQLKKPINRADIFVRPEIVNFAGRFNDRQAIDKALKQAQADVQYAAAARAFDKGDMEECLEQFFRA
IHSRYDIEKPVPRRLIRRKLGIINTLQEQNKKLKEQMREQQERLRQYAHEYLLMGNECITQAHDARAAIANYDKALSLDP
NYIDAWIRKGITLFNSKEYFDAENCFNTAVSLHPANFKAVYNRGKLRLKIDNTEGAIADLDKATSLKPEHAGAHELFGDA
LLRVGKEVEAAIQWRIAEELRKKKS
>Mature_664_residues
SVDTNNAAFQDALNLIQYTRQSVFLTGKAGTGKSTFLRYVCEHTKKKHVVLAPTGIAAINAGGSTMHSFFKLPFYPLLPD
DPNLSLQRGRIHEFFKYTKPHRKLLEQIELVIIDEISMVRADLIDAIDRILRVYSHNLREPFGGKQLLLVGDVFQLEPVV
KNDEREILNRAYPTPYFFSARVFSQIDLVSIELQKVYRQTDSVFVSVLDHIRTNTAGAADLQLLNTRYGSHIEESEADMY
ITLATRRDTVDSINEKKLAELAGEPITFEGSIEGDFPESSLPTSQELVLKPGAQIIFIKNDFDRRWVNGTIGVIAGIDEE
EETIYVITDDGKECDVKRESWRNIRYRYNEKTKEIEEEVLGSFTQYPIRLAWAITVHKSQGLTFSRVVIDFTGGVFAGGQ
AYVALSRCTSLDGIQLKKPINRADIFVRPEIVNFAGRFNDRQAIDKALKQAQADVQYAAAARAFDKGDMEECLEQFFRAI
HSRYDIEKPVPRRLIRRKLGIINTLQEQNKKLKEQMREQQERLRQYAHEYLLMGNECITQAHDARAAIANYDKALSLDPN
YIDAWIRKGITLFNSKEYFDAENCFNTAVSLHPANFKAVYNRGKLRLKIDNTEGAIADLDKATSLKPEHAGAHELFGDAL
LRVGKEVEAAIQWRIAEELRKKKS

Specific function: Unknown

COG id: COG0507

COG function: function code L; ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Homo sapiens, GI82546872, Length=403, Percent_Identity=34.4913151364764, Blast_Score=183, Evalue=4e-46,
Organism=Caenorhabditis elegans, GI25143421, Length=406, Percent_Identity=31.5270935960591, Blast_Score=167, Evalue=2e-41,
Organism=Saccharomyces cerevisiae, GI6321820, Length=489, Percent_Identity=28.2208588957055, Blast_Score=141, Evalue=4e-34,
Organism=Saccharomyces cerevisiae, GI6323579, Length=342, Percent_Identity=32.7485380116959, Blast_Score=134, Evalue=5e-32,
Organism=Drosophila melanogaster, GI19920652, Length=439, Percent_Identity=30.751708428246, Blast_Score=166, Evalue=5e-41,
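
Each homologue line is a comma-separated list of `key=value` fields, plus a bare GI identifier. The parser below is a minimal sketch for lines in this layout; `parse_homologue` and the `"GI"` key are hypothetical names introduced for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare GI identifier without a key
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record


line = ("Organism=Homo sapiens, GI82546872, Length=403, "
        "Percent_Identity=34.4913151364764, Blast_Score=183, Evalue=4e-46,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Blast_Score"])
```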

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 75684; Mature: 75553

Theoretical pI: Translated: 7.07; Mature: 7.07

Prosite motif: PS50005 TPR L=RR ; PS50293 TPR_REGION

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
1.1 %Met     (Translated Protein)
2.0 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
0.9 %Met     (Mature Protein)
1.8 %Cys+Met (Mature Protein)
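
Percentages of this kind can be obtained by counting residues in the one-letter protein sequence. The sketch below assumes the values are simple residue counts divided by sequence length and rounded to one decimal; `residue_percent` is a hypothetical helper.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues in a protein (1 decimal)."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(r) for r in residues)
    return round(100 * count / len(protein), 1)


toy = "MCAAAAAAAA"  # toy sequence; use the record's sequences for real values
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 20.0
```

The lower Met percentage for the mature protein is consistent with the mature sequence lacking the initial methionine.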

Secondary structure:

>Translated Secondary Structure
MSVDTNNAAFQDALNLIQYTRQSVFLTGKAGTGKSTFLRYVCEHTKKKHVVLAPTGIAAI
CCCCCCCHHHHHHHHHHHHHHHEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCEEEE
NAGGSTMHSFFKLPFYPLLPDDPNLSLQRGRIHEFFKYTKPHRKLLEQIELVIIDEISMV
ECCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHEEHHHHHH
RADLIDAIDRILRVYSHNLREPFGGKQLLLVGDVFQLEPVVKNDEREILNRAYPTPYFFS
HHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECHHHCCCCCCCCHHHHHHHCCCCCCEEH
ARVFSQIDLVSIELQKVYRQTDSVFVSVLDHIRTNTAGAADLQLLNTRYGSHIEESEADM
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCE
YITLATRRDTVDSINEKKLAELAGEPITFEGSIEGDFPESSLPTSQELVLKPGAQIIFIK
EEEEEECCCCHHHCCHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCEEECCCCEEEEEE
NDFDRRWVNGTIGVIAGIDEEEETIYVITDDGKECDVKRESWRNIRYRYNEKTKEIEEEV
CCCCCHHCCCCEEEEECCCCCCCEEEEEECCCCCCCCCHHHHCCCCEEECCHHHHHHHHH
LGSFTQYPIRLAWAITVHKSQGLTFSRVVIDFTGGVFAGGQAYVALSRCTSLDGIQLKKP
HHHHHCCCEEEEEEEEEECCCCCEEEEEEEEECCCEEECCHHEEEEHHHCCCCCEEECCC
INRADIFVRPEIVNFAGRFNDRQAIDKALKQAQADVQYAAAARAFDKGDMEECLEQFFRA
CCCCCEEECHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH
IHSRYDIEKPVPRRLIRRKLGIINTLQEQNKKLKEQMREQQERLRQYAHEYLLMGNECIT
HHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH
QAHDARAAIANYDKALSLDPNYIDAWIRKGITLFNSKEYFDAENCFNTAVSLHPANFKAV
HHHHHHHHHHCCCCCCCCCCHHHHHHHHCCEEEECCCCCCCHHHHHHHHEEECCCCHHEE
YNRGKLRLKIDNTEGAIADLDKATSLKPEHAGAHELFGDALLRVGKEVEAAIQWRIAEEL
ECCCEEEEEEECCCCCHHHCHHHCCCCCCCCCHHHHHHHHHHHHCHHHHHHHHHHHHHHH
RKKKS
HHCCC
>Mature Secondary Structure 
SVDTNNAAFQDALNLIQYTRQSVFLTGKAGTGKSTFLRYVCEHTKKKHVVLAPTGIAAI
CCCCCCHHHHHHHHHHHHHHHEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCEEEE
NAGGSTMHSFFKLPFYPLLPDDPNLSLQRGRIHEFFKYTKPHRKLLEQIELVIIDEISMV
ECCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHEEHHHHHH
RADLIDAIDRILRVYSHNLREPFGGKQLLLVGDVFQLEPVVKNDEREILNRAYPTPYFFS
HHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECHHHCCCCCCCCHHHHHHHCCCCCCEEH
ARVFSQIDLVSIELQKVYRQTDSVFVSVLDHIRTNTAGAADLQLLNTRYGSHIEESEADM
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCE
YITLATRRDTVDSINEKKLAELAGEPITFEGSIEGDFPESSLPTSQELVLKPGAQIIFIK
EEEEEECCCCHHHCCHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCEEECCCCEEEEEE
NDFDRRWVNGTIGVIAGIDEEEETIYVITDDGKECDVKRESWRNIRYRYNEKTKEIEEEV
CCCCCHHCCCCEEEEECCCCCCCEEEEEECCCCCCCCCHHHHCCCCEEECCHHHHHHHHH
LGSFTQYPIRLAWAITVHKSQGLTFSRVVIDFTGGVFAGGQAYVALSRCTSLDGIQLKKP
HHHHHCCCEEEEEEEEEECCCCCEEEEEEEEECCCEEECCHHEEEEHHHCCCCCEEECCC
INRADIFVRPEIVNFAGRFNDRQAIDKALKQAQADVQYAAAARAFDKGDMEECLEQFFRA
CCCCCEEECHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH
IHSRYDIEKPVPRRLIRRKLGIINTLQEQNKKLKEQMREQQERLRQYAHEYLLMGNECIT
HHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH
QAHDARAAIANYDKALSLDPNYIDAWIRKGITLFNSKEYFDAENCFNTAVSLHPANFKAV
HHHHHHHHHHCCCCCCCCCCHHHHHHHHCCEEEECCCCCCCHHHHHHHHEEECCCCHHEE
YNRGKLRLKIDNTEGAIADLDKATSLKPEHAGAHELFGDALLRVGKEVEAAIQWRIAEEL
ECCCEEEEEEECCCCCHHHCHHHCCCCCCCCCHHHHHHHHHHHHCHHHHHHHHHHHHHHH
RKKKS
HHCCC
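
The secondary structure strings pair each residue with a one-letter state. The sketch below summarizes such a string, assuming the common convention that H is helix, E is strand and C is coil; the record itself does not define the letters, and `ss_composition` is a hypothetical helper.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary structure string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty string")
    counts = Counter(states)
    return {s: round(100 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


# Toy input; concatenate the state lines of the record for real values.
print(ss_composition("HHEC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```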

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA