| Definition | Bacteroides thetaiotaomicron VPI-5482 chromosome, complete genome. |
|---|---|
| Accession | NC_004663 |
| Length | 6,260,361 bp |
The map label for this gene is 29349538
Identifier: 29349538
GI number: 29349538
Start: 5415909
End: 5417906
Strand: Reverse
Name: 29349538
Synonym: BT_4130
Alternate gene names: NA
Gene position: 5417906-5415909 (Counterclockwise)
Preceding gene: 29349539
Following gene: 29349537
Centisome position: 86.54
GC content: 49.95
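The coordinate fields above are related by simple arithmetic. The following is a minimal Python sketch, not the database's own code. It assumes 1-based inclusive coordinates, and it assumes the centisome position is the gene's first base on its own strand (5417906 here, because the gene lies on the reverse strand) expressed as a percentage of the chromosome length. The function names are illustrative.

```python
# Values copied from the record above.
GENOME_LENGTH = 6_260_361          # chromosome length in bp
START, END = 5_415_909, 5_417_906  # Start / End fields (forward-strand coordinates)


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the chromosome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


if __name__ == "__main__":
    # The gene is on the reverse strand, so its first base is END (assumption).
    print(gene_length(START, END))        # 1998
    print(centisome(END, GENOME_LENGTH))  # 86.54
```

Under these assumptions the sketch reproduces both the 1998-base gene length and the listed centisome position of 86.54.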
Gene sequence:
>1998_bases ATGAGCGTAGACACTAATAACGCAGCTTTTCAGGACGCCCTGAATCTTATTCAGTATACCCGCCAGTCGGTTTTTCTGAC CGGAAAAGCGGGTACAGGTAAATCTACATTCCTACGTTATGTCTGCGAACATACCAAGAAGAAACATGTCGTTCTCGCAC CTACCGGCATCGCAGCCATCAACGCCGGAGGAAGTACGATGCACAGTTTCTTCAAACTCCCCTTCTATCCGTTACTGCCG GACGATCCAAATTTAAGTCTCCAGAGAGGACGCATTCACGAGTTCTTCAAGTACACCAAACCGCACCGGAAATTACTGGA ACAGATTGAACTGGTCATCATAGACGAAATCTCTATGGTACGGGCGGACCTCATTGACGCCATCGACCGCATCTTACGTG TATATTCACATAATTTACGGGAACCTTTCGGCGGCAAACAACTGTTATTGGTAGGCGACGTTTTCCAGCTGGAACCTGTC GTGAAGAATGACGAGCGGGAGATTCTGAACCGTGCCTACCCTACTCCATACTTCTTCTCGGCAAGAGTATTCAGCCAGAT CGATCTGGTATCCATCGAACTCCAGAAAGTATACCGGCAAACTGACTCCGTCTTTGTCAGCGTTCTCGACCATATCCGTA CCAACACCGCCGGAGCAGCCGACCTGCAACTGCTGAACACCCGATACGGAAGCCATATCGAAGAATCGGAAGCCGATATG TACATCACGCTTGCCACCCGAAGGGATACGGTTGACTCGATCAATGAGAAGAAACTGGCCGAGCTGGCAGGAGAACCGAT CACCTTTGAGGGAAGCATTGAAGGGGATTTTCCCGAAAGTAGTCTGCCGACCTCACAGGAACTCGTTCTGAAACCGGGTG CTCAAATCATCTTTATCAAGAATGATTTCGACCGCCGGTGGGTAAACGGTACCATCGGAGTGATTGCCGGTATCGACGAG GAAGAAGAAACGATATACGTCATCACCGATGACGGCAAGGAATGCGATGTAAAACGGGAATCATGGCGTAACATCCGCTA TCGCTACAACGAAAAGACGAAAGAGATTGAAGAGGAAGTACTGGGCAGCTTCACCCAATATCCCATTCGGCTGGCTTGGG CCATTACCGTCCACAAGAGTCAGGGGTTGACTTTCAGCAGAGTAGTGATTGACTTCACGGGAGGTGTATTTGCCGGCGGA CAGGCGTATGTAGCACTCAGCCGTTGTACCTCACTGGATGGCATCCAGCTCAAAAAGCCGATCAACCGGGCGGATATCTT TGTGCGTCCCGAAATCGTAAACTTTGCCGGACGGTTCAACGACCGGCAAGCCATCGACAAGGCTCTGAAACAGGCGCAGG CCGATGTCCAGTATGCCGCTGCCGCACGTGCATTCGACAAAGGGGATATGGAAGAATGCCTGGAACAATTTTTCCGTGCC ATTCACTCCCGTTATGACATAGAGAAACCTGTTCCCCGCCGATTAATCCGCCGGAAACTGGGGATTATCAACACCTTGCA GGAGCAGAACAAAAAGCTCAAAGAGCAAATGCGGGAACAGCAGGAACGTCTGCGGCAATATGCCCACGAATATCTGTTAA TGGGAAATGAATGTATCACCCAAGCCCACGACGCCCGTGCCGCCATCGCCAATTATGACAAAGCGCTCAGCCTCGACCCT AATTATATAGACGCCTGGATACGGAAAGGAATCACCCTGTTCAACAGCAAAGAGTATTTTGATGCGGAAAACTGTTTCAA TACCGCTGTCAGCCTTCATCCTGCAAACTTTAAAGCTGTGTACAACCGTGGAAAACTGCGTCTGAAAATCGATAATACTG AAGGAGCAATTGCCGACCTGGACAAGGCTACAAGTCTGAAACCCGAGCATGCCGGTGCACACGAGCTTTTCGGAGATGCC 
CTGTTGAGAGTGGGGAAAGAAGTGGAAGCCGCCATACAATGGAGAATTGCCGAGGAACTCAGAAAGAAAAAATCATAA
Upstream 100 bases:
>100_bases CCCCCATTGATGAGGACGGCATCAGCAAGGCTATGAAACATTTCGGGATTATCTAATTCATTTATCCGGTCAGCACATCT GCTTATCCGTCACTAACTTT
Downstream 100 bases:
>100_bases ATAAGTTGGTAGTGTATCAGAAGTCTTCTGATTGTGAAAGTTATTGAATATCATCTTACATCAAAAACTTTTTATTAACT TTGCCTTCGTATGTACGTCA
Product: putative helicase
Products: NA
Alternate protein names: Helicase; TPR Domain Protein; TPR Domain-Containing Protein; ATPase; TPR Repeat-Containing Protein; Exonuclease V Subunit Alpha; Helicase-Related Protein; 5'-3' DNA Helicase; AAA ATPase; Glycosyltransferase; HRDC Domain Protein; Helicase-Family Protein; RecD-Like Exodeoxyribonuclease V Alpha Chain; RRM3/PIF1 Helicase-Like Protein; Helicase Protein; ATP-Dependent ExoDNAse Alpha Subunit; DNA Helicase/AAA ATPase; Tetratricopeptide Repeat-Containing Protein; Helicase RecD/TraA Family
Number of amino acids: Translated: 665; Mature: 664
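The two residue counts follow from the 1998-base gene sequence. As a rough sketch, assuming the coding sequence ends in a single stop codon (the listed sequence ends in TAA) and that the mature form differs only by removal of the initiator methionine (the mature sequence in this record starts at the second residue), the counts can be derived as follows. The function name is hypothetical.

```python
def protein_lengths(n_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the sequence includes exactly one terminal stop codon and that
    the mature protein only loses the initiator methionine.
    """
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = n_bases // 3
    translated = codons - 1  # the stop codon encodes no residue
    mature = translated - 1  # initiator Met removed (assumption)
    return translated, mature


if __name__ == "__main__":
    print(protein_lengths(1998))  # (665, 664)
```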
Protein sequence:
>665_residues MSVDTNNAAFQDALNLIQYTRQSVFLTGKAGTGKSTFLRYVCEHTKKKHVVLAPTGIAAINAGGSTMHSFFKLPFYPLLP DDPNLSLQRGRIHEFFKYTKPHRKLLEQIELVIIDEISMVRADLIDAIDRILRVYSHNLREPFGGKQLLLVGDVFQLEPV VKNDEREILNRAYPTPYFFSARVFSQIDLVSIELQKVYRQTDSVFVSVLDHIRTNTAGAADLQLLNTRYGSHIEESEADM YITLATRRDTVDSINEKKLAELAGEPITFEGSIEGDFPESSLPTSQELVLKPGAQIIFIKNDFDRRWVNGTIGVIAGIDE EEETIYVITDDGKECDVKRESWRNIRYRYNEKTKEIEEEVLGSFTQYPIRLAWAITVHKSQGLTFSRVVIDFTGGVFAGG QAYVALSRCTSLDGIQLKKPINRADIFVRPEIVNFAGRFNDRQAIDKALKQAQADVQYAAAARAFDKGDMEECLEQFFRA IHSRYDIEKPVPRRLIRRKLGIINTLQEQNKKLKEQMREQQERLRQYAHEYLLMGNECITQAHDARAAIANYDKALSLDP NYIDAWIRKGITLFNSKEYFDAENCFNTAVSLHPANFKAVYNRGKLRLKIDNTEGAIADLDKATSLKPEHAGAHELFGDA LLRVGKEVEAAIQWRIAEELRKKKS
Sequences:
>Translated_665_residues MSVDTNNAAFQDALNLIQYTRQSVFLTGKAGTGKSTFLRYVCEHTKKKHVVLAPTGIAAINAGGSTMHSFFKLPFYPLLP DDPNLSLQRGRIHEFFKYTKPHRKLLEQIELVIIDEISMVRADLIDAIDRILRVYSHNLREPFGGKQLLLVGDVFQLEPV VKNDEREILNRAYPTPYFFSARVFSQIDLVSIELQKVYRQTDSVFVSVLDHIRTNTAGAADLQLLNTRYGSHIEESEADM YITLATRRDTVDSINEKKLAELAGEPITFEGSIEGDFPESSLPTSQELVLKPGAQIIFIKNDFDRRWVNGTIGVIAGIDE EEETIYVITDDGKECDVKRESWRNIRYRYNEKTKEIEEEVLGSFTQYPIRLAWAITVHKSQGLTFSRVVIDFTGGVFAGG QAYVALSRCTSLDGIQLKKPINRADIFVRPEIVNFAGRFNDRQAIDKALKQAQADVQYAAAARAFDKGDMEECLEQFFRA IHSRYDIEKPVPRRLIRRKLGIINTLQEQNKKLKEQMREQQERLRQYAHEYLLMGNECITQAHDARAAIANYDKALSLDP NYIDAWIRKGITLFNSKEYFDAENCFNTAVSLHPANFKAVYNRGKLRLKIDNTEGAIADLDKATSLKPEHAGAHELFGDA LLRVGKEVEAAIQWRIAEELRKKKS >Mature_664_residues SVDTNNAAFQDALNLIQYTRQSVFLTGKAGTGKSTFLRYVCEHTKKKHVVLAPTGIAAINAGGSTMHSFFKLPFYPLLPD DPNLSLQRGRIHEFFKYTKPHRKLLEQIELVIIDEISMVRADLIDAIDRILRVYSHNLREPFGGKQLLLVGDVFQLEPVV KNDEREILNRAYPTPYFFSARVFSQIDLVSIELQKVYRQTDSVFVSVLDHIRTNTAGAADLQLLNTRYGSHIEESEADMY ITLATRRDTVDSINEKKLAELAGEPITFEGSIEGDFPESSLPTSQELVLKPGAQIIFIKNDFDRRWVNGTIGVIAGIDEE EETIYVITDDGKECDVKRESWRNIRYRYNEKTKEIEEEVLGSFTQYPIRLAWAITVHKSQGLTFSRVVIDFTGGVFAGGQ AYVALSRCTSLDGIQLKKPINRADIFVRPEIVNFAGRFNDRQAIDKALKQAQADVQYAAAARAFDKGDMEECLEQFFRAI HSRYDIEKPVPRRLIRRKLGIINTLQEQNKKLKEQMREQQERLRQYAHEYLLMGNECITQAHDARAAIANYDKALSLDPN YIDAWIRKGITLFNSKEYFDAENCFNTAVSLHPANFKAVYNRGKLRLKIDNTEGAIADLDKATSLKPEHAGAHELFGDAL LRVGKEVEAAIQWRIAEELRKKKS
Specific function: Unknown
COG id: COG0507
COG function: function code L; ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism=Homo sapiens, GI82546872, Length=403, Percent_Identity=34.4913151364764, Blast_Score=183, Evalue=4e-46
Organism=Caenorhabditis elegans, GI25143421, Length=406, Percent_Identity=31.5270935960591, Blast_Score=167, Evalue=2e-41
Organism=Saccharomyces cerevisiae, GI6321820, Length=489, Percent_Identity=28.2208588957055, Blast_Score=141, Evalue=4e-34
Organism=Saccharomyces cerevisiae, GI6323579, Length=342, Percent_Identity=32.7485380116959, Blast_Score=134, Evalue=5e-32
Organism=Drosophila melanogaster, GI19920652, Length=439, Percent_Identity=30.751708428246, Blast_Score=166, Evalue=5e-41
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight (Da): Translated: 75684; Mature: 75553
Theoretical pI: Translated: 7.07; Mature: 7.07
Prosite motif: PS50005 TPR L=RR ; PS50293 TPR_REGION
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.9 %Cys (Translated Protein)
1.1 %Met (Translated Protein)
2.0 %Cys+Met (Translated Protein)
0.9 %Cys (Mature Protein)
0.9 %Met (Mature Protein)
1.8 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MSVDTNNAAFQDALNLIQYTRQSVFLTGKAGTGKSTFLRYVCEHTKKKHVVLAPTGIAAI CCCCCCCHHHHHHHHHHHHHHHEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCEEEE NAGGSTMHSFFKLPFYPLLPDDPNLSLQRGRIHEFFKYTKPHRKLLEQIELVIIDEISMV ECCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHEEHHHHHH RADLIDAIDRILRVYSHNLREPFGGKQLLLVGDVFQLEPVVKNDEREILNRAYPTPYFFS HHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECHHHCCCCCCCCHHHHHHHCCCCCCEEH ARVFSQIDLVSIELQKVYRQTDSVFVSVLDHIRTNTAGAADLQLLNTRYGSHIEESEADM HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCE YITLATRRDTVDSINEKKLAELAGEPITFEGSIEGDFPESSLPTSQELVLKPGAQIIFIK EEEEEECCCCHHHCCHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCEEECCCCEEEEEE NDFDRRWVNGTIGVIAGIDEEEETIYVITDDGKECDVKRESWRNIRYRYNEKTKEIEEEV CCCCCHHCCCCEEEEECCCCCCCEEEEEECCCCCCCCCHHHHCCCCEEECCHHHHHHHHH LGSFTQYPIRLAWAITVHKSQGLTFSRVVIDFTGGVFAGGQAYVALSRCTSLDGIQLKKP HHHHHCCCEEEEEEEEEECCCCCEEEEEEEEECCCEEECCHHEEEEHHHCCCCCEEECCC INRADIFVRPEIVNFAGRFNDRQAIDKALKQAQADVQYAAAARAFDKGDMEECLEQFFRA CCCCCEEECHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH IHSRYDIEKPVPRRLIRRKLGIINTLQEQNKKLKEQMREQQERLRQYAHEYLLMGNECIT HHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH QAHDARAAIANYDKALSLDPNYIDAWIRKGITLFNSKEYFDAENCFNTAVSLHPANFKAV HHHHHHHHHHCCCCCCCCCCHHHHHHHHCCEEEECCCCCCCHHHHHHHHEEECCCCHHEE YNRGKLRLKIDNTEGAIADLDKATSLKPEHAGAHELFGDALLRVGKEVEAAIQWRIAEEL ECCCEEEEEEECCCCCHHHCHHHCCCCCCCCCHHHHHHHHHHHHCHHHHHHHHHHHHHHH RKKKS HHCCC >Mature Secondary Structure SVDTNNAAFQDALNLIQYTRQSVFLTGKAGTGKSTFLRYVCEHTKKKHVVLAPTGIAAI CCCCCCHHHHHHHHHHHHHHHEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCEEEE NAGGSTMHSFFKLPFYPLLPDDPNLSLQRGRIHEFFKYTKPHRKLLEQIELVIIDEISMV ECCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHEEHHHHHH RADLIDAIDRILRVYSHNLREPFGGKQLLLVGDVFQLEPVVKNDEREILNRAYPTPYFFS HHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECHHHCCCCCCCCHHHHHHHCCCCCCEEH ARVFSQIDLVSIELQKVYRQTDSVFVSVLDHIRTNTAGAADLQLLNTRYGSHIEESEADM HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCE YITLATRRDTVDSINEKKLAELAGEPITFEGSIEGDFPESSLPTSQELVLKPGAQIIFIK 
EEEEEECCCCHHHCCHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCEEECCCCEEEEEE NDFDRRWVNGTIGVIAGIDEEEETIYVITDDGKECDVKRESWRNIRYRYNEKTKEIEEEV CCCCCHHCCCCEEEEECCCCCCCEEEEEECCCCCCCCCHHHHCCCCEEECCHHHHHHHHH LGSFTQYPIRLAWAITVHKSQGLTFSRVVIDFTGGVFAGGQAYVALSRCTSLDGIQLKKP HHHHHCCCEEEEEEEEEECCCCCEEEEEEEEECCCEEECCHHEEEEHHHCCCCCEEECCC INRADIFVRPEIVNFAGRFNDRQAIDKALKQAQADVQYAAAARAFDKGDMEECLEQFFRA CCCCCEEECHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH IHSRYDIEKPVPRRLIRRKLGIINTLQEQNKKLKEQMREQQERLRQYAHEYLLMGNECIT HHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH QAHDARAAIANYDKALSLDPNYIDAWIRKGITLFNSKEYFDAENCFNTAVSLHPANFKAV HHHHHHHHHHCCCCCCCCCCHHHHHHHHCCEEEECCCCCCCHHHHHHHHEEECCCCHHEE YNRGKLRLKIDNTEGAIADLDKATSLKPEHAGAHELFGDALLRVGKEVEAAIQWRIAEEL ECCCEEEEEEECCCCCHHHCHHHCCCCCCCCCHHHHHHHHHHHHCHHHHHHHHHHHHHHH RKKKS HHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA