Definition Hyperthermus butylicus DSM 5456 chromosome, complete genome.
Accession NC_008818
Length 1,667,163

The map label for this gene is 124027294

Identifier: 124027294

GI number: 124027294

Start: 387034

End: 388392

Strand: Reverse

Name: 124027294

Synonym: Hbut_0402

Alternate gene names: NA

Gene position: 388392-387034 (Counterclockwise)

Preceding gene: 124027295

Following gene: 124027293

Centisome position: 23.3
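
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and the centisome formula (position as a percentage of genome length, taken at the End coordinate because the gene lies on the reverse strand) is an assumption that happens to reproduce the listed value, not a documented definition from this record.

```python
# Values copied from the record; the formulas are assumptions.
GENOME_LENGTH = 1_667_163
START, END = 387_034, 388_392


def gene_length(start: int, end: int) -> int:
    """Length of a gene given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length (assumed definition)."""
    return round(100.0 * position / genome_length, 1)


length = gene_length(START, END)
codons = length // 3  # includes the stop codon

print(length)                         # bases in the gene
print(codons)                         # codons, stop codon included
print(centisome(END, GENOME_LENGTH))  # percent of the way around the chromosome
```

Under these assumptions the gene spans 1359 bases, which is 453 codons: 452 amino acids plus one stop codon, in agreement with the sequence headers further down.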

GC content: 51.88
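
GC content is conventionally the percentage of G and C bases in a sequence. A minimal sketch of that calculation follows; the helper name `gc_content` is hypothetical, and whether the record's figure was computed over the gene sequence alone is an assumption.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, rounded to two decimals.

    Whitespace and line breaks are ignored, so a multi-line sequence
    block can be passed in directly (without its '>' header line).
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return round(100.0 * gc / len(bases), 2)


# Toy example, not the gene sequence from this record.
print(gc_content("ATGGCTGTCA\nTTAGG"))
```

If the listed 51.88 refers to the 1359-base gene sequence, it would correspond to 705 G or C bases (705 / 1359 is about 51.88%).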

Gene sequence:

>1359_bases
ATGGCTGTCATTAGGGAGGTTAAGCCGGCTAGGGAGATGCGTAGGATAGGCTCGCACAGTCATATACGCGGTCTGGGCCT
CGACGAGAAGGGTCGTGCAAAGTTCATTGCTGACGGTATGGTTGGCCAGGTTGAAGCACGCGAAGCAGCGGGCATAGTGG
TGCAAATGATCAAGGAGGGCAGAATGGCTGGTCGCGGGGTCCTCATAGTGGGCCCTAGCGGTACCGGTAAGACGGCAATA
GCAGTGGGTATCGCCAAGGAGCTCGGCGAGGACACACCGTTCGTTGCAATGTCTGGCTCCGAGATCTATAGTAGTGAGCT
AAAGAAAACTGAGGTACTAATGCAGGCTATTAGGAAGGCTATCGGTGTAAGGATAAAGGTGCACAAGGATGTCTATGAGG
GTGTGGTAACGAGGATACGTATAGCGTACGTAAAGCATCCATTCAACCCGTACGTAAGAGTGCCCAGCGAGGCCGAGATA
ACGCTCGAAACCAGGGATGACAGTAGGACGCTAAGAGTAGGAGAGGAAGTTGCAGCCCAGCTCATACAGCTAAGAGTGCG
TAAAGGCGACGTAATATGGATTGATGCTGAGACAGGCGAGGTATACAAGGTTGGCAGGGCATGCGAGAAGGAGAGTAAGC
GCTACGACGTATCCTACTTCCGCTGCGTGGACATACCCGACGGCCCGGTGAGGAAGAGGAAGGAGATAGTTCATACACTA
ACCCTTCATGACCTAGATGTCGCCTATGCTGCTCAGCGTACAGCTTTTGCAACACTACTCGGCATGCCAGCTACCAGGGA
GATACCTAGCGAGGTTAGGCAGCGTGTGGACGAGGAAGTTAAGAAGATGATTAATGAGGGTAGGGCAGAGCTTGTGCCCG
GCGTACTATTCATAGATGACGCCCATATGTTGGACATAGAGGCCTTTAGCTTCCTAACGAGGGCCATGGAGAGCGAGCTA
GCACCAATACTCGTACTTGCAACAAACCGTGGTGTTACGAAGATACGTGGCACAGACATAGAGTCACCTCATGGCATACC
GCTAGACCTCCTCGATAGACTGCTAATCATTAAGACTAGGCCATACAAGGCGGAGGAGATACGTGAGATTCTACGTATAA
GGGCTGATGAGGAGGAAATACCGTTAACCGAGGAGGCGCTAGAGGAGCTAACAAAGCTCGGTGTTGAGAGGAGCCTCCGC
TACGCGGTACAGTTAATGGAGCCGGCAAGGATAATAGCTGAACGTGAGGGCCGTAATAAGGTTACAGCTGAGGATGTGAA
AAAGGCTGCAGAATACTTCGTTGACGTGAGGGAGAGCATTAGGTACATCCGAGAACTTGAAGAGGAGTTCCTTAAGTAG

Upstream 100 bases:

>100_bases
GAGTTCTATGCCGCTGATGATGCTTCGCGGACTGAGCCCCTTTAGGGTAAGGTTTTTGTAAAGCGCCCCTTTCGCGGCCT
TTATATCGGGTGGTATTGGT

Downstream 100 bases:

>100_bases
CGCTAAGCCCCGGTTTCCTGTGTGTCTTATGCTGAGCGGAGCGGAAGGCTAGTTTTATGCAACCCCTAGCCGTCACTTTA
TACTTCTGGAACACCCACCC

Product: RuvB-like 2

Products: NA

Alternate protein names: TIP49 Domain Protein; TIP49-Like Protein; TIP49 Domain-Containing Protein; Tbp-Interacting Protein Tip; DNA Helicase; TIP49-Like; TBP-Interacting Protein; TIP49 C-Terminal Domain Family Protein; DNA Helicase TIP

Number of amino acids: Translated: 452; Mature: 451

Protein sequence:

>452_residues
MAVIREVKPAREMRRIGSHSHIRGLGLDEKGRAKFIADGMVGQVEAREAAGIVVQMIKEGRMAGRGVLIVGPSGTGKTAI
AVGIAKELGEDTPFVAMSGSEIYSSELKKTEVLMQAIRKAIGVRIKVHKDVYEGVVTRIRIAYVKHPFNPYVRVPSEAEI
TLETRDDSRTLRVGEEVAAQLIQLRVRKGDVIWIDAETGEVYKVGRACEKESKRYDVSYFRCVDIPDGPVRKRKEIVHTL
TLHDLDVAYAAQRTAFATLLGMPATREIPSEVRQRVDEEVKKMINEGRAELVPGVLFIDDAHMLDIEAFSFLTRAMESEL
APILVLATNRGVTKIRGTDIESPHGIPLDLLDRLLIIKTRPYKAEEIREILRIRADEEEIPLTEEALEELTKLGVERSLR
YAVQLMEPARIIAEREGRNKVTAEDVKKAAEYFVDVRESIRYIRELEEEFLK

Sequences:

>Translated_452_residues
MAVIREVKPAREMRRIGSHSHIRGLGLDEKGRAKFIADGMVGQVEAREAAGIVVQMIKEGRMAGRGVLIVGPSGTGKTAI
AVGIAKELGEDTPFVAMSGSEIYSSELKKTEVLMQAIRKAIGVRIKVHKDVYEGVVTRIRIAYVKHPFNPYVRVPSEAEI
TLETRDDSRTLRVGEEVAAQLIQLRVRKGDVIWIDAETGEVYKVGRACEKESKRYDVSYFRCVDIPDGPVRKRKEIVHTL
TLHDLDVAYAAQRTAFATLLGMPATREIPSEVRQRVDEEVKKMINEGRAELVPGVLFIDDAHMLDIEAFSFLTRAMESEL
APILVLATNRGVTKIRGTDIESPHGIPLDLLDRLLIIKTRPYKAEEIREILRIRADEEEIPLTEEALEELTKLGVERSLR
YAVQLMEPARIIAEREGRNKVTAEDVKKAAEYFVDVRESIRYIRELEEEFLK
>Mature_451_residues
AVIREVKPAREMRRIGSHSHIRGLGLDEKGRAKFIADGMVGQVEAREAAGIVVQMIKEGRMAGRGVLIVGPSGTGKTAIA
VGIAKELGEDTPFVAMSGSEIYSSELKKTEVLMQAIRKAIGVRIKVHKDVYEGVVTRIRIAYVKHPFNPYVRVPSEAEIT
LETRDDSRTLRVGEEVAAQLIQLRVRKGDVIWIDAETGEVYKVGRACEKESKRYDVSYFRCVDIPDGPVRKRKEIVHTLT
LHDLDVAYAAQRTAFATLLGMPATREIPSEVRQRVDEEVKKMINEGRAELVPGVLFIDDAHMLDIEAFSFLTRAMESELA
PILVLATNRGVTKIRGTDIESPHGIPLDLLDRLLIIKTRPYKAEEIREILRIRADEEEIPLTEEALEELTKLGVERSLRY
AVQLMEPARIIAEREGRNKVTAEDVKKAAEYFVDVRESIRYIRELEEEFLK

Specific function: Unknown

COG id: COG1224

COG function: function code K; DNA helicase TIP49, TBP-interacting protein

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Homo sapiens, GI4506753, Length=455, Percent_Identity=46.37, Blast_Score=413, Evalue=1e-115
Organism=Homo sapiens, GI5730023, Length=449, Percent_Identity=46.77, Blast_Score=411, Evalue=1e-115
Organism=Caenorhabditis elegans, GI17542510, Length=449, Percent_Identity=43.88, Blast_Score=367, Evalue=1e-102
Organism=Caenorhabditis elegans, GI17558290, Length=454, Percent_Identity=42.07, Blast_Score=336, Evalue=1e-92
Organism=Saccharomyces cerevisiae, GI6325021, Length=449, Percent_Identity=45.43, Blast_Score=395, Evalue=1e-111
Organism=Saccharomyces cerevisiae, GI6320396, Length=443, Percent_Identity=44.92, Blast_Score=376, Evalue=1e-105
Organism=Drosophila melanogaster, GI17737635, Length=448, Percent_Identity=46.43, Blast_Score=404, Evalue=1e-113
Organism=Drosophila melanogaster, GI21358125, Length=454, Percent_Identity=45.15, Blast_Score=367, Evalue=1e-102
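
Each homologue entry is a comma-separated list of `key=value` fields plus a bare GI identifier. The following sketch shows one way such a line might be parsed into a dictionary; the function name `parse_homologue` and the choice of output types are assumptions made for illustration, not part of the record.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields.

    Tolerates an optional trailing comma at the end of the line.
    """
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            # The GI number has no key in the source line.
            record["GI"] = field[2:]
    casts = (("Length", int), ("Percent_Identity", float),
             ("Blast_Score", int), ("Evalue", float))
    for key, cast in casts:
        if key in record:
            record[key] = cast(record[key])
    return record


example = ("Organism=Homo sapiens, GI4506753, Length=455, "
           "Percent_Identity=46.3736263736264, Blast_Score=413, Evalue=1e-115,")
hit = parse_homologue(example)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 2))
```

Parsed this way, the hits can be sorted or filtered by `Blast_Score` or `Evalue` without further string handling.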

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 50890; Mature: 50758

Theoretical pI: Translated: 6.75; Mature: 6.75

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
2.7 %Met     (Translated Protein)
3.1 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
2.4 %Met     (Mature Protein)
2.9 %Cys+Met (Mature Protein)
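
The percentages above follow from simple residue counts. As a rough check, the sketch below counts C and M in the translated sequence given earlier in this record and in the mature form (taken here to be the translated sequence without its initial Met, as the sequence listings suggest). The helper name `residue_percent` is hypothetical.

```python
# Translated protein sequence copied from the record above.
TRANSLATED = (
    "MAVIREVKPAREMRRIGSHSHIRGLGLDEKGRAKFIADGMVGQVEAREAAGIVVQMIKEGRMAGRGVLIVGPSGTGKTAI"
    "AVGIAKELGEDTPFVAMSGSEIYSSELKKTEVLMQAIRKAIGVRIKVHKDVYEGVVTRIRIAYVKHPFNPYVRVPSEAEI"
    "TLETRDDSRTLRVGEEVAAQLIQLRVRKGDVIWIDAETGEVYKVGRACEKESKRYDVSYFRCVDIPDGPVRKRKEIVHTL"
    "TLHDLDVAYAAQRTAFATLLGMPATREIPSEVRQRVDEEVKKMINEGRAELVPGVLFIDDAHMLDIEAFSFLTRAMESEL"
    "APILVLATNRGVTKIRGTDIESPHGIPLDLLDRLLIIKTRPYKAEEIREILRIRADEEEIPLTEEALEELTKLGVERSLR"
    "YAVQLMEPARIIAEREGRNKVTAEDVKKAAEYFVDVRESIRYIRELEEEFLK"
)
MATURE = TRANSLATED[1:]  # assumption: mature form lacks the initial Met


def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    hits = sum(sequence.count(r) for r in residues)
    return round(100.0 * hits / len(sequence), 1)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(label,
          residue_percent(seq, "C"),
          residue_percent(seq, "M"),
          residue_percent(seq, "CM"))
```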

Secondary structure:

>Translated Secondary Structure
MAVIREVKPAREMRRIGSHSHIRGLGLDEKGRAKFIADGMVGQVEAREAAGIVVQMIKEG
CCCHHHCCHHHHHHHCCCCCCEEECCCCCCCCCEEEECCCCCCCHHHHHHHHHHHHHHHC
RMAGRGVLIVGPSGTGKTAIAVGIAKELGEDTPFVAMSGSEIYSSELKKTEVLMQAIRKA
CCCCCEEEEECCCCCCCHHHEEHHHHHCCCCCCEEEECCCHHHHHHHHHHHHHHHHHHHH
IGVRIKVHKDVYEGVVTRIRIAYVKHPFNPYVRVPSEAEITLETRDDSRTLRVGEEVAAQ
HCEEEEEHHHHHHHHHHHHHHHEEECCCCCCEECCCCCEEEEEECCCCCHHHHHHHHHHH
LIQLRVRKGDVIWIDAETGEVYKVGRACEKESKRYDVSYFRCVDIPDGPVRKRKEIVHTL
HHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCCCCEEEEEEECCCCCHHHHHHHHHHH
TLHDLDVAYAAQRTAFATLLGMPATREIPSEVRQRVDEEVKKMINEGRAELVPGVLFIDD
HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHCCCEEEECC
AHMLDIEAFSFLTRAMESELAPILVLATNRGVTKIRGTDIESPHGIPLDLLDRLLIIKTR
CCEEHHHHHHHHHHHHHHCCCCEEEEEECCCCEEECCCCCCCCCCCCHHHHHHHHEEECC
PYKAEEIREILRIRADEEEIPLTEEALEELTKLGVERSLRYAVQLMEPARIIAEREGRNK
CCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC
VTAEDVKKAAEYFVDVRESIRYIRELEEEFLK
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
>Mature Secondary Structure 
AVIREVKPAREMRRIGSHSHIRGLGLDEKGRAKFIADGMVGQVEAREAAGIVVQMIKEG
CCHHHCCHHHHHHHCCCCCCEEECCCCCCCCCEEEECCCCCCCHHHHHHHHHHHHHHHC
RMAGRGVLIVGPSGTGKTAIAVGIAKELGEDTPFVAMSGSEIYSSELKKTEVLMQAIRKA
CCCCCEEEEECCCCCCCHHHEEHHHHHCCCCCCEEEECCCHHHHHHHHHHHHHHHHHHHH
IGVRIKVHKDVYEGVVTRIRIAYVKHPFNPYVRVPSEAEITLETRDDSRTLRVGEEVAAQ
HCEEEEEHHHHHHHHHHHHHHHEEECCCCCCEECCCCCEEEEEECCCCCHHHHHHHHHHH
LIQLRVRKGDVIWIDAETGEVYKVGRACEKESKRYDVSYFRCVDIPDGPVRKRKEIVHTL
HHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCCCCEEEEEEECCCCCHHHHHHHHHHH
TLHDLDVAYAAQRTAFATLLGMPATREIPSEVRQRVDEEVKKMINEGRAELVPGVLFIDD
HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHCCCEEEECC
AHMLDIEAFSFLTRAMESELAPILVLATNRGVTKIRGTDIESPHGIPLDLLDRLLIIKTR
CCEEHHHHHHHHHHHHHHCCCCEEEEEECCCCEEECCCCCCCCCCCCHHHHHHHHEEECC
PYKAEEIREILRIRADEEEIPLTEEALEELTKLGVERSLRYAVQLMEPARIIAEREGRNK
CCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC
VTAEDVKKAAEYFVDVRESIRYIRELEEEFLK
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA