| Definition | Thermoanaerobacter sp. X514 chromosome, complete genome. |
|---|---|
| Accession | NC_010320 |
| Length | 2,457,259 |
Click here to switch to the map view.
The map label for this gene is yiaY [H]
Identifier: 167040564
GI number: 167040564
Start: 1945757
End: 1946926
Strand: Reverse
Name: yiaY [H]
Synonym: Teth514_1935
Alternate gene names: 167040564
Gene position: 1946926-1945757 (Counterclockwise)
Preceding gene: 167040565
Following gene: 167040563
Centisome position: 79.23
GC content: 34.87
Gene sequence:
>1170_bases GTGAAAATATTTAAATTCCATATGCCCCCTATAAATTTAATAGGTGTGGGATGTTTAAAAGATGTGGGAAGGGAGATCAA AAAATTAGGTTTTAAAAAAGGAATTATTGTTACAGATAAAGTACTTGTCAGAGCTGGGCTTGTGAATAATGTAATTAGTG TTTTAGAAGAAGAAGGAATAGAATATGTTGTCTTTGATGAAACAAAACCCAACCCTACAATTAAAAATGTAACAAATGGA CTTAAGCTTTTGATAGAGAATAAGTGTGATTTTATTATTTCGTGCGGCGGAGGATCAGCTCATGACTGCGCAAAAGGGAT AGGCCTCATTGCTAAAGAGAAGAATTTCATTGATGAGGTAGAGCGTCTAGACAAAGTAAAGTGTGGTGGTTGGAATAGTG CATTATTACTGCCCCTAGTTGCTATAAATACCACGGCTGGAACAGGTAGTGAAGTTACTAAATTTGCTATAATTACAGAT GAAGAAAAACGTATTAAAATGCCAATTGTGGATTGGCGCATTACACCTCTAATAGCAGTAAATGATCCTCTCTTGATGAT AGGTATGCCAAAATCTCTAACAGCTGCAAGTGGCATGGATGCACTAACTCACGCTATTGAAGCTTACATTTCGATTGATG CAAATCCATTTACAGATGCACTTGCTTTGAAAGCTATTGAAATTATATTCAACTACCTTAAAAGAGCGGTAGAAAATGGA AATGATATTGAAGCAAGAGAAAAGATGGCATATGCAGAGTTCTTGGCGGGGATTGCTTTTAATAACGCAGGTTTAGGTTA TGTCCATGCTATGGCTCATCAATTAGGAGGATTTTACGATCTTCCTCATGGTGTATGTAATGCCGTATTATTACCTCATG TTTTGGAATATAATCTTGAGGCAGTTCAAAATAAACTTATATATATAGCGAAAGCGATGGGTATAGATGTAGATAAATTA ACAACAAAAGAAATAGGAGGCAAAATTATTGAAAGCATAAACCAGCTCTCTCAAGAGATTGGTATACCATCGAGGTTAAA AGAACTGGGGGTAAAAGAAGAAGACATTAAAGAGTTATCGCAAAATGCATTAAAAGATGTATGTGGTTTTACAAATCCTA AAAAGGCAACATTAGAAGATATTATTAATATTTTCAAGTCTGCAATGTAA
Upstream 100 bases:
>100_bases AATTACAGTTATAAACTAGGAGAGTATTCTTATTTATTGATTGAGGAAGGATTGTATTTTCTATAAACTTTAAAAAAGTT TTTGATATGAGGTGATAAAT
Downstream 100 bases:
>100_bases AAACTAGCATAAAATTTTTAGCGGGAATAAGTTAATTAAATTTTACCTTGCTTCTAAATGCCACAACGCATAGAGTTATA AATAGATTTATAAATAATGA
Product: iron-containing alcohol dehydrogenase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 389; Mature: 389
Protein sequence:
>389_residues MKIFKFHMPPINLIGVGCLKDVGREIKKLGFKKGIIVTDKVLVRAGLVNNVISVLEEEGIEYVVFDETKPNPTIKNVTNG LKLLIENKCDFIISCGGGSAHDCAKGIGLIAKEKNFIDEVERLDKVKCGGWNSALLLPLVAINTTAGTGSEVTKFAIITD EEKRIKMPIVDWRITPLIAVNDPLLMIGMPKSLTAASGMDALTHAIEAYISIDANPFTDALALKAIEIIFNYLKRAVENG NDIEAREKMAYAEFLAGIAFNNAGLGYVHAMAHQLGGFYDLPHGVCNAVLLPHVLEYNLEAVQNKLIYIAKAMGIDVDKL TTKEIGGKIIESINQLSQEIGIPSRLKELGVKEEDIKELSQNALKDVCGFTNPKKATLEDIINIFKSAM
Sequences:
>Translated_389_residues MKIFKFHMPPINLIGVGCLKDVGREIKKLGFKKGIIVTDKVLVRAGLVNNVISVLEEEGIEYVVFDETKPNPTIKNVTNG LKLLIENKCDFIISCGGGSAHDCAKGIGLIAKEKNFIDEVERLDKVKCGGWNSALLLPLVAINTTAGTGSEVTKFAIITD EEKRIKMPIVDWRITPLIAVNDPLLMIGMPKSLTAASGMDALTHAIEAYISIDANPFTDALALKAIEIIFNYLKRAVENG NDIEAREKMAYAEFLAGIAFNNAGLGYVHAMAHQLGGFYDLPHGVCNAVLLPHVLEYNLEAVQNKLIYIAKAMGIDVDKL TTKEIGGKIIESINQLSQEIGIPSRLKELGVKEEDIKELSQNALKDVCGFTNPKKATLEDIINIFKSAM >Mature_389_residues MKIFKFHMPPINLIGVGCLKDVGREIKKLGFKKGIIVTDKVLVRAGLVNNVISVLEEEGIEYVVFDETKPNPTIKNVTNG LKLLIENKCDFIISCGGGSAHDCAKGIGLIAKEKNFIDEVERLDKVKCGGWNSALLLPLVAINTTAGTGSEVTKFAIITD EEKRIKMPIVDWRITPLIAVNDPLLMIGMPKSLTAASGMDALTHAIEAYISIDANPFTDALALKAIEIIFNYLKRAVENG NDIEAREKMAYAEFLAGIAFNNAGLGYVHAMAHQLGGFYDLPHGVCNAVLLPHVLEYNLEAVQNKLIYIAKAMGIDVDKL TTKEIGGKIIESINQLSQEIGIPSRLKELGVKEEDIKELSQNALKDVCGFTNPKKATLEDIINIFKSAM
Specific function: Unknown
COG id: COG1454
COG function: function code C; Alcohol dehydrogenase, class IV
Gene ontology:
Cell location: Cytoplasm [C]
Metaboloic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the iron-containing alcohol dehydrogenase family [H]
Homologues:
Organism=Homo sapiens, GI133922590, Length=422, Percent_Identity=27.9620853080569, Blast_Score=156, Evalue=4e-38, Organism=Escherichia coli, GI48994951, Length=389, Percent_Identity=54.7557840616967, Blast_Score=426, Evalue=1e-120, Organism=Escherichia coli, GI1789163, Length=373, Percent_Identity=44.7721179624665, Blast_Score=311, Evalue=3e-86, Organism=Escherichia coli, GI87082107, Length=388, Percent_Identity=35.8247422680412, Blast_Score=250, Evalue=9e-68, Organism=Escherichia coli, GI1787493, Length=385, Percent_Identity=35.0649350649351, Blast_Score=225, Evalue=5e-60, Organism=Escherichia coli, GI1789386, Length=373, Percent_Identity=24.9329758713137, Blast_Score=102, Evalue=4e-23, Organism=Caenorhabditis elegans, GI17537053, Length=412, Percent_Identity=28.1553398058252, Blast_Score=162, Evalue=2e-40, Organism=Saccharomyces cerevisiae, GI6321181, Length=385, Percent_Identity=49.0909090909091, Blast_Score=363, Evalue=1e-101, Organism=Drosophila melanogaster, GI24657991, Length=413, Percent_Identity=28.8135593220339, Blast_Score=157, Evalue=8e-39,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001670 - InterPro: IPR018211 [H]
Pfam domain/function: PF00465 Fe-ADH [H]
EC number: =1.1.1.1 [H]
Molecular weight: Translated: 42462; Mature: 42462
Theoretical pI: Translated: 6.19; Mature: 6.19
Prosite motif: PS00013 PROKAR_LIPOPROTEIN ; PS00913 ADH_IRON_1 ; PS00060 ADH_IRON_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.8 %Cys (Translated Protein) 2.6 %Met (Translated Protein) 4.4 %Cys+Met (Translated Protein) 1.8 %Cys (Mature Protein) 2.6 %Met (Mature Protein) 4.4 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MKIFKFHMPPINLIGVGCLKDVGREIKKLGFKKGIIVTDKVLVRAGLVNNVISVLEEEGI CCEEEECCCCHHHEEHHHHHHHHHHHHHCCCCCCEEEHHHHHHHHHHHHHHHHHHHHCCC EYVVFDETKPNPTIKNVTNGLKLLIENKCDFIISCGGGSAHDCAKGIGLIAKEKNFIDEV EEEEECCCCCCCCHHHHHHHHHHHEECCCCEEEECCCCCHHHHHHHCCCCCCCHHHHHHH ERLDKVKCGGWNSALLLPLVAINTTAGTGSEVTKFAIITDEEKRIKMPIVDWRITPLIAV HHHHCCCCCCCCCHHHHHHHHEECCCCCCCCCEEEEEEECCCCEEECCEECEEECEEEEE NDPLLMIGMPKSLTAASGMDALTHAIEAYISIDANPFTDALALKAIEIIFNYLKRAVENG CCCEEEEECCCHHHHHCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCC NDIEAREKMAYAEFLAGIAFNNAGLGYVHAMAHQLGGFYDLPHGVCNAVLLPHVLEYNLE CCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHH AVQNKLIYIAKAMGIDVDKLTTKEIGGKIIESINQLSQEIGIPSRLKELGVKEEDIKELS HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCCCHHHHHHHH QNALKDVCGFTNPKKATLEDIINIFKSAM HHHHHHHHCCCCCCHHHHHHHHHHHHHCC >Mature Secondary Structure MKIFKFHMPPINLIGVGCLKDVGREIKKLGFKKGIIVTDKVLVRAGLVNNVISVLEEEGI CCEEEECCCCHHHEEHHHHHHHHHHHHHCCCCCCEEEHHHHHHHHHHHHHHHHHHHHCCC EYVVFDETKPNPTIKNVTNGLKLLIENKCDFIISCGGGSAHDCAKGIGLIAKEKNFIDEV EEEEECCCCCCCCHHHHHHHHHHHEECCCCEEEECCCCCHHHHHHHCCCCCCCHHHHHHH ERLDKVKCGGWNSALLLPLVAINTTAGTGSEVTKFAIITDEEKRIKMPIVDWRITPLIAV HHHHCCCCCCCCCHHHHHHHHEECCCCCCCCCEEEEEEECCCCEEECCEECEEECEEEEE NDPLLMIGMPKSLTAASGMDALTHAIEAYISIDANPFTDALALKAIEIIFNYLKRAVENG CCCEEEEECCCHHHHHCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCC NDIEAREKMAYAEFLAGIAFNNAGLGYVHAMAHQLGGFYDLPHGVCNAVLLPHVLEYNLE CCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHH AVQNKLIYIAKAMGIDVDKLTTKEIGGKIIESINQLSQEIGIPSRLKELGVKEEDIKELS HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCCCHHHHHHHH QNALKDVCGFTNPKKATLEDIINIFKSAM HHHHHHHHCCCCCCHHHHHHHHHHHHHCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 8041620; 9278503; 7768815 [H]