Definition: Sulfolobus islandicus M.14.25 chromosome, complete genome.
Accession: NC_012588
Length: 2,608,832

The map label for this gene is alkK [H]

Identifier: 227826885

GI number: 227826885

Start: 458445

End: 459986

Strand: Direct

Name: alkK [H]

Synonym: M1425_0517

Alternate gene names: 227826885

Gene position: 458445-459986 (Clockwise)

Preceding gene: 227826884

Following gene: 227826887

Centisome position: 17.57
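
The gene length and centisome position can be re-derived from the coordinates listed above. The sketch below assumes that the centisome position is the start coordinate expressed as a percentage of the genome length, and that coordinates are 1-based and inclusive; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from the Start, End and Length fields of this record.
print(gene_length(458445, 459986))            # 1542
print(round(centisome(458445, 2608832), 2))   # 17.57
```

Under these assumptions both results agree with the 1542-base gene sequence and the reported centisome position of 17.57.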

GC content: 35.28

Gene sequence:

>1542_bases
ATGCAAGAGGGTTTTACCGTATTCTCGCTTCTTAAAAGGGCAGTAACTATAGCGCCAGATAAGGAAATAGTAGATCCTTT
TCGCAATGTTAGACAGTCATATAAGGAGACTTATGAAAGAATAATAGGTATATCTAACTCAATGCTATCGATTGGAATAT
CTAAGGGAAGTATAATAGGGGTTGCAGATTATAACACCCTTAAACTCGTTGAGCTATTATTCGCTTCTAGCTTAATAGGT
ACAATAATATATCCAGTTAATGTCAAACTACCTTACGATCAATTACTTTATACAATTAAGCACGCTAGGGTAGAATGGTT
ATTCGCTTCAAAAGATTTCATATCCTTATTTAAGGACTTCACTAAGGAGAAAATTATTAGTATAGATTCCAATGATACTA
AGATAACTTACGATGATTTGGTAAGTAGAAAACTAGTTAAAGAACCCGAAATTTACGTTAAAGGAAGTGATCCATACTCT
ATCTTATTTACGTCAGGTACAACAGGGTTACCTAAAGCGGTTATGTATACTAATGAAAAAACGGTTCATGGAGCAATAGG
TATGGTGCACCAGCTATCACTTTACAATAGCCCTTCTTCGTTGAAGAACAACGATATCATATTAGGTCTTATACCATTTT
ATCATTTATGGTCATGGGGTTCACTATTTCATGCTTCTTACCTAGGCGCTAAATACGTCACAAGTGGAAAATTTGAGCCA
ATAAAAACATTGGAAATAATAGAAAAGGAAAAGGTAACCTGGCTTAACGCTGTCCCCACAATGATGTATATGTTACTAAG
CGCAGCCAAACAAGGGCAACTAAATGGCTTAAAAACTTTGATAGGCGGTTCTCCAATATCATCTAATCTGGCTAAGAAGT
TAAAAGAAAGTGGAGTATCTTTTGCATCAATATATGGTGGAACAGATATGTTAGCAATTTCGATTACTATCATTCCCGCA
AATACCAATATACAAAGCATCGAAGATTACTCGAGAGTATATACTCATCCTCTTCCTTTCGTGGAATTGAAAGTAGTTAA
GCCAGACGGTAAAGAGGCTAAAGTAGGAGAGATAGGACATTTATGGGTTAAAACCCCTTGGCTACCTGGTGAGTATCTTA
ACGATTTAGACAATACTAAATCTTCCTACGAGGATGGTTGGTTCAAAACTGGAGATATAGCTATGATTATAGATGACTAC
CATACTATAAGAATCTTGGATAGGGAGAAGGACTTAATAAAGAGTGGAGGAGAGTGGATAATACCTAGTATAATTGAGTC
AATAATATCTGAAGTAAATGGAGTAGATCTCGTTGCTGTAATAGGTAGAATAGATGAAAAATGGGGAGAAAGACCTATAG
CATTAGTCAAAGGTAAGGGATCAAATTTAAAAGAAAACATAATCAGCCATTTAAGAAGTGCTTCAACTCAAGGTTTGATA
CCAAAATGGTGGATTCCAGATGATATAGTTATTGTAGATGATTTACCCCTAACCAGCACTGGAAAGGTTAACAAAAAAGT
TTTAAAAGAGAGAACTAAATGA
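
The GC content field can be checked against the sequence block above. This is a minimal sketch, assuming that GC content means the percentage of G and C among the unambiguous bases; `gc_percent` is a hypothetical helper, not part of any tool that produced this record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Whitespace, line breaks and any other characters are ignored, so the
    wrapped lines of the sequence block can be passed in as one string.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Example on the first ten bases of the gene sequence.
print(gc_percent("ATGCAAGAGG"))
```

Passing the full 1542-base sequence (header line excluded) should give a value comparable to the GC content field, if the record uses the same definition.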

Upstream 100 bases:

>100_bases
ACTATGAGAGATTTCTAATGGGTAGAAGCATTCTACGAGATTATATATCTGAAATACGTGAGCTTATAAATATAGAAGAG
ATGAAATAGAGGTTTTAGAA

Downstream 100 bases:

>100_bases
TGATAATAAGATATAATTTCATGAAAATACTATAACTTATAAATGGTGGTGAAAAGACAATTTTGGAAAACATCTTTCAG
ATAGAAACATCATACCATCT

Product: AMP-dependent synthetase and ligase

Products: NA

Alternate protein names: Medium-chain acyl-CoA synthetase [H]

Number of amino acids: Translated: 513; Mature: 513
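
The residue count follows from the length of the coding sequence: 1542 bases make 514 codons, and the final codon (TGA) is a stop codon, leaving 513 residues. A short sketch of that arithmetic, with `residues_from_cds` as an illustrative name and the three standard stop codons assumed:

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard stop codons (assumed to apply here)


def residues_from_cds(cds: str) -> int:
    """Number of amino acids encoded by a coding sequence.

    A trailing stop codon, if present, is not counted as a residue.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = len(cds) // 3
    if cds[-3:] in STOP_CODONS:
        codons -= 1
    return codons


# A placeholder sequence with the same length and stop codon as this gene.
placeholder = "ATG" + "AAA" * 512 + "TGA"
print(len(placeholder), residues_from_cds(placeholder))  # 1542 513
```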

Protein sequence:

>513_residues
MQEGFTVFSLLKRAVTIAPDKEIVDPFRNVRQSYKETYERIIGISNSMLSIGISKGSIIGVADYNTLKLVELLFASSLIG
TIIYPVNVKLPYDQLLYTIKHARVEWLFASKDFISLFKDFTKEKIISIDSNDTKITYDDLVSRKLVKEPEIYVKGSDPYS
ILFTSGTTGLPKAVMYTNEKTVHGAIGMVHQLSLYNSPSSLKNNDIILGLIPFYHLWSWGSLFHASYLGAKYVTSGKFEP
IKTLEIIEKEKVTWLNAVPTMMYMLLSAAKQGQLNGLKTLIGGSPISSNLAKKLKESGVSFASIYGGTDMLAISITIIPA
NTNIQSIEDYSRVYTHPLPFVELKVVKPDGKEAKVGEIGHLWVKTPWLPGEYLNDLDNTKSSYEDGWFKTGDIAMIIDDY
HTIRILDREKDLIKSGGEWIIPSIIESIISEVNGVDLVAVIGRIDEKWGERPIALVKGKGSNLKENIISHLRSASTQGLI
PKWWIPDDIVIVDDLPLTSTGKVNKKVLKERTK

Specific function: Unknown

COG id: COG0318

COG function: function code IQ; Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II

Gene ontology:

Cell location: Cytoplasm [H]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the ATP-dependent AMP-binding enzyme family [H]

Homologues:

Organism=Homo sapiens, GI156151445, Length=549, Percent_Identity=25.3187613843352, Blast_Score=137, Evalue=3e-32,
Organism=Homo sapiens, GI187761345, Length=551, Percent_Identity=25.4083484573503, Blast_Score=112, Evalue=1e-24,
Organism=Homo sapiens, GI187761343, Length=551, Percent_Identity=25.4083484573503, Blast_Score=112, Evalue=1e-24,
Organism=Homo sapiens, GI42544132, Length=469, Percent_Identity=22.3880597014925, Blast_Score=79, Evalue=1e-14,
Organism=Homo sapiens, GI28416953, Length=532, Percent_Identity=21.6165413533835, Blast_Score=79, Evalue=1e-14,
Organism=Homo sapiens, GI122937307, Length=462, Percent_Identity=22.5108225108225, Blast_Score=70, Evalue=7e-12,
Organism=Escherichia coli, GI145693145, Length=521, Percent_Identity=23.2245681381958, Blast_Score=120, Evalue=2e-28,
Organism=Escherichia coli, GI1788107, Length=358, Percent_Identity=25.6983240223464, Blast_Score=103, Evalue=3e-23,
Organism=Escherichia coli, GI221142682, Length=508, Percent_Identity=24.2125984251968, Blast_Score=101, Evalue=9e-23,
Organism=Escherichia coli, GI1786810, Length=518, Percent_Identity=24.3243243243243, Blast_Score=98, Evalue=1e-21,
Organism=Escherichia coli, GI1786801, Length=451, Percent_Identity=21.9512195121951, Blast_Score=82, Evalue=6e-17,
Organism=Escherichia coli, GI1788595, Length=298, Percent_Identity=25.1677852348993, Blast_Score=79, Evalue=9e-16,
Organism=Caenorhabditis elegans, GI71994694, Length=561, Percent_Identity=26.0249554367201, Blast_Score=137, Evalue=2e-32,
Organism=Caenorhabditis elegans, GI71994703, Length=561, Percent_Identity=25.8467023172906, Blast_Score=137, Evalue=2e-32,
Organism=Caenorhabditis elegans, GI71994690, Length=561, Percent_Identity=26.0249554367201, Blast_Score=137, Evalue=2e-32,
Organism=Caenorhabditis elegans, GI17560308, Length=554, Percent_Identity=25.0902527075812, Blast_Score=130, Evalue=2e-30,
Organism=Caenorhabditis elegans, GI17531443, Length=503, Percent_Identity=25.844930417495, Blast_Score=124, Evalue=1e-28,
Organism=Caenorhabditis elegans, GI17559526, Length=389, Percent_Identity=26.2210796915167, Blast_Score=122, Evalue=6e-28,
Organism=Caenorhabditis elegans, GI71996755, Length=491, Percent_Identity=25.4582484725051, Blast_Score=120, Evalue=2e-27,
Organism=Caenorhabditis elegans, GI17558820, Length=480, Percent_Identity=26.25, Blast_Score=112, Evalue=5e-25,
Organism=Caenorhabditis elegans, GI32563687, Length=467, Percent_Identity=25.0535331905782, Blast_Score=106, Evalue=4e-23,
Organism=Caenorhabditis elegans, GI17557194, Length=507, Percent_Identity=23.6686390532544, Blast_Score=94, Evalue=1e-19,
Organism=Caenorhabditis elegans, GI71985884, Length=328, Percent_Identity=28.6585365853659, Blast_Score=92, Evalue=7e-19,
Organism=Caenorhabditis elegans, GI17560140, Length=451, Percent_Identity=21.9512195121951, Blast_Score=90, Evalue=3e-18,
Organism=Caenorhabditis elegans, GI17538037, Length=513, Percent_Identity=23.9766081871345, Blast_Score=89, Evalue=5e-18,
Organism=Caenorhabditis elegans, GI25141359, Length=358, Percent_Identity=23.1843575418994, Blast_Score=72, Evalue=5e-13,
Organism=Drosophila melanogaster, GI21355181, Length=518, Percent_Identity=26.0617760617761, Blast_Score=145, Evalue=4e-35,
Organism=Drosophila melanogaster, GI24653035, Length=488, Percent_Identity=26.2295081967213, Blast_Score=123, Evalue=2e-28,
Organism=Drosophila melanogaster, GI24581924, Length=532, Percent_Identity=23.6842105263158, Blast_Score=121, Evalue=1e-27,
Organism=Drosophila melanogaster, GI24648253, Length=483, Percent_Identity=26.0869565217391, Blast_Score=119, Evalue=5e-27,
Organism=Drosophila melanogaster, GI24648255, Length=485, Percent_Identity=26.1855670103093, Blast_Score=117, Evalue=2e-26,
Organism=Drosophila melanogaster, GI161076582, Length=497, Percent_Identity=25.3521126760563, Blast_Score=114, Evalue=2e-25,
Organism=Drosophila melanogaster, GI21356441, Length=488, Percent_Identity=25.4098360655738, Blast_Score=108, Evalue=7e-24,
Organism=Drosophila melanogaster, GI24648260, Length=490, Percent_Identity=24.8979591836735, Blast_Score=105, Evalue=1e-22,
Organism=Drosophila melanogaster, GI21356947, Length=507, Percent_Identity=25.0493096646943, Blast_Score=104, Evalue=1e-22,
Organism=Drosophila melanogaster, GI18859661, Length=396, Percent_Identity=26.010101010101, Blast_Score=99, Evalue=8e-21,
Organism=Drosophila melanogaster, GI24656500, Length=518, Percent_Identity=22.972972972973, Blast_Score=93, Evalue=4e-19,
Organism=Drosophila melanogaster, GI20130357, Length=441, Percent_Identity=24.7165532879819, Blast_Score=91, Evalue=2e-18,
Organism=Drosophila melanogaster, GI21358303, Length=539, Percent_Identity=23.5621521335807, Blast_Score=89, Evalue=9e-18,
Organism=Drosophila melanogaster, GI24648257, Length=530, Percent_Identity=23.9622641509434, Blast_Score=89, Evalue=1e-17,
Organism=Drosophila melanogaster, GI19922652, Length=492, Percent_Identity=22.3577235772358, Blast_Score=84, Evalue=2e-16,
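
Each homologue line above packs several fields into one comma-separated string. The following sketch shows one way to turn such a line into a typed record; the field names are copied from the list, while `parse_homologue` and the `GI` key are hypothetical choices made for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for token in line.strip().rstrip(",").split(","):
        token = token.strip()
        if not token:
            continue
        if "=" in token:
            key, value = token.split("=", 1)
            record[key] = value
        elif token.startswith("GI"):
            # The GI number is the only field without a 'key=' prefix.
            record["GI"] = token[2:]
    # Convert the numeric fields; the others stay as strings.
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = ("Organism=Drosophila melanogaster, GI21355181, Length=518, "
        "Percent_Identity=26.0617760617761, Blast_Score=145, Evalue=4e-35,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Blast_Score"], hit["Evalue"])
```

Parsed records can then be sorted by `Evalue` or `Blast_Score` to rank hits across organisms.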

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR020845
- InterPro:   IPR000873 [H]

Pfam domain/function: PF00501 AMP-binding [H]

EC number: NA

Molecular weight: Translated: 57313; Mature: 57313

Theoretical pI: Translated: 8.79; Mature: 8.79

Prosite motif: PS00455 AMP_BINDING

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.0 %Cys     (Translated Protein)
1.8 %Met     (Translated Protein)
1.8 %Cys+Met (Translated Protein)
0.0 %Cys     (Mature Protein)
1.8 %Met     (Mature Protein)
1.8 %Cys+Met (Mature Protein)
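
The percentages above are plain residue frequencies: the 513-residue protein contains no cysteine and 9 methionines, and 9/513 rounds to 1.8 %. A minimal sketch of the calculation, assuming one-letter amino acid codes and rounding to one decimal place (`cys_met_content` is an illustrative name):

```python
def cys_met_content(protein: str) -> dict:
    """Percentage of Cys (C), Met (M) and their sum in a protein sequence."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty protein sequence")
    cys = 100.0 * seq.count("C") / len(seq)
    met = 100.0 * seq.count("M") / len(seq)
    return {
        "Cys": round(cys, 1),
        "Met": round(met, 1),
        "Cys+Met": round(cys + met, 1),
    }


# Placeholder with the same composition as this protein: 9 Met, 0 Cys, 513 residues.
print(cys_met_content("M" * 9 + "A" * 504))
```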

Secondary structure:

>Translated Secondary Structure
MQEGFTVFSLLKRAVTIAPDKEIVDPFRNVRQSYKETYERIIGISNSMLSIGISKGSIIG
CCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCEEE
VADYNTLKLVELLFASSLIGTIIYPVNVKLPYDQLLYTIKHARVEWLFASKDFISLFKDF
EECCCHHHHHHHHHHHHHHHHEEEEEEEECCHHHHHHHHHHHHEEEEEECHHHHHHHHHH
TKEKIISIDSNDTKITYDDLVSRKLVKEPEIYVKGSDPYSILFTSGTTGLPKAVMYTNEK
HHHHEEEECCCCCEEEHHHHHHHHHHCCCCEEEECCCCEEEEEECCCCCCCCEEEEECCC
TVHGAIGMVHQLSLYNSPSSLKNNDIILGLIPFYHLWSWGSLFHASYLGAKYVTSGKFEP
CHHHHHHHHHHHHHCCCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHCCCHHCCCCCCCC
IKTLEIIEKEKVTWLNAVPTMMYMLLSAAKQGQLNGLKTLIGGSPISSNLAKKLKESGVS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCHHHHHHHHHHCCCC
FASIYGGTDMLAISITIIPANTNIQSIEDYSRVYTHPLPFVELKVVKPDGKEAKVGEIGH
EEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCCCCCE
LWVKTPWLPGEYLNDLDNTKSSYEDGWFKTGDIAMIIDDYHTIRILDREKDLIKSGGEWI
EEEECCCCCHHHHHHHHCCCCCCCCCCEECCCEEEEEECCEEEEEECCCHHHHHCCCCEE
IPSIIESIISEVNGVDLVAVIGRIDEKWGERPIALVKGKGSNLKENIISHLRSASTQGLI
HHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCCCCC
PKWWIPDDIVIVDDLPLTSTGKVNKKVLKERTK
CCCCCCCCEEEEECCCCCCCCCHHHHHHHHCCC
>Mature Secondary Structure
MQEGFTVFSLLKRAVTIAPDKEIVDPFRNVRQSYKETYERIIGISNSMLSIGISKGSIIG
CCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCEEE
VADYNTLKLVELLFASSLIGTIIYPVNVKLPYDQLLYTIKHARVEWLFASKDFISLFKDF
EECCCHHHHHHHHHHHHHHHHEEEEEEEECCHHHHHHHHHHHHEEEEEECHHHHHHHHHH
TKEKIISIDSNDTKITYDDLVSRKLVKEPEIYVKGSDPYSILFTSGTTGLPKAVMYTNEK
HHHHEEEECCCCCEEEHHHHHHHHHHCCCCEEEECCCCEEEEEECCCCCCCCEEEEECCC
TVHGAIGMVHQLSLYNSPSSLKNNDIILGLIPFYHLWSWGSLFHASYLGAKYVTSGKFEP
CHHHHHHHHHHHHHCCCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHCCCHHCCCCCCCC
IKTLEIIEKEKVTWLNAVPTMMYMLLSAAKQGQLNGLKTLIGGSPISSNLAKKLKESGVS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCHHHHHHHHHHCCCC
FASIYGGTDMLAISITIIPANTNIQSIEDYSRVYTHPLPFVELKVVKPDGKEAKVGEIGH
EEEECCCCEEEEEEEEEEECCCCHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCCCCCE
LWVKTPWLPGEYLNDLDNTKSSYEDGWFKTGDIAMIIDDYHTIRILDREKDLIKSGGEWI
EEEECCCCCHHHHHHHHCCCCCCCCCCEECCCEEEEEECCEEEEEECCCHHHHHCCCCEE
IPSIIESIISEVNGVDLVAVIGRIDEKWGERPIALVKGKGSNLKENIISHLRSASTQGLI
HHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCCCCC
PKWWIPDDIVIVDDLPLTSTGKVNKKVLKERTK
CCCCCCCCEEEEECCCCCCCCCHHHHHHHHCCC
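
The secondary structure blocks alternate a line of residues with a line of predicted states. The sketch below separates the two and summarizes the state composition. It assumes the common convention that H is helix, E is strand and C is coil, which the record does not spell out; both function names are hypothetical.

```python
from collections import Counter


def split_interleaved(lines):
    """Split alternating residue/state lines into two aligned strings."""
    lines = [ln.strip() for ln in lines if ln.strip()]
    residues = "".join(lines[0::2])   # 1st, 3rd, 5th ... lines
    states = "".join(lines[1::2])     # 2nd, 4th, 6th ... lines
    if len(residues) != len(states):
        raise ValueError("residue and state strings differ in length")
    return residues, states


def state_fractions(states: str) -> dict:
    """Fraction of positions in each of the H, E and C states."""
    counts = Counter(states)
    return {s: counts.get(s, 0) / len(states) for s in "HEC"}


# Toy example in the same layout as the block above.
residues, states = split_interleaved(["MQEG", "CCCH", "FTVF", "HHHH"])
print(residues, states, state_fractions(states))
```

The header line starting with `>` should be dropped before the block is passed to `split_interleaved`.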

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 1453953 [H]