Definition | Shigella flexneri 2a str. 2457T, complete genome. |
---|---|
Accession | NC_004741 |
Length | 4,599,354 |
The map label for this gene is yidK
Identifier: 30064991
GI number: 30064991
Start: 3880216
End: 3881931
Strand: Reverse
Name: yidK
Synonym: S3986
Alternate gene names: 30064991
Gene position: 3881931-3880216 (Counterclockwise)
Preceding gene: 30064993
Following gene: 30064989
Centisome position: 84.4
GC content: 52.91
Gene sequence:
>1716_bases ATGAATTCGTTACAAATCTTGAGTTTTGTCGGTTTTACGCTGCTGGTGGCGGTGATCACCTGGTGGAAGGTTCGCAAAAC AGATACCGGATCGCAACAAGGCTATTTTCTTGCCGGACGTTCACTAAAAGCGCCGGTTATTGCCGCTTCGTTAATGTTAA CCAACCTTTCCACGGAACAACTGGTTGGTCTTTCCGGGCAGGCCTACAAAAGCGGCATGTCGGTGATGGGCTGGGAAGTG ACTTCAGCGGTGACGCTGATCTTCCTCGCGCTAATCTTTTTACCGCGCTATCTGAAGCGCGGCATTGCCACCATCCCCGA TTTTCTGGAGGAACGTTATGATAAAACGACGCGTATTATCATCGACTTCTGCTTCCTCATTGCCACCGGCGTCTGCTTTC TGCCGATTGTTCTCTACTCCGGCGCGTTGGCGCTCAACAGCCTGTTTCACGTCGGGGAATCGCTACAGATTTCTCACGGT GCGGCTATCTGGCTACTGGTAATTTTGCTTGGTCTGGCGGGAATTTTGTATGCGGTGATCGGCGGACTGCGCGCAATGGC AGTGGCGGACTCCATCAACGGTATTGGGCTGGTTATCGGCGGGTTGATGGTGCCGGTATTTGGCCTGATCGCGATAGGCA AGGGCAGCTTTATGCAGGGCATTGAGCAAATTACCACCGTTCACGCCGAGAAATTAAACTCAGTCGGTGGCCCGACCGAT CCCTTGCCGATTGGCGCGGCATTTACCGGTTTAATTCTGGTGAACACCTTTTACTGGTGTACAAATCAGGGCATCGTGCA ACGCACGCTGGCGTCAAAAAGCCTGGCGGAAGGGCAAAAGGGGGCGCTGTTAACGGCGGTGCTGAAAATGCTCGACCCGC TGGTACTGGTGCTGCCAGGGTTGATTGCGTTTCATCTGTATCAGGATCTACCGAAAGCCGACATGGCCTACCCGACGCTG GTCAATAACGTTCTGCCAGTGCCACTGGAGGGTTTCTTCGGCGCGGTGTTATTTGGTGCGGTGATCAGTACCTTCAACGG CTTTCTGAATAGCGCCAGTACATTATTCAGTATGGGTATTTACCGTCGCATCATTAACCAGAATGCCGAGCCGCAGCAGC TGGTCACCGTCGGGCGCAAATTTGGTTTCTTTATCGCTATCGTTTCGGTTTTGGTAGCGCCGTGGATCGCGAACGCGCCG CAGGGGCTGTATAGCTGGATGAAACAGCTCAACGGCATTTACAACGTGCCGCTGGTTACCATCATCATTATGGGCTTTTT CTTTCCGCGCATCCCGGCGCTGGCGGCAAAAGTGGCGATGGGGATTGGCATAATCAGCTACATCACCATCAACTATCTGG TGAAGTTCGACTTCCATTTCCTCTATGTGCTGGCCTGTACGTTCTGCATCAACGTGGTCGTGATGCTGGTGATCGGTTTT ATCAAACCGCGCGCCACGCCGTTCACCTTCAAAGATGCGTTTGCGGTGGACATGAAACCGTGGAAAAACGTCAAGATCGC GTCAATTGGTATCCTGTTCGCGATGATTGGCGTCTATGTCGGGCTGGCTGAATTCGGCGGCTACGGTACGCGCTGGTTAG CGATGATCAGTTATTTCATCGCTGCCGTAGTGATTGTCTACCTGATTTTTGACAGCTGGTGGCATCGTCACGACCCAGCC GTAACCTTTACTCCCGACGCGAAGGATAGCCTATGA
Upstream 100 bases:
>100_bases ACGACGTTTGTATTTTAAGAATCTGACTGCCTGACCCGACGCATTTTACCTCTCCCTATATTCATGCGTCCGGGACATAA ACATAAATAAGGGCTATGAG
Downstream 100 bases:
>100_bases AACGCCCCAATTGACCGATACCCAGGCCACCAATATGGTCGGTTGCTATAGCGGTAAACCGCTGAATACGCAAAATATTG ATAGTCTGGCGGCGGAAGGT
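The positional fields above are related by simple arithmetic, which the following Python sketch illustrates. It assumes 1-based inclusive coordinates and assumes that the centisome position is the start coordinate expressed as a percentage of the genome length; the record does not state either convention explicitly. All function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_599_354            # Length of NC_004741
START, END = 3_880_216, 3_881_931    # Start / End of yidK

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of genome length (assumed definition)."""
    return 100.0 * position / genome_length


def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


def reverse_complement(seq: str) -> str:
    """Reverse complement, as needed to read a reverse-strand gene 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


length = gene_length(START, END)                     # 1716, matches ">1716_bases"
position = round(centisome(START, GENOME_LENGTH), 1)  # 84.4
residues = length // 3 - 1                           # 571 codons excluding the stop
```

Passing the gene sequence listed above to `gc_content` should give a value comparable to the GC content field, provided that field was computed over the same 1716 bases.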
Product: putative symporter YidK
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 571; Mature: 571
Protein sequence:
>571_residues MNSLQILSFVGFTLLVAVITWWKVRKTDTGSQQGYFLAGRSLKAPVIAASLMLTNLSTEQLVGLSGQAYKSGMSVMGWEV TSAVTLIFLALIFLPRYLKRGIATIPDFLEERYDKTTRIIIDFCFLIATGVCFLPIVLYSGALALNSLFHVGESLQISHG AAIWLLVILLGLAGILYAVIGGLRAMAVADSINGIGLVIGGLMVPVFGLIAIGKGSFMQGIEQITTVHAEKLNSVGGPTD PLPIGAAFTGLILVNTFYWCTNQGIVQRTLASKSLAEGQKGALLTAVLKMLDPLVLVLPGLIAFHLYQDLPKADMAYPTL VNNVLPVPLEGFFGAVLFGAVISTFNGFLNSASTLFSMGIYRRIINQNAEPQQLVTVGRKFGFFIAIVSVLVAPWIANAP QGLYSWMKQLNGIYNVPLVTIIIMGFFFPRIPALAAKVAMGIGIISYITINYLVKFDFHFLYVLACTFCINVVVMLVIGF IKPRATPFTFKDAFAVDMKPWKNVKIASIGILFAMIGVYVGLAEFGGYGTRWLAMISYFIAAVVIVYLIFDSWWHRHDPA VTFTPDAKDSL
Sequences:
>Translated_571_residues MNSLQILSFVGFTLLVAVITWWKVRKTDTGSQQGYFLAGRSLKAPVIAASLMLTNLSTEQLVGLSGQAYKSGMSVMGWEV TSAVTLIFLALIFLPRYLKRGIATIPDFLEERYDKTTRIIIDFCFLIATGVCFLPIVLYSGALALNSLFHVGESLQISHG AAIWLLVILLGLAGILYAVIGGLRAMAVADSINGIGLVIGGLMVPVFGLIAIGKGSFMQGIEQITTVHAEKLNSVGGPTD PLPIGAAFTGLILVNTFYWCTNQGIVQRTLASKSLAEGQKGALLTAVLKMLDPLVLVLPGLIAFHLYQDLPKADMAYPTL VNNVLPVPLEGFFGAVLFGAVISTFNGFLNSASTLFSMGIYRRIINQNAEPQQLVTVGRKFGFFIAIVSVLVAPWIANAP QGLYSWMKQLNGIYNVPLVTIIIMGFFFPRIPALAAKVAMGIGIISYITINYLVKFDFHFLYVLACTFCINVVVMLVIGF IKPRATPFTFKDAFAVDMKPWKNVKIASIGILFAMIGVYVGLAEFGGYGTRWLAMISYFIAAVVIVYLIFDSWWHRHDPA VTFTPDAKDSL >Mature_571_residues MNSLQILSFVGFTLLVAVITWWKVRKTDTGSQQGYFLAGRSLKAPVIAASLMLTNLSTEQLVGLSGQAYKSGMSVMGWEV TSAVTLIFLALIFLPRYLKRGIATIPDFLEERYDKTTRIIIDFCFLIATGVCFLPIVLYSGALALNSLFHVGESLQISHG AAIWLLVILLGLAGILYAVIGGLRAMAVADSINGIGLVIGGLMVPVFGLIAIGKGSFMQGIEQITTVHAEKLNSVGGPTD PLPIGAAFTGLILVNTFYWCTNQGIVQRTLASKSLAEGQKGALLTAVLKMLDPLVLVLPGLIAFHLYQDLPKADMAYPTL VNNVLPVPLEGFFGAVLFGAVISTFNGFLNSASTLFSMGIYRRIINQNAEPQQLVTVGRKFGFFIAIVSVLVAPWIANAP QGLYSWMKQLNGIYNVPLVTIIIMGFFFPRIPALAAKVAMGIGIISYITINYLVKFDFHFLYVLACTFCINVVVMLVIGF IKPRATPFTFKDAFAVDMKPWKNVKIASIGILFAMIGVYVGLAEFGGYGTRWLAMISYFIAAVVIVYLIFDSWWHRHDPA VTFTPDAKDSL
Specific function: Unknown
COG id: COG4146
COG function: function code R; Predicted symporter
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the sodium:solute symporter (SSF) (TC 2.A.21) family [H]
Homologues:
Organism=Homo sapiens, GI110835708, Length=461, Percent_Identity=29.0672451193059, Blast_Score=184, Evalue=3e-46
Organism=Homo sapiens, GI17941285, Length=456, Percent_Identity=29.6052631578947, Blast_Score=175, Evalue=1e-43
Organism=Homo sapiens, GI14140236, Length=489, Percent_Identity=26.3803680981595, Blast_Score=171, Evalue=2e-42
Organism=Homo sapiens, GI4507031, Length=468, Percent_Identity=27.3504273504274, Blast_Score=171, Evalue=3e-42
Organism=Homo sapiens, GI4507033, Length=472, Percent_Identity=29.6610169491525, Blast_Score=166, Evalue=5e-41
Organism=Homo sapiens, GI206597483, Length=505, Percent_Identity=27.5247524752475, Blast_Score=161, Evalue=1e-39
Organism=Homo sapiens, GI109659836, Length=454, Percent_Identity=29.7356828193833, Blast_Score=157, Evalue=4e-38
Organism=Homo sapiens, GI206597487, Length=530, Percent_Identity=26.9811320754717, Blast_Score=152, Evalue=9e-37
Organism=Homo sapiens, GI109659839, Length=470, Percent_Identity=28.5106382978723, Blast_Score=151, Evalue=2e-36
Organism=Homo sapiens, GI256985183, Length=435, Percent_Identity=25.2873563218391, Blast_Score=91, Evalue=4e-18
Organism=Homo sapiens, GI157671931, Length=451, Percent_Identity=22.8381374722838, Blast_Score=85, Evalue=2e-16
Organism=Homo sapiens, GI4507035, Length=440, Percent_Identity=22.7272727272727, Blast_Score=83, Evalue=6e-16
Organism=Homo sapiens, GI167466278, Length=337, Percent_Identity=25.5192878338279, Blast_Score=76, Evalue=1e-13
Organism=Escherichia coli, GI1790113, Length=571, Percent_Identity=98.5989492119089, Blast_Score=1130, Evalue=0.0
Organism=Escherichia coli, GI87082237, Length=472, Percent_Identity=23.0932203389831, Blast_Score=70, Evalue=4e-13
Organism=Caenorhabditis elegans, GI115533094, Length=433, Percent_Identity=20.554272517321, Blast_Score=70, Evalue=3e-12
Organism=Drosophila melanogaster, GI221459584, Length=335, Percent_Identity=28.955223880597, Blast_Score=83, Evalue=6e-16
Organism=Drosophila melanogaster, GI24645928, Length=520, Percent_Identity=23.8461538461538, Blast_Score=82, Evalue=8e-16
Organism=Drosophila melanogaster, GI19920916, Length=359, Percent_Identity=23.6768802228412, Blast_Score=77, Evalue=3e-14
Organism=Drosophila melanogaster, GI161076631, Length=356, Percent_Identity=23.876404494382, Blast_Score=77, Evalue=5e-14
Organism=Drosophila melanogaster, GI28573698, Length=342, Percent_Identity=24.8538011695906, Blast_Score=74, Evalue=2e-13
Organism=Drosophila melanogaster, GI221459586, Length=417, Percent_Identity=23.7410071942446, Blast_Score=73, Evalue=7e-13
Organism=Drosophila melanogaster, GI24651739, Length=439, Percent_Identity=21.1845102505695, Blast_Score=71, Evalue=2e-12
Organism=Drosophila melanogaster, GI221459588, Length=413, Percent_Identity=22.7602905569007, Blast_Score=69, Evalue=1e-11
Organism=Drosophila melanogaster, GI24641117, Length=299, Percent_Identity=22.742474916388, Blast_Score=67, Evalue=4e-11
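The homologue entries above all share the same field layout, so they can be read into structured records with one regular expression. The sketch below is only an illustration: the `Hit` type, the `parse_hits` function, and the pattern are assumptions made for this example, and the sample text reuses two entries from the list.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# Field order follows the entries in the Homologues list; whitespace
# (including line breaks) between fields is tolerated.
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<pid>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> List[Hit]:
    """Extract every homologue entry found in text."""
    return [
        Hit(
            organism=m["organism"].strip(),
            gi=m["gi"],
            length=int(m["length"]),
            percent_identity=float(m["pid"]),
            blast_score=int(m["score"]),
            evalue=float(m["evalue"]),
        )
        for m in HIT_PATTERN.finditer(text)
    ]


sample = (
    "Organism=Homo sapiens, GI110835708, Length=461, "
    "Percent_Identity=29.0672451193059, Blast_Score=184, Evalue=3e-46, "
    "Organism=Escherichia coli, GI1790113, Length=571, "
    "Percent_Identity=98.5989492119089, Blast_Score=1130, Evalue=0.0,"
)
hits = parse_hits(sample)
best = max(hits, key=lambda h: h.blast_score)  # the Escherichia coli entry
```

Sorting the parsed hits by `blast_score` or `evalue` makes it easy to see that the Escherichia coli entry GI1790113 is by far the closest match in the list.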
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001734
- InterPro: IPR018212
- InterPro: IPR019900 [H]
Pfam domain/function: PF00474 SSF [H]
EC number: NA
Molecular weight: Translated: 62138; Mature: 62138
Theoretical pI: Translated: 9.47; Mature: 9.47
Prosite motif: PS00456 NA_SOLUT_SYMP_1 ; PS50283 NA_SOLUT_SYMP_3
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.9 %Cys (Translated Protein)
3.0 %Met (Translated Protein)
3.9 %Cys+Met (Translated Protein)
0.9 %Cys (Mature Protein)
3.0 %Met (Mature Protein)
3.9 %Cys+Met (Mature Protein)
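The Cys/Met percentages can be reproduced by counting residues in the protein sequence. The sketch below assumes the percentage is a plain residue count divided by sequence length and rounded to one decimal place; the function name and the toy sequence are illustrative only.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in protein occupied by any of the given residues."""
    seq = "".join(protein.split()).upper()  # drop spaces and line breaks
    if not seq:
        raise ValueError("empty protein sequence")
    hits = sum(aa in residues for aa in seq)
    return round(100.0 * hits / len(seq), 1)


toy = "MKCLMA"                      # hypothetical 6-residue peptide
cys = residue_percent(toy, "C")     # 1 of 6 residues
met = residue_percent(toy, "M")     # 2 of 6 residues
both = residue_percent(toy, "CM")   # 3 of 6 residues
```

For the 571-residue sequence in this record, the listed values of 0.9 %Cys and 3.0 %Met correspond to 5 cysteines and 17 methionines.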
Secondary structure:
>Translated Secondary Structure MNSLQILSFVGFTLLVAVITWWKVRKTDTGSQQGYFLAGRSLKAPVIAASLMLTNLSTEQ CCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCHHHHHHHHHHCCCHHH LVGLSGQAYKSGMSVMGWEVTSAVTLIFLALIFLPRYLKRGIATIPDFLEERYDKTTRII HHCCCCHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH IDFCFLIATGVCFLPIVLYSGALALNSLFHVGESLQISHGAAIWLLVILLGLAGILYAVI HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCHHHHHHHHHHHHHHHHHHHHH GGLRAMAVADSINGIGLVIGGLMVPVFGLIAIGKGSFMQGIEQITTVHAEKLNSVGGPTD HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCCC PLPIGAAFTGLILVNTFYWCTNQGIVQRTLASKSLAEGQKGALLTAVLKMLDPLVLVLPG CCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH LIAFHLYQDLPKADMAYPTLVNNVLPVPLEGFFGAVLFGAVISTFNGFLNSASTLFSMGI HHHHHHHHHCCCCCCCHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH YRRIINQNAEPQQLVTVGRKFGFFIAIVSVLVAPWIANAPQGLYSWMKQLNGIYNVPLVT HHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCHHHHH IIIMGFFFPRIPALAAKVAMGIGIISYITINYLVKFDFHFLYVLACTFCINVVVMLVIGF HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH IKPRATPFTFKDAFAVDMKPWKNVKIASIGILFAMIGVYVGLAEFGGYGTRWLAMISYFI HCCCCCCCEECCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH AAVVIVYLIFDSWWHRHDPAVTFTPDAKDSL HHHHHHHHHHHHHHHCCCCCEEECCCCCCCC >Mature Secondary Structure MNSLQILSFVGFTLLVAVITWWKVRKTDTGSQQGYFLAGRSLKAPVIAASLMLTNLSTEQ CCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCHHHHHHHHHHCCCHHH LVGLSGQAYKSGMSVMGWEVTSAVTLIFLALIFLPRYLKRGIATIPDFLEERYDKTTRII HHCCCCHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH IDFCFLIATGVCFLPIVLYSGALALNSLFHVGESLQISHGAAIWLLVILLGLAGILYAVI HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCHHHHHHHHHHHHHHHHHHHHH GGLRAMAVADSINGIGLVIGGLMVPVFGLIAIGKGSFMQGIEQITTVHAEKLNSVGGPTD HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCCC PLPIGAAFTGLILVNTFYWCTNQGIVQRTLASKSLAEGQKGALLTAVLKMLDPLVLVLPG CCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH LIAFHLYQDLPKADMAYPTLVNNVLPVPLEGFFGAVLFGAVISTFNGFLNSASTLFSMGI HHHHHHHHHCCCCCCCHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH 
YRRIINQNAEPQQLVTVGRKFGFFIAIVSVLVAPWIANAPQGLYSWMKQLNGIYNVPLVT HHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCHHHHH IIIMGFFFPRIPALAAKVAMGIGIISYITINYLVKFDFHFLYVLACTFCINVVVMLVIGF HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH IKPRATPFTFKDAFAVDMKPWKNVKIASIGILFAMIGVYVGLAEFGGYGTRWLAMISYFI HCCCCCCCEECCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH AAVVIVYLIFDSWWHRHDPAVTFTPDAKDSL HHHHHHHHHHHHHHHCCCCCEEECCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 7686882; 9278503 [H]