| Definition | Trichodesmium erythraeum IMS101 chromosome, complete genome. |
|---|---|
| Accession | NC_008312 |
| Length | 7,750,108 |
The map label for this gene is yfmR [H]
Identifier: 113474190
GI number: 113474190
Start: 457720
End: 459648
Strand: Direct
Name: yfmR [H]
Synonym: Tery_0299
Alternate gene names: 113474190
Gene position: 457720-459648 (Clockwise)
Preceding gene: 113474189
Following gene: 113474191
Centisome position: 5.91
GC content: 33.28
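The positional fields above are mutually consistent, which the following minimal sketch illustrates. It assumes that the coordinates are 1-based and inclusive, and that the centisome position is the start coordinate expressed as a percentage of the chromosome length; the function names are hypothetical and are not part of the source database.

```python
GENOME_LENGTH = 7_750_108       # chromosome length from the record header
START, END = 457_720, 459_648   # gene coordinates (assumed 1-based, inclusive)

def gene_length(start: int, end: int) -> int:
    """Number of bases spanned by an inclusive coordinate pair."""
    return end - start + 1

def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome length (assumed definition)."""
    return round(100 * start / genome_length, 2)

def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    return round(100 * (seq.count("G") + seq.count("C")) / len(seq), 2)

length = gene_length(START, END)
print(length)                            # 1929 bases
print(length // 3 - 1)                   # 642 codons once the stop codon is excluded
print(centisome(START, GENOME_LENGTH))   # 5.91
```

Passing the gene sequence listed in the record to `gc_percent` should give a value comparable to the GC content field, provided the spaces in the flattened sequence are removed first.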
Gene sequence:
>1929_bases ATGAGTATGTTCACACTTCAATCAGTCAAAAAAGACTTTGGTATCAAAGAAATTTTAAAAGATGCCAGCTTCAGTTTAGA TCCTAAGGACAAAGTTGGTTTAATTGGTACAAATGGCTCAGGAAAATCAACTCTTTTAAAAATGATTGCTGGTTTAGAAC CCACAGATAGCGGTGAAATTTTAGTTAATCAAAATGTGAGGATTATTTATTTGCCTCAACAACCAGATTTGAATGAAAAT AATACTGTATTAGAACAGGTTTTTGCCAACAGTGGTAAACAAACGGCACTGATAAAAGAGTACCAAGAACTATCAGAAAA ACTAGCTCATTATCCAGAAGATAGTCAGTTAATGGCGCGTTTTTCACAGGTAACTCAAAAGATGAATTATACTAATGCTT GGGAATTAGAAACTAAGGCTAAAATCATCTTGAGTAAATTAGGAATTCAAGATTTTGAGGTTAAAGTTGGTAATTTATCT GGGGGTTATCGTAAACGAATTGCTTTAGGGACAGCCTTACTTTCTGAACCTGATGTTTTGTTAATGGATGAGCCAACAAA CCATCTAGATGCTATGTCTGTAGAATGGTTACAAAGTTACTTAAAGACTTTTTCTGGAGCAATTTTATTAATTACTCACG ACCGTTATTTTCTTGACCGGGTTACTAATAAAATTCTGGAAATAGACCGAGGAGATATTTACACTTATTCTGGTAACTAT TCCTACTATTTAGAAAAAAAAGCTTTAGCTGAAGAATCTATCGTAAGTTCTCAAAAGAAGCATAAAGGTCTATTACGTAG AGAATTAGAATGGTTAAAACGAGGACCAAAAGCTCGCAGCACTAAACAAAAAGCTCGAATTCAACGCATAGAAGAAATGC AAGGAAGAGAATTCAAAGAATCTTTAGGAAAGGTAGATATTTCTACTGCTGGTCGTCGAATTGGCAAAAAGGTTATTGAG TTAGAAAATATTTCTAAATCTTATAATGGTAGAACTATAATTAAAGATTTTACTTATGAGTTTATCCCAGAAGATAGAGT AGGAATTATTGGTAGTAATGGTATGGGAAAATCAACCTTAATGGATATTATTACAGGTAGGGTAAAACCTGATTATGGCA AAGTAGAAATTGGGACAACAATACATATAGGTTATTTTGACCAATATTCAGAAAACTTGCTGGATGCTGGGGCTGAAAAT CAACGGGTTATAGATTATATAAAAGATGTGGCAGAATATGTTCAAACTGCAGATGGTACTCAAATTAGTGCTTCTCAAAT GTTGGAAAGATTTATGTTTCCAGGTAATCAACAATATGCTCCTATCTATAAACTTTCTGGTGGGGAAAAAAGACGATTAT TTCTGTTGAAAGTATTGATGAGTGCTCCTAATGTTTTGATTTTGGATGAACCAACTAATGATTTGGATGTACAAACTTTG GCGGTGTTGGAAGAATATCTCGAAGAGTTTAATGGTTGTGTAATTGTGGTTTCTCATGATCGCTATTTTCTTGATAGAAC TGTAGATAGAATTTTTGCTTTTGAAGGGGTAGGAAATTTACGACAATATCCAGGAAATTATTCACTTTATTTAGATTATA AAGAAGCAGAAGCAACAGCAAAAAAATTTTCAGAAGTTGAGGAAAAGCTAAAAGAAAGTCAGCCTAAACAACAAAAATCA GAAAGAGTTTCTCGTGATCAAAGGGGGCAAAGAAAATTATCATCAAAAGAGAAACGAGAATATGAGAACTTAGAGAAAAA AATTGCTCAGTTAGAAACTGAGAAAGCTGAAGCAGAAAATGAACTTTATAATTCACCTCCAACAGAGGTTAGTGAAGTTC AAAAACTTCATGAAAAGGTAGAAGGTTTAGGACAAGAAATTGATATTGCTACAGAAAGATGGTTAGAGTTAGCAGAGATA GATTCTTAA
Upstream 100 bases:
>100_bases ATTTTTTACAGCGTTTGCTTTAAGCTATAATTCATGTATAATCTAAATTATAGTTGAATTAGTTAAAAAAGATTATTGTT TTATAGTTGATCATAAAATT
Downstream 100 bases:
>100_bases ATTATAGTATAATAAATAATATTTAGTAAAATATAATTCAGTTATTTCAGTTATCAGGATAATTTAATGGCATGAACTGT CAAGAACTCTTAAAAAAATA
Product: ABC transporter-like protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 642; Mature: 641
Protein sequence:
>642_residues MSMFTLQSVKKDFGIKEILKDASFSLDPKDKVGLIGTNGSGKSTLLKMIAGLEPTDSGEILVNQNVRIIYLPQQPDLNEN NTVLEQVFANSGKQTALIKEYQELSEKLAHYPEDSQLMARFSQVTQKMNYTNAWELETKAKIILSKLGIQDFEVKVGNLS GGYRKRIALGTALLSEPDVLLMDEPTNHLDAMSVEWLQSYLKTFSGAILLITHDRYFLDRVTNKILEIDRGDIYTYSGNY SYYLEKKALAEESIVSSQKKHKGLLRRELEWLKRGPKARSTKQKARIQRIEEMQGREFKESLGKVDISTAGRRIGKKVIE LENISKSYNGRTIIKDFTYEFIPEDRVGIIGSNGMGKSTLMDIITGRVKPDYGKVEIGTTIHIGYFDQYSENLLDAGAEN QRVIDYIKDVAEYVQTADGTQISASQMLERFMFPGNQQYAPIYKLSGGEKRRLFLLKVLMSAPNVLILDEPTNDLDVQTL AVLEEYLEEFNGCVIVVSHDRYFLDRTVDRIFAFEGVGNLRQYPGNYSLYLDYKEAEATAKKFSEVEEKLKESQPKQQKS ERVSRDQRGQRKLSSKEKREYENLEKKIAQLETEKAEAENELYNSPPTEVSEVQKLHEKVEGLGQEIDIATERWLELAEI DS
Sequences:
>Translated_642_residues MSMFTLQSVKKDFGIKEILKDASFSLDPKDKVGLIGTNGSGKSTLLKMIAGLEPTDSGEILVNQNVRIIYLPQQPDLNEN NTVLEQVFANSGKQTALIKEYQELSEKLAHYPEDSQLMARFSQVTQKMNYTNAWELETKAKIILSKLGIQDFEVKVGNLS GGYRKRIALGTALLSEPDVLLMDEPTNHLDAMSVEWLQSYLKTFSGAILLITHDRYFLDRVTNKILEIDRGDIYTYSGNY SYYLEKKALAEESIVSSQKKHKGLLRRELEWLKRGPKARSTKQKARIQRIEEMQGREFKESLGKVDISTAGRRIGKKVIE LENISKSYNGRTIIKDFTYEFIPEDRVGIIGSNGMGKSTLMDIITGRVKPDYGKVEIGTTIHIGYFDQYSENLLDAGAEN QRVIDYIKDVAEYVQTADGTQISASQMLERFMFPGNQQYAPIYKLSGGEKRRLFLLKVLMSAPNVLILDEPTNDLDVQTL AVLEEYLEEFNGCVIVVSHDRYFLDRTVDRIFAFEGVGNLRQYPGNYSLYLDYKEAEATAKKFSEVEEKLKESQPKQQKS ERVSRDQRGQRKLSSKEKREYENLEKKIAQLETEKAEAENELYNSPPTEVSEVQKLHEKVEGLGQEIDIATERWLELAEI DS >Mature_641_residues SMFTLQSVKKDFGIKEILKDASFSLDPKDKVGLIGTNGSGKSTLLKMIAGLEPTDSGEILVNQNVRIIYLPQQPDLNENN TVLEQVFANSGKQTALIKEYQELSEKLAHYPEDSQLMARFSQVTQKMNYTNAWELETKAKIILSKLGIQDFEVKVGNLSG GYRKRIALGTALLSEPDVLLMDEPTNHLDAMSVEWLQSYLKTFSGAILLITHDRYFLDRVTNKILEIDRGDIYTYSGNYS YYLEKKALAEESIVSSQKKHKGLLRRELEWLKRGPKARSTKQKARIQRIEEMQGREFKESLGKVDISTAGRRIGKKVIEL ENISKSYNGRTIIKDFTYEFIPEDRVGIIGSNGMGKSTLMDIITGRVKPDYGKVEIGTTIHIGYFDQYSENLLDAGAENQ RVIDYIKDVAEYVQTADGTQISASQMLERFMFPGNQQYAPIYKLSGGEKRRLFLLKVLMSAPNVLILDEPTNDLDVQTLA VLEEYLEEFNGCVIVVSHDRYFLDRTVDRIFAFEGVGNLRQYPGNYSLYLDYKEAEATAKKFSEVEEKLKESQPKQQKSE RVSRDQRGQRKLSSKEKREYENLEKKIAQLETEKAEAENELYNSPPTEVSEVQKLHEKVEGLGQEIDIATERWLELAEID S
Specific function: Unknown
COG id: COG0488
COG function: function code R; ATPase components of ABC transporters with duplicated ATPase domains
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 ABC transporter domains [H]
Homologues:
Organism=Homo sapiens, GI148612853, Length=531, Percent_Identity=26.9303201506591, Blast_Score=209, Evalue=9e-54
Organism=Homo sapiens, GI10947137, Length=535, Percent_Identity=28.0373831775701, Blast_Score=193, Evalue=3e-49
Organism=Homo sapiens, GI27881506, Length=535, Percent_Identity=28.0373831775701, Blast_Score=193, Evalue=4e-49
Organism=Homo sapiens, GI69354671, Length=529, Percent_Identity=30.0567107750473, Blast_Score=186, Evalue=7e-47
Organism=Homo sapiens, GI10947135, Length=529, Percent_Identity=30.0567107750473, Blast_Score=186, Evalue=8e-47
Organism=Homo sapiens, GI31657092, Length=200, Percent_Identity=29, Blast_Score=69, Evalue=2e-11
Organism=Homo sapiens, GI9955963, Length=211, Percent_Identity=29.3838862559242, Blast_Score=68, Evalue=3e-11
Organism=Escherichia coli, GI2367384, Length=571, Percent_Identity=37.3029772329247, Blast_Score=370, Evalue=1e-103
Organism=Escherichia coli, GI1787182, Length=647, Percent_Identity=33.0757341576507, Blast_Score=355, Evalue=6e-99
Organism=Escherichia coli, GI1789751, Length=617, Percent_Identity=29.9837925445705, Blast_Score=270, Evalue=2e-73
Organism=Escherichia coli, GI1787041, Length=544, Percent_Identity=29.5955882352941, Blast_Score=237, Evalue=2e-63
Organism=Escherichia coli, GI1786975, Length=500, Percent_Identity=25.2, Blast_Score=98, Evalue=1e-21
Organism=Escherichia coli, GI1788165, Length=203, Percent_Identity=29.5566502463054, Blast_Score=96, Evalue=7e-21
Organism=Escherichia coli, GI1789672, Length=263, Percent_Identity=27.3764258555133, Blast_Score=89, Evalue=7e-19
Organism=Escherichia coli, GI1790525, Length=549, Percent_Identity=22.2222222222222, Blast_Score=87, Evalue=3e-18
Organism=Escherichia coli, GI1786872, Length=250, Percent_Identity=27.2, Blast_Score=85, Evalue=2e-17
Organism=Escherichia coli, GI1788897, Length=528, Percent_Identity=21.0227272727273, Blast_Score=82, Evalue=1e-16
Organism=Escherichia coli, GI1786398, Length=270, Percent_Identity=28.5185185185185, Blast_Score=79, Evalue=1e-15
Organism=Escherichia coli, GI1787370, Length=254, Percent_Identity=25.9842519685039, Blast_Score=75, Evalue=1e-14
Organism=Escherichia coli, GI1789891, Length=251, Percent_Identity=25.4980079681275, Blast_Score=71, Evalue=2e-13
Organism=Escherichia coli, GI1789593, Length=221, Percent_Identity=23.0769230769231, Blast_Score=70, Evalue=3e-13
Organism=Escherichia coli, GI48994997, Length=220, Percent_Identity=23.6363636363636, Blast_Score=70, Evalue=4e-13
Organism=Escherichia coli, GI1789873, Length=226, Percent_Identity=27.8761061946903, Blast_Score=70, Evalue=5e-13
Organism=Escherichia coli, GI1788225, Length=225, Percent_Identity=26.6666666666667, Blast_Score=69, Evalue=8e-13
Organism=Escherichia coli, GI48994943, Length=260, Percent_Identity=23.8461538461538, Blast_Score=67, Evalue=4e-12
Organism=Escherichia coli, GI1787105, Length=222, Percent_Identity=27.027027027027, Blast_Score=63, Evalue=6e-11
Organism=Escherichia coli, GI48994883, Length=228, Percent_Identity=26.7543859649123, Blast_Score=62, Evalue=8e-11
Organism=Caenorhabditis elegans, GI17559834, Length=545, Percent_Identity=27.7064220183486, Blast_Score=220, Evalue=2e-57
Organism=Caenorhabditis elegans, GI17555318, Length=535, Percent_Identity=27.2897196261682, Blast_Score=186, Evalue=3e-47
Organism=Caenorhabditis elegans, GI17553372, Length=520, Percent_Identity=27.8846153846154, Blast_Score=186, Evalue=4e-47
Organism=Caenorhabditis elegans, GI17565586, Length=208, Percent_Identity=26.4423076923077, Blast_Score=73, Evalue=4e-13
Organism=Caenorhabditis elegans, GI17510237, Length=195, Percent_Identity=27.1794871794872, Blast_Score=72, Evalue=1e-12
Organism=Caenorhabditis elegans, GI193211017, Length=285, Percent_Identity=25.6140350877193, Blast_Score=69, Evalue=1e-11
Organism=Caenorhabditis elegans, GI71996809, Length=285, Percent_Identity=25.6140350877193, Blast_Score=68, Evalue=1e-11
Organism=Caenorhabditis elegans, GI193211015, Length=285, Percent_Identity=25.6140350877193, Blast_Score=68, Evalue=1e-11
Organism=Caenorhabditis elegans, GI17565938, Length=236, Percent_Identity=27.1186440677966, Blast_Score=65, Evalue=9e-11
Organism=Saccharomyces cerevisiae, GI6320874, Length=527, Percent_Identity=29.4117647058824, Blast_Score=214, Evalue=3e-56
Organism=Saccharomyces cerevisiae, GI6321121, Length=555, Percent_Identity=28.1081081081081, Blast_Score=199, Evalue=2e-51
Organism=Saccharomyces cerevisiae, GI6324314, Length=388, Percent_Identity=26.0309278350515, Blast_Score=129, Evalue=2e-30
Organism=Saccharomyces cerevisiae, GI6323278, Length=387, Percent_Identity=25.8397932816537, Blast_Score=124, Evalue=5e-29
Organism=Saccharomyces cerevisiae, GI6325030, Length=242, Percent_Identity=28.9256198347107, Blast_Score=110, Evalue=1e-24
Organism=Saccharomyces cerevisiae, GI6322980, Length=279, Percent_Identity=27.9569892473118, Blast_Score=66, Evalue=2e-11
Organism=Drosophila melanogaster, GI24666836, Length=540, Percent_Identity=28.8888888888889, Blast_Score=220, Evalue=2e-57
Organism=Drosophila melanogaster, GI24641342, Length=559, Percent_Identity=29.1592128801431, Blast_Score=212, Evalue=6e-55
Organism=Drosophila melanogaster, GI24642252, Length=556, Percent_Identity=28.5971223021583, Blast_Score=199, Evalue=6e-51
Organism=Drosophila melanogaster, GI18859989, Length=556, Percent_Identity=28.5971223021583, Blast_Score=199, Evalue=6e-51
Organism=Drosophila melanogaster, GI116007184, Length=266, Percent_Identity=27.4436090225564, Blast_Score=82, Evalue=8e-16
Organism=Drosophila melanogaster, GI221500365, Length=266, Percent_Identity=27.4436090225564, Blast_Score=82, Evalue=8e-16
Organism=Drosophila melanogaster, GI19920532, Length=195, Percent_Identity=30.2564102564103, Blast_Score=70, Evalue=5e-12
Organism=Drosophila melanogaster, GI24580930, Length=195, Percent_Identity=30.2564102564103, Blast_Score=70, Evalue=5e-12
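Each homologue entry above follows the same field pattern (Organism, GI, Length, Percent_Identity, Blast_Score, Evalue), so the list can be loaded for sorting or filtering. The following is a minimal sketch under that assumption; the `Hit` type and `parse_hits` function are hypothetical names introduced for illustration, and the sample text contains only two of the entries listed above.

```python
import re
from typing import List, NamedTuple

class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float

# One record = six comma-separated fields; \s* tolerates spaces or line breaks.
HIT_RE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*"
    r"GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<pid>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)

def parse_hits(text: str) -> List[Hit]:
    """Extract every homologue record found in `text`."""
    return [
        Hit(m["organism"].strip(), m["gi"], int(m["length"]),
            float(m["pid"]), int(m["score"]), float(m["evalue"]))
        for m in HIT_RE.finditer(text)
    ]

sample = (
    "Organism=Homo sapiens, GI148612853, Length=531, "
    "Percent_Identity=26.9303201506591, Blast_Score=209, Evalue=9e-54, "
    "Organism=Escherichia coli, GI2367384, Length=571, "
    "Percent_Identity=37.3029772329247, Blast_Score=370, Evalue=1e-103,"
)
hits = parse_hits(sample)
best = max(hits, key=lambda h: h.blast_score)
print(best.organism, best.gi)  # Escherichia coli 2367384
```

Because the pattern allows arbitrary whitespace between fields, it works whether the entries are run together on one line or listed one per line.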
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003439
- InterPro: IPR017871
- InterPro: IPR003593 [H]
Pfam domain/function: PF00005 ABC_tran [H]
EC number: NA
Molecular weight: Translated: 73185; Mature: 73054
Theoretical pI: Translated: 5.37; Mature: 5.37
Prosite motif: PS00211 ABC_TRANSPORTER_1 ; PS50893 ABC_TRANSPORTER_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.2 %Cys (Translated Protein)
2.0 %Met (Translated Protein)
2.2 %Cys+Met (Translated Protein)
0.2 %Cys (Mature Protein)
1.9 %Met (Mature Protein)
2.0 %Cys+Met (Mature Protein)
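Percentages of this kind can be recomputed from a protein sequence by simple counting. The sketch below is illustrative only: it assumes the mature form differs from the translated form solely by removal of the initiator methionine (as the Translated_642 and Mature_641 sequences in this record suggest), the function names are hypothetical, and the input is a toy sequence rather than the protein itself.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    protein = protein.upper()
    hits = sum(protein.count(r) for r in residues.upper())
    return round(100 * hits / len(protein), 1)

def mature_form(translated: str) -> str:
    """Drop the initiator Met (assumed to be the only processing step)."""
    return translated[1:] if translated.startswith("M") else translated

toy = "MACMKL"  # hypothetical toy sequence, not the yfmR protein
print(residue_percent(toy, "C"))                 # 16.7
print(residue_percent(toy, "M"))                 # 33.3
print(residue_percent(mature_form(toy), "CM"))   # 40.0
```

Values computed this way from one-decimal rounding of each residue separately need not sum exactly to the rounded combined figure.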
Secondary structure:
>Translated Secondary Structure
MSMFTLQSVKKDFGIKEILKDASFSLDPKDKVGLIGTNGSGKSTLLKMIAGLEPTDSGEI
CCCHHHHHHHHHCCHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHHHHCCCCCCCCCE
LVNQNVRIIYLPQQPDLNENNTVLEQVFANSGKQTALIKEYQELSEKLAHYPEDSQLMAR
EEECCEEEEEECCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHH
FSQVTQKMNYTNAWELETKAKIILSKLGIQDFEVKVGNLSGGYRKRIALGTALLSEPDVL
HHHHHHHCCCCCCEECHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHCCCCEE
LMDEPTNHLDAMSVEWLQSYLKTFSGAILLITHDRYFLDRVTNKILEIDRGDIYTYSGNY
EECCCCCCHHHHHHHHHHHHHHHHCCEEEEEECCHHHHHHHHHHHHCCCCCCEEEECCCE
SYYLEKKALAEESIVSSQKKHKGLLRRELEWLKRGPKARSTKQKARIQRIEEMQGREFKE
EEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHH
SLGKVDISTAGRRIGKKVIELENISKSYNGRTIIKDFTYEFIPEDRVGIIGSNGMGKSTL
HHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEHHHHHHCCCCCEEEEECCCCCHHHH
MDIITGRVKPDYGKVEIGTTIHIGYFDQYSENLLDAGAENQRVIDYIKDVAEYVQTADGT
HHHHHCCCCCCCCEEEECCEEEECCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCC
QISASQMLERFMFPGNQQYAPIYKLSGGEKRRLFLLKVLMSAPNVLILDEPTNDLDVQTL
EECHHHHHHHHHCCCCCCCCCEEEECCCCHHHHHHHHHHHHCCCEEEEECCCCCCCHHHH
AVLEEYLEEFNGCVIVVSHDRYFLDRTVDRIFAFEGVGNLRQYPGNYSLYLDYKEAEATA
HHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHHCCCCHHHHCCCCEEEEEEEHHHHHHH
KKFSEVEEKLKESQPKQQKSERVSRDQRGQRKLSSKEKREYENLEKKIAQLETEKAEAEN
HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
ELYNSPPTEVSEVQKLHEKVEGLGQEIDIATERWLELAEIDS
HHCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCC
>Mature Secondary Structure
SMFTLQSVKKDFGIKEILKDASFSLDPKDKVGLIGTNGSGKSTLLKMIAGLEPTDSGEI
CCHHHHHHHHHCCHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHHHHCCCCCCCCCE
LVNQNVRIIYLPQQPDLNENNTVLEQVFANSGKQTALIKEYQELSEKLAHYPEDSQLMAR
EEECCEEEEEECCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHH
FSQVTQKMNYTNAWELETKAKIILSKLGIQDFEVKVGNLSGGYRKRIALGTALLSEPDVL
HHHHHHHCCCCCCEECHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHCCCCEE
LMDEPTNHLDAMSVEWLQSYLKTFSGAILLITHDRYFLDRVTNKILEIDRGDIYTYSGNY
EECCCCCCHHHHHHHHHHHHHHHHCCEEEEEECCHHHHHHHHHHHHCCCCCCEEEECCCE
SYYLEKKALAEESIVSSQKKHKGLLRRELEWLKRGPKARSTKQKARIQRIEEMQGREFKE
EEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHH
SLGKVDISTAGRRIGKKVIELENISKSYNGRTIIKDFTYEFIPEDRVGIIGSNGMGKSTL
HHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEHHHHHHCCCCCEEEEECCCCCHHHH
MDIITGRVKPDYGKVEIGTTIHIGYFDQYSENLLDAGAENQRVIDYIKDVAEYVQTADGT
HHHHHCCCCCCCCEEEECCEEEECCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCC
QISASQMLERFMFPGNQQYAPIYKLSGGEKRRLFLLKVLMSAPNVLILDEPTNDLDVQTL
EECHHHHHHHHHCCCCCCCCCEEEECCCCHHHHHHHHHHHHCCCEEEEECCCCCCCHHHH
AVLEEYLEEFNGCVIVVSHDRYFLDRTVDRIFAFEGVGNLRQYPGNYSLYLDYKEAEATA
HHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHHCCCCHHHHCCCCEEEEEEEHHHHHHH
KKFSEVEEKLKESQPKQQKSERVSRDQRGQRKLSSKEKREYENLEKKIAQLETEKAEAEN
HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
ELYNSPPTEVSEVQKLHEKVEGLGQEIDIATERWLELAEIDS
HHCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9141694; 9384377 [H]