Definition Herpetosiphon aurantiacus ATCC 23779 chromosome, complete genome.
Accession NC_009972
Length 6,346,587

The map label for this gene is yfmR [H]

Identifier: 159897247

GI number: 159897247

Start: 825693

End: 827573

Strand: Direct

Name: yfmR [H]

Synonym: Haur_0718

Alternate gene names: 159897247

Gene position: 825693-827573 (Clockwise)

Preceding gene: 159897246

Following gene: 159897248

Centisome position: 13.01
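
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes inclusive 1-based coordinates and assumes that the centisome position is the start coordinate expressed as a percentage of the chromosome length; the record states neither convention, and the function names are illustrative only.

```python
GENOME_LENGTH = 6_346_587       # "Length" field of NC_009972
START, END = 825_693, 827_573   # "Start" and "End" fields of this gene


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residue count of a coding sequence, excluding the terminal stop codon."""
    return n_bases // 3 - 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the chromosome (assumed definition)."""
    return 100.0 * start / genome_length


n_bases = gene_length(START, END)
print(n_bases)                                     # 1881
print(protein_length(n_bases))                     # 626
print(round(centisome(START, GENOME_LENGTH), 2))   # 13.01
```

Under these assumptions the results agree with the 1881-base gene sequence, the 626-residue protein, and the reported centisome position.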

GC content: 50.77

Gene sequence:

>1881_bases
ATGAACATTATCAATTTAGAAACTATTTCCAAAGCCTATGGCCCTAAAGTGTTGTTTGAAAATATTTCCTTGGGCTTAGC
AAGCGGCGATCGAATTGGCTTAATCGGGGTCAATGGCAGTGGTAAATCGACCTTATTAAAAATTGCTGCTGGTTTAGAAC
AACCAGATACTGGCCGCGTAACTTTGCGTAAAGATTGTCGCATTGGCTATTTGGCTCAAGATCCGCCGATGGATGCTAAC
CAAACGGTGTTGGAATATGTGTATGCGGCGGCAGGCGAAGGCGCAACCTTACTTAGCCAATATACGCTCGCTAGCGTGCA
GCTTGAGCAAGACCCGAACGATGCTAAAACGCTGGCCGCTGTTGCCGAATTGAGCGAACGGCTGACCACGCTTGATGCTT
GGGATACTGAGTTGAGCGCTCGCACGGTGCTTTCGCGACTTGGCATTACCACTCTTGATGCCCAACTTGGCACGCTCTCT
GGCGGCCAACGCAAGCGCGTGGCGATGGCGCGGGTGCTGATCGAAAAGCCCGATTTGCTGATTCTCGATGAGCCAACCAA
CCATATCGACCCAACCACCGTAGCTTGGCTGGAGGGTTATTTAGCCAATTTACAAGGCGCGTTACTGTTGATCACCCACG
ATCGCTACTTTTTGGATCGGGTGGTGACGGCGATTGTTGAGCTTGAAGATCATCAACTCTTCTCCTACCCTGGTAATTAC
GAGCGCTTTGTGGTCGAGCGGATTGAGCGCGAACGCCAACGTGCCAAGGCCGAGCTTGACCATCGCAACGAGGTGCGCCG
CGAATTGGCTTGGTTGCGTCAAGGTGCGCAAGCCCGAACCACCAAACAACAAGCCCGCGTCGATCGCGCCAATGCCTTGA
TCAACCAAGAGCGTCGTCAAGAACGTGGCACGTTAGATCTTGAATCGACAGGCCGACGGATTGGCAAAAAGCTGATCGAA
ATGCATGGCCTGAATAAGCAAATTGCTGGCAAAACCTTGTTGAGCAATTTTGAATATCAGCTTACTCGCGACGATCGGCT
GGGAATTATCGGCCCCAACGGGGTTGGTAAATCGACCTTGCTCAATTTGATCGCAGGCCGTTTACAACCCGATAGTGGCG
AATTAGTGGTTGGCGAAACTATCCACGTGGCCTATTACGATCAAAGTAGCAGCGATCTTAACCCCAACCAACGCTTGATC
GATTATGTCAGTGATGGCGCTGAGTTGGTGCAAACAGGCGAGGGTTTGCGCACCGCCAGCCAAATGCTTGAGCGCTTTTT
GTTCCCCAATAATCAGCACTGGGATTATATCCACAGCTTATCGGGGGGCGAACGCCGCCGCCTTTATTTATTGCGAACCC
TGATGCGCAATCCCAATGTGTTGCTGCTCGACGAACCCAGCAACGATCTTGATGTGCAAACCATTGCGATTCTGGAAGAA
TACATTGAGCAATTTAATGGCGCAGTGATTATCGTCTCGCACGATCGGGCTTTTCTCGATAACACGGTTGATCATTTGCT
GATTTTCGAGGGCGATGGCAAGGTTCGTCACTTTCCAGGCGATTACTCGGCCTATCGTGAAGTCTACGAACGTGAGCAAG
CAACCCTCAAGGCTGTAACCAAACCGTCAACTCAAGAGCGGCCACGCGAGCAAAAACCGCGTAAACTGAGCTTCAAAGAG
CAACGCGAATTAAGCGAACTCGAAAAAACTATCGCCAACTTGGAAGCCCGCCAAAACGAATTGAATGCAGCATTGAATAA
CGCAGGTAGCGATTATCAAGCCTATACCCGGTTGCACACCGAACTTGAGCAGGTCAGCCAAACCCTAGAACAAAGCTACG
AACGCTGGATGGAACTTTCTGAGCTAGCGTCGGCTTCGTAA
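
The GC content field can in principle be recomputed from a sequence like the one above. The following is a minimal sketch, assuming GC content means the share of G and C among the unambiguous A/C/G/T bases; the record does not say how its value of 50.77 was derived, so the helper `gc_percent` is a hypothetical illustration rather than the database's method.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases in seq (assumed definition)."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    total = gc + seq.count("A") + seq.count("T")
    if total == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return 100.0 * gc / total


# First 80-base line of the gene sequence shown above; pass the full
# 1881-base sequence to compare against the reported GC content field.
first_line = (
    "ATGAACATTATCAATTTAGAAACTATTTCCAAAGCCTATGGCCCTAAAGTGTTGTTTGAAAATATTTCCTTGGGCTTAGC"
)
print(round(gc_percent(first_line), 2))
```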

Upstream 100 bases:

>100_bases
CCACAGGCCGCAATTCGTTCCGCCCCGACCTTGCCAATATCAATGTTGGGTTTCGGTGTGCACAGGATGTTGCACCATAA
GCTAGATGTGTTAGGAATGC

Downstream 100 bases:

>100_bases
TTTTCAGCGCATGGATGGAGCACAGGAATGCAAGCAACACCCGAACTAATTTTGATTGTGGATGATCAAACCACCAATGT
GATGGTGTTACAGCGGATGT

Product: ABC transporter-like protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 626; Mature: 626

Protein sequence:

>626_residues
MNIINLETISKAYGPKVLFENISLGLASGDRIGLIGVNGSGKSTLLKIAAGLEQPDTGRVTLRKDCRIGYLAQDPPMDAN
QTVLEYVYAAAGEGATLLSQYTLASVQLEQDPNDAKTLAAVAELSERLTTLDAWDTELSARTVLSRLGITTLDAQLGTLS
GGQRKRVAMARVLIEKPDLLILDEPTNHIDPTTVAWLEGYLANLQGALLLITHDRYFLDRVVTAIVELEDHQLFSYPGNY
ERFVVERIERERQRAKAELDHRNEVRRELAWLRQGAQARTTKQQARVDRANALINQERRQERGTLDLESTGRRIGKKLIE
MHGLNKQIAGKTLLSNFEYQLTRDDRLGIIGPNGVGKSTLLNLIAGRLQPDSGELVVGETIHVAYYDQSSSDLNPNQRLI
DYVSDGAELVQTGEGLRTASQMLERFLFPNNQHWDYIHSLSGGERRRLYLLRTLMRNPNVLLLDEPSNDLDVQTIAILEE
YIEQFNGAVIIVSHDRAFLDNTVDHLLIFEGDGKVRHFPGDYSAYREVYEREQATLKAVTKPSTQERPREQKPRKLSFKE
QRELSELEKTIANLEARQNELNAALNNAGSDYQAYTRLHTELEQVSQTLEQSYERWMELSELASAS

Sequences:

>Translated_626_residues
MNIINLETISKAYGPKVLFENISLGLASGDRIGLIGVNGSGKSTLLKIAAGLEQPDTGRVTLRKDCRIGYLAQDPPMDAN
QTVLEYVYAAAGEGATLLSQYTLASVQLEQDPNDAKTLAAVAELSERLTTLDAWDTELSARTVLSRLGITTLDAQLGTLS
GGQRKRVAMARVLIEKPDLLILDEPTNHIDPTTVAWLEGYLANLQGALLLITHDRYFLDRVVTAIVELEDHQLFSYPGNY
ERFVVERIERERQRAKAELDHRNEVRRELAWLRQGAQARTTKQQARVDRANALINQERRQERGTLDLESTGRRIGKKLIE
MHGLNKQIAGKTLLSNFEYQLTRDDRLGIIGPNGVGKSTLLNLIAGRLQPDSGELVVGETIHVAYYDQSSSDLNPNQRLI
DYVSDGAELVQTGEGLRTASQMLERFLFPNNQHWDYIHSLSGGERRRLYLLRTLMRNPNVLLLDEPSNDLDVQTIAILEE
YIEQFNGAVIIVSHDRAFLDNTVDHLLIFEGDGKVRHFPGDYSAYREVYEREQATLKAVTKPSTQERPREQKPRKLSFKE
QRELSELEKTIANLEARQNELNAALNNAGSDYQAYTRLHTELEQVSQTLEQSYERWMELSELASAS
>Mature_626_residues
MNIINLETISKAYGPKVLFENISLGLASGDRIGLIGVNGSGKSTLLKIAAGLEQPDTGRVTLRKDCRIGYLAQDPPMDAN
QTVLEYVYAAAGEGATLLSQYTLASVQLEQDPNDAKTLAAVAELSERLTTLDAWDTELSARTVLSRLGITTLDAQLGTLS
GGQRKRVAMARVLIEKPDLLILDEPTNHIDPTTVAWLEGYLANLQGALLLITHDRYFLDRVVTAIVELEDHQLFSYPGNY
ERFVVERIERERQRAKAELDHRNEVRRELAWLRQGAQARTTKQQARVDRANALINQERRQERGTLDLESTGRRIGKKLIE
MHGLNKQIAGKTLLSNFEYQLTRDDRLGIIGPNGVGKSTLLNLIAGRLQPDSGELVVGETIHVAYYDQSSSDLNPNQRLI
DYVSDGAELVQTGEGLRTASQMLERFLFPNNQHWDYIHSLSGGERRRLYLLRTLMRNPNVLLLDEPSNDLDVQTIAILEE
YIEQFNGAVIIVSHDRAFLDNTVDHLLIFEGDGKVRHFPGDYSAYREVYEREQATLKAVTKPSTQERPREQKPRKLSFKE
QRELSELEKTIANLEARQNELNAALNNAGSDYQAYTRLHTELEQVSQTLEQSYERWMELSELASAS

Specific function: Unknown

COG id: COG0488

COG function: function code R; ATPase components of ABC transporters with duplicated ATPase domains

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 2 ABC transporter domains [H]

Homologues:

Organism=Homo sapiens, GI10947137, Length=544, Percent_Identity=30.5147058823529, Blast_Score=209, Evalue=9e-54,
Organism=Homo sapiens, GI27881506, Length=544, Percent_Identity=30.5147058823529, Blast_Score=208, Evalue=1e-53,
Organism=Homo sapiens, GI69354671, Length=550, Percent_Identity=33.6363636363636, Blast_Score=206, Evalue=8e-53,
Organism=Homo sapiens, GI10947135, Length=550, Percent_Identity=33.6363636363636, Blast_Score=205, Evalue=9e-53,
Organism=Homo sapiens, GI148612853, Length=541, Percent_Identity=27.7264325323475, Blast_Score=199, Evalue=8e-51,
Organism=Escherichia coli, GI1787182, Length=638, Percent_Identity=35.423197492163, Blast_Score=379, Evalue=1e-106,
Organism=Escherichia coli, GI2367384, Length=532, Percent_Identity=36.4661654135338, Blast_Score=362, Evalue=1e-101,
Organism=Escherichia coli, GI1789751, Length=613, Percent_Identity=32.137030995106, Blast_Score=262, Evalue=5e-71,
Organism=Escherichia coli, GI1787041, Length=534, Percent_Identity=31.8352059925094, Blast_Score=247, Evalue=2e-66,
Organism=Escherichia coli, GI145693107, Length=613, Percent_Identity=24.3066884176183, Blast_Score=88, Evalue=1e-18,
Organism=Escherichia coli, GI1788761, Length=237, Percent_Identity=29.535864978903, Blast_Score=85, Evalue=1e-17,
Organism=Escherichia coli, GI1788225, Length=222, Percent_Identity=27.4774774774775, Blast_Score=80, Evalue=5e-16,
Organism=Escherichia coli, GI1786563, Length=189, Percent_Identity=30.6878306878307, Blast_Score=76, Evalue=7e-15,
Organism=Escherichia coli, GI1786398, Length=349, Percent_Identity=26.0744985673352, Blast_Score=75, Evalue=9e-15,
Organism=Escherichia coli, GI48994997, Length=225, Percent_Identity=28.8888888888889, Blast_Score=74, Evalue=2e-14,
Organism=Escherichia coli, GI1787758, Length=205, Percent_Identity=25.3658536585366, Blast_Score=73, Evalue=6e-14,
Organism=Escherichia coli, GI87081709, Length=238, Percent_Identity=24.3697478991597, Blast_Score=72, Evalue=9e-14,
Organism=Escherichia coli, GI1786698, Length=192, Percent_Identity=31.25, Blast_Score=72, Evalue=1e-13,
Organism=Escherichia coli, GI1787547, Length=195, Percent_Identity=29.2307692307692, Blast_Score=71, Evalue=2e-13,
Organism=Escherichia coli, GI1788506, Length=256, Percent_Identity=29.296875, Blast_Score=71, Evalue=2e-13,
Organism=Escherichia coli, GI1787164, Length=201, Percent_Identity=27.8606965174129, Blast_Score=70, Evalue=6e-13,
Organism=Escherichia coli, GI48994883, Length=239, Percent_Identity=25.5230125523013, Blast_Score=69, Evalue=1e-12,
Organism=Escherichia coli, GI1789593, Length=227, Percent_Identity=26.8722466960352, Blast_Score=69, Evalue=1e-12,
Organism=Escherichia coli, GI1786872, Length=239, Percent_Identity=27.1966527196653, Blast_Score=68, Evalue=2e-12,
Organism=Escherichia coli, GI1789962, Length=201, Percent_Identity=24.8756218905473, Blast_Score=67, Evalue=3e-12,
Organism=Escherichia coli, GI1787370, Length=230, Percent_Identity=24.3478260869565, Blast_Score=65, Evalue=1e-11,
Organism=Escherichia coli, GI87081782, Length=210, Percent_Identity=28.0952380952381, Blast_Score=64, Evalue=2e-11,
Organism=Escherichia coli, GI1787029, Length=217, Percent_Identity=25.8064516129032, Blast_Score=64, Evalue=3e-11,
Organism=Escherichia coli, GI1786703, Length=206, Percent_Identity=30.0970873786408, Blast_Score=64, Evalue=3e-11,
Organism=Escherichia coli, GI1787089, Length=234, Percent_Identity=26.9230769230769, Blast_Score=64, Evalue=4e-11,
Organism=Escherichia coli, GI1787500, Length=247, Percent_Identity=24.6963562753036, Blast_Score=64, Evalue=4e-11,
Organism=Escherichia coli, GI1789586, Length=229, Percent_Identity=25.764192139738, Blast_Score=63, Evalue=5e-11,
Organism=Escherichia coli, GI1790467, Length=202, Percent_Identity=29.2079207920792, Blast_Score=63, Evalue=6e-11,
Organism=Escherichia coli, GI1786654, Length=219, Percent_Identity=29.6803652968037, Blast_Score=63, Evalue=6e-11,
Organism=Caenorhabditis elegans, GI17559834, Length=550, Percent_Identity=29.4545454545455, Blast_Score=230, Evalue=2e-60,
Organism=Caenorhabditis elegans, GI17553372, Length=562, Percent_Identity=31.6725978647687, Blast_Score=194, Evalue=1e-49,
Organism=Caenorhabditis elegans, GI17555318, Length=535, Percent_Identity=27.2897196261682, Blast_Score=184, Evalue=2e-46,
Organism=Caenorhabditis elegans, GI115534520, Length=220, Percent_Identity=30, Blast_Score=73, Evalue=4e-13,
Organism=Saccharomyces cerevisiae, GI6320874, Length=527, Percent_Identity=29.9810246679317, Blast_Score=208, Evalue=2e-54,
Organism=Saccharomyces cerevisiae, GI6321121, Length=551, Percent_Identity=28.6751361161525, Blast_Score=186, Evalue=9e-48,
Organism=Saccharomyces cerevisiae, GI6323278, Length=393, Percent_Identity=29.2620865139949, Blast_Score=137, Evalue=6e-33,
Organism=Saccharomyces cerevisiae, GI6324314, Length=396, Percent_Identity=28.5353535353535, Blast_Score=137, Evalue=8e-33,
Organism=Saccharomyces cerevisiae, GI6325030, Length=389, Percent_Identity=28.2776349614396, Blast_Score=125, Evalue=2e-29,
Organism=Drosophila melanogaster, GI24666836, Length=555, Percent_Identity=33.3333333333333, Blast_Score=228, Evalue=1e-59,
Organism=Drosophila melanogaster, GI24641342, Length=551, Percent_Identity=31.5789473684211, Blast_Score=218, Evalue=1e-56,
Organism=Drosophila melanogaster, GI24642252, Length=520, Percent_Identity=30.1923076923077, Blast_Score=207, Evalue=2e-53,
Organism=Drosophila melanogaster, GI18859989, Length=520, Percent_Identity=30.1923076923077, Blast_Score=207, Evalue=2e-53,
Organism=Drosophila melanogaster, GI116007184, Length=174, Percent_Identity=29.8850574712644, Blast_Score=76, Evalue=6e-14,
Organism=Drosophila melanogaster, GI221500365, Length=174, Percent_Identity=29.8850574712644, Blast_Score=76, Evalue=7e-14,
Organism=Drosophila melanogaster, GI24641565, Length=201, Percent_Identity=29.8507462686567, Blast_Score=67, Evalue=4e-11,
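
The homologue lines above follow a regular comma-separated layout, so they can be loaded into records for sorting or filtering. This is a minimal sketch, assuming every line has the same fields as those shown (a bare `GI` number plus `key=value` pairs) and that organism names contain no commas; `parse_hit` is a hypothetical helper, not part of any database tool.

```python
def parse_hit(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=..., ...' homologue line."""
    hit = {}
    # Drop the trailing comma that ends each line, then split into fields.
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if "=" in field:
            key, value = field.split("=", 1)
            hit[key.strip()] = value.strip()
        elif field.startswith("GI"):
            hit["GI"] = field[2:]  # the GI number is written without '='
    # Convert numeric fields; key names are taken from the record itself.
    hit["Length"] = int(hit["Length"])
    hit["Percent_Identity"] = float(hit["Percent_Identity"])
    hit["Blast_Score"] = int(hit["Blast_Score"])
    hit["Evalue"] = float(hit["Evalue"])
    return hit


line = ("Organism=Escherichia coli, GI1787182, Length=638, "
        "Percent_Identity=35.423197492163, Blast_Score=379, Evalue=1e-106,")
hit = parse_hit(line)
print(hit["Organism"], hit["GI"], hit["Blast_Score"], hit["Evalue"])
# Escherichia coli 1787182 379 1e-106
```

Parsed this way, the hits can be ranked by `Blast_Score` or filtered by `Evalue` without editing the listing itself.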

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003439
- InterPro:   IPR017871
- InterPro:   IPR003593 [H]

Pfam domain/function: PF00005 ABC_tran [H]

EC number: NA

Molecular weight: Translated: 70492; Mature: 70492

Theoretical pI: Translated: 5.21; Mature: 5.21

Prosite motif: PS00211 ABC_TRANSPORTER_1 ; PS50893 ABC_TRANSPORTER_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
1.1 %Met     (Translated Protein)
1.3 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
1.1 %Met     (Mature Protein)
1.3 %Cys+Met (Mature Protein)
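
These percentages are consistent with 1 Cys and 7 Met residues in the 626-residue protein sequence above. The sketch below shows the arithmetic, assuming the values are residue counts divided by sequence length and rounded to one decimal place; `cys_met_percent` is a hypothetical helper, and `stand_in` is a placeholder string with the same Cys and Met counts rather than the real sequence.

```python
def cys_met_percent(protein: str) -> dict:
    """Cys, Met and combined content as percentages of sequence length."""
    n = len(protein)
    if n == 0:
        raise ValueError("empty protein sequence")
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


# Placeholder with 1 Cys, 7 Met and 626 residues in total; substitute the
# real protein sequence to repeat the check on the actual data.
stand_in = "C" + "M" * 7 + "A" * 618
print(cys_met_percent(stand_in))
# {'Cys': 0.2, 'Met': 1.1, 'Cys+Met': 1.3}
```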

Secondary structure:

>Translated Secondary Structure
MNIINLETISKAYGPKVLFENISLGLASGDRIGLIGVNGSGKSTLLKIAAGLEQPDTGRV
CCEEEHHHHHHHCCCCEEECCCCEEECCCCEEEEEEECCCCHHHHHHHHHCCCCCCCCCE
TLRKDCRIGYLAQDPPMDANQTVLEYVYAAAGEGATLLSQYTLASVQLEQDPNDAKTLAA
EEECCCCEEEEECCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHEEEECCCCCHHHHHHH
VAELSERLTTLDAWDTELSARTVLSRLGITTLDAQLGTLSGGQRKRVAMARVLIEKPDLL
HHHHHHHHHHHHHCCCHHHHHHHHHHCCCEEECCHHCCCCCCCHHHHHHHHHHHCCCCEE
ILDEPTNHIDPTTVAWLEGYLANLQGALLLITHDRYFLDRVVTAIVELEDHQLFSYPGNY
EEECCCCCCCCHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHCCCEEECCCCCH
ERFVVERIERERQRAKAELDHRNEVRRELAWLRQGAQARTTKQQARVDRANALINQERRQ
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHH
ERGTLDLESTGRRIGKKLIEMHGLNKQIAGKTLLSNFEYQLTRDDRLGIIGPNGVGKSTL
HCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCEEEECCCCEEEECCCCCCHHHH
LNLIAGRLQPDSGELVVGETIHVAYYDQSSSDLNPNQRLIDYVSDGAELVQTGEGLRTAS
HHHHHHCCCCCCCCEEECCEEEEEEECCCCCCCCCHHHHHHHHHCCHHHHHCCCCHHHHH
QMLERFLFPNNQHWDYIHSLSGGERRRLYLLRTLMRNPNVLLLDEPSNDLDVQTIAILEE
HHHHHHHCCCCCCCHHHHHCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHH
YIEQFNGAVIIVSHDRAFLDNTVDHLLIFEGDGKVRHFPGDYSAYREVYEREQATLKAVT
HHHHHCCEEEEEECCHHHHHCCCCEEEEECCCCCEEECCCCHHHHHHHHHHHHHHHHHHC
KPSTQERPREQKPRKLSFKEQRELSELEKTIANLEARQNELNAALNNAGSDYQAYTRLHT
CCCCCCCCHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHH
ELEQVSQTLEQSYERWMELSELASAS
HHHHHHHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MNIINLETISKAYGPKVLFENISLGLASGDRIGLIGVNGSGKSTLLKIAAGLEQPDTGRV
CCEEEHHHHHHHCCCCEEECCCCEEECCCCEEEEEEECCCCHHHHHHHHHCCCCCCCCCE
TLRKDCRIGYLAQDPPMDANQTVLEYVYAAAGEGATLLSQYTLASVQLEQDPNDAKTLAA
EEECCCCEEEEECCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHEEEECCCCCHHHHHHH
VAELSERLTTLDAWDTELSARTVLSRLGITTLDAQLGTLSGGQRKRVAMARVLIEKPDLL
HHHHHHHHHHHHHCCCHHHHHHHHHHCCCEEECCHHCCCCCCCHHHHHHHHHHHCCCCEE
ILDEPTNHIDPTTVAWLEGYLANLQGALLLITHDRYFLDRVVTAIVELEDHQLFSYPGNY
EEECCCCCCCCHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHCCCEEECCCCCH
ERFVVERIERERQRAKAELDHRNEVRRELAWLRQGAQARTTKQQARVDRANALINQERRQ
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHH
ERGTLDLESTGRRIGKKLIEMHGLNKQIAGKTLLSNFEYQLTRDDRLGIIGPNGVGKSTL
HCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCEEEECCCCEEEECCCCCCHHHH
LNLIAGRLQPDSGELVVGETIHVAYYDQSSSDLNPNQRLIDYVSDGAELVQTGEGLRTAS
HHHHHHCCCCCCCCEEECCEEEEEEECCCCCCCCCHHHHHHHHHCCHHHHHCCCCHHHHH
QMLERFLFPNNQHWDYIHSLSGGERRRLYLLRTLMRNPNVLLLDEPSNDLDVQTIAILEE
HHHHHHHCCCCCCCHHHHHCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHH
YIEQFNGAVIIVSHDRAFLDNTVDHLLIFEGDGKVRHFPGDYSAYREVYEREQATLKAVT
HHHHHCCEEEEEECCHHHHHCCCCEEEEECCCCCEEECCCCHHHHHHHHHHHHHHHHHHC
KPSTQERPREQKPRKLSFKEQRELSELEKTIANLEARQNELNAALNNAGSDYQAYTRLHT
CCCCCCCCHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHH
ELEQVSQTLEQSYERWMELSELASAS
HHHHHHHHHHHHHHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: Endonuclease; Excision [C]

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9141694; 9384377 [H]