Definition | Rhodopseudomonas palustris HaA2, complete genome. |
---|---|
Accession | NC_007778 |
Length | 5,331,656 |
The map label for this gene is yheS [H]
Identifier: 86749604
GI number: 86749604
Start: 2842198
End: 2844075
Strand: Reverse
Name: yheS [H]
Synonym: RPB_2484
Alternate gene names: 86749604
Gene position: 2844075-2842198 (Counterclockwise)
Preceding gene: 86749606
Following gene: 86749600
Centisome position: 53.34
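The coordinate fields above can be cross-checked against each other. The sketch below rests on two assumptions the record does not state: that Start and End are 1-based inclusive coordinates, and that the centisome position is the gene's first base in reading direction (End, for a reverse-strand gene) expressed as a percentage of the genome length. The function names are illustrative only.

```python
GENOME_LENGTH = 5_331_656          # Length field of NC_007778
START, END = 2_842_198, 2_844_075  # Start and End fields of this gene


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, to two decimals."""
    return round(100.0 * position / genome_length, 2)


length = gene_length(START, END)         # 1878, matches the ">1878_bases" header
codons = length // 3                     # 626 codons: 625 residues plus a stop
position = centisome(END, GENOME_LENGTH) # 53.34, matches the Centisome field
```

Under these assumptions the gene length, the residue count, and the centisome value all agree with the fields reported in the record.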
GC content: 66.35
Gene sequence:
>1878_bases ATGCTCACTTTGTCCGACATATCGGTCCGGATCGCGGGACGGCTGCTGATCGACCAGAGCACGGTGCAGATCGCGCCGGG TGCCCGGGTCGGCTTCATCGGCCGCAACGGCGCCGGCAAGTCGACGCTGTTTCACGCCATCCGCGGCGAGCTCGCGACCG AGACCGGGCGCATCACCATGCCGCCGCGCTGGCGCGTCGGCAGCCTCGCGCAGGAAGCGCCCAACGGGCCCGAGACCCTG CTCGAAGTGGTGCTCAAGGCGGATTTGGAGCGCGACGCCCTGCTCGCGGAGGCGGAGACCGCGCACGATCCGCATCGCAT CGCCGACATCCAGACCCGGCTGGTCGACATCGACGCGCATTCGGCCCCGGCGCGCGCCGCCGCCATTCTCAGCGGCCTCG GCTTTTCCGCGGCGGACCAGGCGCGCTCCTGTTCGGAATTCTCCGGCGGCTGGCGGATGCGGGTGGCGCTTGCGGCGACG CTGTTCGCAGCGCCCGATCTGCTGCTGCTCGACGAGCCGACCAACTATCTCGATCTCGAAGGCACGCTGTGGCTCGAGGA TCACCTCGCCAACTATCCGCGCACGGTGATCGTGATCAGCCACGACCGCGATTTGCTCGACACCTCGGTGAACGAGATCT TGCATCTCGATCGCGGCCGGCTGGTGCATTTCCGCGGCACCTACTCGGCCTATGCGGAATTCCGTGCCAACAAGGAGGCG CTCGACGCCAAGAACGCCAAGCGGGAAGAAGCCCGGCGCAAGCACCTGCAGGAATTCGTCGACCGCTTCAAGGCCAAGGC CTCGAAAGCGCGTCAGGCGCAGTCGCGCGTCAAGATGCTGGAGAAGATGAAGCCGGTGACGCGGCTGGTGAGCGACGACG TGCCCGACATCGTCTTTCCCGCGCCGGAGAAGACGCTGTCGCCGCCGATCATCGCCGCCGACAACGTCTCGATCGGCTAC GACCCCAAACACCCGGTGCTGCGTCATGTCACGCTGCGCGTCGACACCGAAGACCGCATTGCTTTGCTCGGCGCCAACGG CAACGGCAAGTCGACGCTGGTGAAGCTGCTGGCCGATCGGCTGACGCCGTTCTCAGGGACGGTGACGCGAGCCGACAAGC TGTCGGTCGCTTACTTCGCCCAGCACCAGCTCGACGAGCTCAACGAGGATGGCTCGCCCTACGACCACATCCGCAAGCTA ATGCCGGACGCGCCCGAGAGCAAGATCCGCGCCCGCGCCGGCCAGATCGGGTTTTCCGGCAAGGCCGCCGACACCCTGGT GAAGAGTCTGTCGGGCGGCGAAAAAGCGCGGTTGCTGTTGGGCCTCGCGACCTTCTACGGCCCGAACATGATCATTCTCG ACGAGCCGACCAACCATCTCGACATCGACAGCCGCGCGGCGCTGGCGGAGGCGATCAACGACTTTCCGGGCGCGGTCATC ATGGTGTCGCACGATCGCTACCTGATCGACGCCTGCGCCGATCAATTGTGGGTGGTCGCCGATCACAAGGTGAAGCCCTA TGACGGCGATCTCGACGACTACCGCCGCGCCGTGTTGTCGTCACGCGGCGCCCGCAGCGGTTCGCGCGAGCCCAGGGAGC GAGCCGCGGACGGAACCGGCGCCAAGCAACCACGTCAGAAGTCGGAGAAGCGCGTACCGCTGAAGCAGCAGATCGCCGAC GCAGAAGTCGAGATCGAGCGGATCACCGCGATCATCGCCAAGATCGATGCAGCGCTTGCATTGCCGGACCTGTTCACGCG CGATCCGAAACAGGCCGCACAGCTCTCCGGCGCGCGCGCGAAAGCCGAAGCTGCGCTGCAGAAGGCCGAGGAACAATGGC TGGACGCGAGCTCGGCCTACGACAAGGCGCAAGGCTGA
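The GC content field is presumably the percentage of G and C bases in the gene sequence above; the record does not say how it was computed. A minimal sketch of that calculation follows, with the function name chosen for illustration. It ignores whitespace (the sequence is wrapped in 80-base chunks) and any ambiguity codes.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the unambiguous bases of seq."""
    seq = "".join(seq.split()).upper()         # drop spaces and line breaks
    counted = [b for b in seq if b in "ACGT"]  # skip N and other ambiguity codes
    if not counted:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(b in "GC" for b in counted)
    return round(100.0 * gc / len(counted), 2)
```

Passing the sequence text after the `>1878_bases` header to `gc_content` should give a value comparable to the reported field, provided the database used the same definition.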
Upstream 100 bases:
>100_bases CACCAGACGAACGCCGCCCTCCCCACCTGTCGCGCGTGCGGCCGATCACAATCGCCCTGATCCCGGTTGCCTTTGAGGCC CGTCCCGGTCAAAACGCGCC
Downstream 100 bases:
>100_bases GACGCGAAGCTTACGGCCGTGGCGGTAGCGTCGCCGCTACGACCTACTGCGCTTCTTCGCGGCTCGCGGTTGCGGCGCCG GCTCCGGTGCGCGTTCGATC
Product: ABC transporter related
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 625; Mature: 625
Protein sequence:
>625_residues MLTLSDISVRIAGRLLIDQSTVQIAPGARVGFIGRNGAGKSTLFHAIRGELATETGRITMPPRWRVGSLAQEAPNGPETL LEVVLKADLERDALLAEAETAHDPHRIADIQTRLVDIDAHSAPARAAAILSGLGFSAADQARSCSEFSGGWRMRVALAAT LFAAPDLLLLDEPTNYLDLEGTLWLEDHLANYPRTVIVISHDRDLLDTSVNEILHLDRGRLVHFRGTYSAYAEFRANKEA LDAKNAKREEARRKHLQEFVDRFKAKASKARQAQSRVKMLEKMKPVTRLVSDDVPDIVFPAPEKTLSPPIIAADNVSIGY DPKHPVLRHVTLRVDTEDRIALLGANGNGKSTLVKLLADRLTPFSGTVTRADKLSVAYFAQHQLDELNEDGSPYDHIRKL MPDAPESKIRARAGQIGFSGKAADTLVKSLSGGEKARLLLGLATFYGPNMIILDEPTNHLDIDSRAALAEAINDFPGAVI MVSHDRYLIDACADQLWVVADHKVKPYDGDLDDYRRAVLSSRGARSGSREPRERAADGTGAKQPRQKSEKRVPLKQQIAD AEVEIERITAIIAKIDAALALPDLFTRDPKQAAQLSGARAKAEAALQKAEEQWLDASSAYDKAQG
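The protein sequence above appears to be a direct translation of the gene sequence: the first codons ATG CTC ACT TTG TCC GAC read MLTLSD. The sketch below translates a coding-strand DNA string with the standard codon assignments, stopping at the first stop codon. The helper names are illustrative, and the use of the standard table is an assumption, since the record does not name a translation table.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand sequence; stop at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)
```

For example, `translate("ATGCTCACTTTGTCCGAC")` returns `"MLTLSD"`, the first six residues of the protein.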
Sequences:
>Translated_625_residues MLTLSDISVRIAGRLLIDQSTVQIAPGARVGFIGRNGAGKSTLFHAIRGELATETGRITMPPRWRVGSLAQEAPNGPETL LEVVLKADLERDALLAEAETAHDPHRIADIQTRLVDIDAHSAPARAAAILSGLGFSAADQARSCSEFSGGWRMRVALAAT LFAAPDLLLLDEPTNYLDLEGTLWLEDHLANYPRTVIVISHDRDLLDTSVNEILHLDRGRLVHFRGTYSAYAEFRANKEA LDAKNAKREEARRKHLQEFVDRFKAKASKARQAQSRVKMLEKMKPVTRLVSDDVPDIVFPAPEKTLSPPIIAADNVSIGY DPKHPVLRHVTLRVDTEDRIALLGANGNGKSTLVKLLADRLTPFSGTVTRADKLSVAYFAQHQLDELNEDGSPYDHIRKL MPDAPESKIRARAGQIGFSGKAADTLVKSLSGGEKARLLLGLATFYGPNMIILDEPTNHLDIDSRAALAEAINDFPGAVI MVSHDRYLIDACADQLWVVADHKVKPYDGDLDDYRRAVLSSRGARSGSREPRERAADGTGAKQPRQKSEKRVPLKQQIAD AEVEIERITAIIAKIDAALALPDLFTRDPKQAAQLSGARAKAEAALQKAEEQWLDASSAYDKAQG >Mature_625_residues MLTLSDISVRIAGRLLIDQSTVQIAPGARVGFIGRNGAGKSTLFHAIRGELATETGRITMPPRWRVGSLAQEAPNGPETL LEVVLKADLERDALLAEAETAHDPHRIADIQTRLVDIDAHSAPARAAAILSGLGFSAADQARSCSEFSGGWRMRVALAAT LFAAPDLLLLDEPTNYLDLEGTLWLEDHLANYPRTVIVISHDRDLLDTSVNEILHLDRGRLVHFRGTYSAYAEFRANKEA LDAKNAKREEARRKHLQEFVDRFKAKASKARQAQSRVKMLEKMKPVTRLVSDDVPDIVFPAPEKTLSPPIIAADNVSIGY DPKHPVLRHVTLRVDTEDRIALLGANGNGKSTLVKLLADRLTPFSGTVTRADKLSVAYFAQHQLDELNEDGSPYDHIRKL MPDAPESKIRARAGQIGFSGKAADTLVKSLSGGEKARLLLGLATFYGPNMIILDEPTNHLDIDSRAALAEAINDFPGAVI MVSHDRYLIDACADQLWVVADHKVKPYDGDLDDYRRAVLSSRGARSGSREPRERAADGTGAKQPRQKSEKRVPLKQQIAD AEVEIERITAIIAKIDAALALPDLFTRDPKQAAQLSGARAKAEAALQKAEEQWLDASSAYDKAQG
Specific function: Unknown
COG id: COG0488
COG function: function code R; ATPase components of ABC transporters with duplicated ATPase domains
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 ABC transporter domains [H]
Homologues:
Organism=Homo sapiens, GI148612853, Length=543, Percent_Identity=38.8581952117864, Blast_Score=373, Evalue=1e-103
Organism=Homo sapiens, GI10947137, Length=525, Percent_Identity=36.5714285714286, Blast_Score=338, Evalue=1e-92
Organism=Homo sapiens, GI27881506, Length=525, Percent_Identity=36.5714285714286, Blast_Score=337, Evalue=2e-92
Organism=Homo sapiens, GI10947135, Length=555, Percent_Identity=34.4144144144144, Blast_Score=281, Evalue=1e-75
Organism=Homo sapiens, GI69354671, Length=555, Percent_Identity=34.4144144144144, Blast_Score=281, Evalue=1e-75
Organism=Escherichia coli, GI1789751, Length=633, Percent_Identity=41.390205371248, Blast_Score=481, Evalue=1e-137
Organism=Escherichia coli, GI1787041, Length=529, Percent_Identity=35.1606805293006, Blast_Score=308, Evalue=8e-85
Organism=Escherichia coli, GI1787182, Length=616, Percent_Identity=30.5194805194805, Blast_Score=232, Evalue=5e-62
Organism=Escherichia coli, GI2367384, Length=536, Percent_Identity=31.3432835820896, Blast_Score=225, Evalue=8e-60
Organism=Escherichia coli, GI1788165, Length=205, Percent_Identity=30.7317073170732, Blast_Score=90, Evalue=5e-19
Organism=Escherichia coli, GI87081791, Length=271, Percent_Identity=26.1992619926199, Blast_Score=64, Evalue=4e-11
Organism=Escherichia coli, GI1786319, Length=184, Percent_Identity=30.4347826086957, Blast_Score=63, Evalue=7e-11
Organism=Caenorhabditis elegans, GI17553372, Length=534, Percent_Identity=37.8277153558052, Blast_Score=348, Evalue=6e-96
Organism=Caenorhabditis elegans, GI17555318, Length=528, Percent_Identity=35.7954545454545, Blast_Score=339, Evalue=3e-93
Organism=Caenorhabditis elegans, GI17559834, Length=552, Percent_Identity=34.963768115942, Blast_Score=328, Evalue=6e-90
Organism=Saccharomyces cerevisiae, GI6321121, Length=541, Percent_Identity=38.2624768946396, Blast_Score=360, Evalue=1e-100
Organism=Saccharomyces cerevisiae, GI6320874, Length=530, Percent_Identity=33.7735849056604, Blast_Score=305, Evalue=1e-83
Organism=Saccharomyces cerevisiae, GI6325030, Length=387, Percent_Identity=28.9405684754522, Blast_Score=158, Evalue=3e-39
Organism=Saccharomyces cerevisiae, GI6324314, Length=410, Percent_Identity=25.609756097561, Blast_Score=122, Evalue=1e-28
Organism=Saccharomyces cerevisiae, GI6323278, Length=410, Percent_Identity=26.0975609756098, Blast_Score=120, Evalue=4e-28
Organism=Drosophila melanogaster, GI24666836, Length=538, Percent_Identity=36.6171003717472, Blast_Score=372, Evalue=1e-103
Organism=Drosophila melanogaster, GI24642252, Length=526, Percent_Identity=36.8821292775665, Blast_Score=354, Evalue=1e-97
Organism=Drosophila melanogaster, GI18859989, Length=526, Percent_Identity=36.8821292775665, Blast_Score=354, Evalue=1e-97
Organism=Drosophila melanogaster, GI24641342, Length=542, Percent_Identity=35.0553505535055, Blast_Score=325, Evalue=4e-89
Organism=Drosophila melanogaster, GI161077321, Length=259, Percent_Identity=27.027027027027, Blast_Score=68, Evalue=2e-11
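Each homologue entry above follows the same field pattern (Organism, GI, Length, Percent_Identity, Blast_Score, Evalue), so the list can be read into structured records. The sketch below is one way to do that; the `Hit` type, the regular expression, and the function names are illustrative and are not part of the database.

```python
import re
from typing import NamedTuple


class Hit(NamedTuple):
    organism: str
    gi: str
    length: int
    percent_identity: float
    blast_score: int
    evalue: float


# One entry: "Organism=..., GI<digits>, Length=..., Percent_Identity=..., ..."
HIT_RE = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*Percent_Identity=(?P<pid>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_hits(text: str) -> list[Hit]:
    """Extract every homologue entry found in text, in order."""
    return [
        Hit(m["organism"].strip(), m["gi"], int(m["length"]),
            float(m["pid"]), int(m["score"]), float(m["evalue"]))
        for m in HIT_RE.finditer(text)
    ]


def best_hit(hits: list[Hit]) -> Hit:
    """Hit with the highest BLAST score."""
    return max(hits, key=lambda h: h.blast_score)
```

Because the pattern matches entries wherever they occur, it works whether the entries are separated by commas or by line breaks.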
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003439
- InterPro: IPR017871
- InterPro: IPR003593 [H]
Pfam domain/function: PF00005 ABC_tran [H]
EC number: NA
Molecular weight: Translated: 68371; Mature: 68371
Theoretical pI: Translated: 6.98; Mature: 6.98
Prosite motif: PS00211 ABC_TRANSPORTER_1 ; PS50893 ABC_TRANSPORTER_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.3 %Cys (Translated Protein)
1.3 %Met (Translated Protein)
1.6 %Cys+Met (Translated Protein)
0.3 %Cys (Mature Protein)
1.3 %Met (Mature Protein)
1.6 %Cys+Met (Mature Protein)
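The Cys/Met percentages are presumably residue counts divided by the protein length and rounded to one decimal place; the record does not state the method. A minimal sketch of that calculation, with an illustrative function name:

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in protein whose residue is in residues."""
    protein = "".join(protein.split()).upper()  # drop spaces and line breaks
    if not protein:
        raise ValueError("empty protein sequence")
    count = sum(aa in residues for aa in protein)
    return round(100.0 * count / len(protein), 1)
```

Calling `residue_percent(seq, "C")`, `residue_percent(seq, "M")`, and `residue_percent(seq, "CM")` on the 625-residue sequence would give the three figures to compare with the reported values.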
Secondary structure:
>Translated Secondary Structure MLTLSDISVRIAGRLLIDQSTVQIAPGARVGFIGRNGAGKSTLFHAIRGELATETGRITM CCEECCCCEEEEEEEEEECCEEEECCCCEEEEEECCCCCHHHHHHHHHHHHHCCCCEEEC PPRWRVGSLAQEAPNGPETLLEVVLKADLERDALLAEAETAHDPHRIADIQTRLVDIDAH CCCCCCCCHHHHCCCCHHHHHHHHHHHCCCHHHHHHHHHCCCCCHHHHHHHHHEEEECCC SAPARAAAILSGLGFSAADQARSCSEFSGGWRMRVALAATLFAAPDLLLLDEPTNYLDLE CCCHHHHHHHHCCCCCHHHHHHHHHHHCCCCEEHHHHHHHHHHCCCEEEEECCCCEEECC GTLWLEDHLANYPRTVIVISHDRDLLDTSVNEILHLDRGRLVHFRGTYSAYAEFRANKEA CEEEEHHHHCCCCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHCCHHH LDAKNAKREEARRKHLQEFVDRFKAKASKARQAQSRVKMLEKMKPVTRLVSDDVPDIVFP HHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEC APEKTLSPPIIAADNVSIGYDPKHPVLRHVTLRVDTEDRIALLGANGNGKSTLVKLLADR CCCCCCCCCEEEECCCCCCCCCCCCCEEEEEEEECCCCCEEEEECCCCCHHHHHHHHHHH LTPFSGTVTRADKLSVAYFAQHQLDELNEDGSPYDHIRKLMPDAPESKIRARAGQIGFSG CCCCCCCEEHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCHHHHHHHHCCCCCCC KAADTLVKSLSGGEKARLLLGLATFYGPNMIILDEPTNHLDIDSRAALAEAINDFPGAVI HHHHHHHHHHCCCCHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHHHCCCCEEE MVSHDRYLIDACADQLWVVADHKVKPYDGDLDDYRRAVLSSRGARSGSREPRERAADGTG EEECCCHHHHHHHCCEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHCCCCCC AKQPRQKSEKRVPLKQQIADAEVEIERITAIIAKIDAALALPDLFTRDPKQAAQLSGARA CCCCHHHHHCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCHHHCCCHHHHHHHCCCHH KAEAALQKAEEQWLDASSAYDKAQG HHHHHHHHHHHHHCCCHHHHHCCCC >Mature Secondary Structure MLTLSDISVRIAGRLLIDQSTVQIAPGARVGFIGRNGAGKSTLFHAIRGELATETGRITM CCEECCCCEEEEEEEEEECCEEEECCCCEEEEEECCCCCHHHHHHHHHHHHHCCCCEEEC PPRWRVGSLAQEAPNGPETLLEVVLKADLERDALLAEAETAHDPHRIADIQTRLVDIDAH CCCCCCCCHHHHCCCCHHHHHHHHHHHCCCHHHHHHHHHCCCCCHHHHHHHHHEEEECCC SAPARAAAILSGLGFSAADQARSCSEFSGGWRMRVALAATLFAAPDLLLLDEPTNYLDLE CCCHHHHHHHHCCCCCHHHHHHHHHHHCCCCEEHHHHHHHHHHCCCEEEEECCCCEEECC GTLWLEDHLANYPRTVIVISHDRDLLDTSVNEILHLDRGRLVHFRGTYSAYAEFRANKEA CEEEEHHHHCCCCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHCCHHH LDAKNAKREEARRKHLQEFVDRFKAKASKARQAQSRVKMLEKMKPVTRLVSDDVPDIVFP HHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEC 
APEKTLSPPIIAADNVSIGYDPKHPVLRHVTLRVDTEDRIALLGANGNGKSTLVKLLADR CCCCCCCCCEEEECCCCCCCCCCCCCEEEEEEEECCCCCEEEEECCCCCHHHHHHHHHHH LTPFSGTVTRADKLSVAYFAQHQLDELNEDGSPYDHIRKLMPDAPESKIRARAGQIGFSG CCCCCCCEEHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCHHHHHHHHCCCCCCC KAADTLVKSLSGGEKARLLLGLATFYGPNMIILDEPTNHLDIDSRAALAEAINDFPGAVI HHHHHHHHHHCCCCHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHHHCCCCEEE MVSHDRYLIDACADQLWVVADHKVKPYDGDLDDYRRAVLSSRGARSGSREPRERAADGTG EEECCCHHHHHHHCCEEEEECCCCCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHCCCCCC AKQPRQKSEKRVPLKQQIADAEVEIERITAIIAKIDAALALPDLFTRDPKQAAQLSGARA CCCCHHHHHCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCHHHCCCHHHHHHHCCCHH KAEAALQKAEEQWLDASSAYDKAQG HHHHHHHHHHHHHCCCHHHHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]