| Definition | Rhodopseudomonas palustris HaA2, complete genome. |
|---|---|
| Accession | NC_007778 |
| Length | 5,331,656 bp |
The map label for this gene is 86747223
Identifier: 86747223
GI number: 86747223
Start: 103097
End: 105265
Strand: Reverse
Name: 86747223
Synonym: RPB_0096
Alternate gene names: NA
Gene position: 105265-103097 (Counterclockwise)
Preceding gene: 86747225
Following gene: 86747221
Centisome position: 1.97
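The coordinate fields above are consistent with one another: the span 103097 to 105265 covers 2169 bases, and 105265 expressed as a percentage of the 5,331,656-base genome gives 1.97. A minimal Python sketch of that arithmetic follows. The function names are illustrative, and taking the centisome from the gene's first coordinate on the reverse strand (105265) is an assumption inferred from the listed value, not something the record states.

```python
def gene_length(start: int, end: int) -> int:
    """Inclusive length in bases of a gene spanning start..end (either order)."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


GENOME_LENGTH = 5_331_656  # from the Length field of this record

print(gene_length(103097, 105265))       # 2169
print(centisome(105265, GENOME_LENGTH))  # 1.97
```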
GC content: 68.42
Gene sequence:
>2169_bases ATGAAGACCTTCCTGACCGTGAAACTGGTGCTGGTGCCGTTCGCGCTGTTCTGGGCGCTGCTGGCGATGGGCCACGTCGA CTGGGCAATCGGCGCGGGGCTCGTGCTGGCTCTGATCGGCAATGCTTGGCGTGCGTGGCGGCGCGAATTGTTCGTGCTTG AAGTCGGCGGGCTGGTGCTGTTTCTCGGCCTCGGCGGGTTGCTGCTCGTGGTGCCCGATCTCGCGGCGCCGACCGCGCTG TGGCTCTCGTTCGCGGGGCTGTCCGCGATCAGCATCGCGAGCCTGGTTGTGCGCCGGCCGTGGACCTCGGACTACGCCCG CGCCGCCTATCCGGACAATGCCGCCACGCCGCAGTTCTTCGTCATCAACGCCGCGATCACCGCGCTGTGGGCGGTGCTGT TCGCCGCGATCGCCGCCTGCCGCTATGCCGGCGCGCCGGGCGGTGTGATTGCCGCCCTCGTCATCGGCGGCGCGCTGATC TCGATCTTCGGGCCGAGGCTGGCGATCCGCTTTGTTCTGCAGCGGCTGGCCGCTTCGGGCGAAACCTATCGTTGGCCGGC GCCGTCCTTCGCGCGCGACAATGCGGCGGATTGCGACGTTGCGGTGATCGGCGCCGGCATCGGCGGGCTGACCGCCGCGG CGCTGCTCGCCGATGCGGGGCTGAAGGTGAAGGTGTTCGATCAGCACGTCGTCGCCGGCGGCTATTGCCATACCTATCTG CGCAAGGCGCATCACCTCAACAAGCCGGTGCTGTATCGCTTCGACGCCGGGCCGCACGATTTCTCCGGCGTGCAGCCGGG CGGGCCGTTCGCCACGTTGCTGCTACGACTCGGCGTCGCCGACCGCATCACGTGGGAGCGGGTGACGCAGAGCTTCCACA CCGCGCGCGGCGCAATCGACGTGCCGCCGGACTGGCGCGACTACGTCCGCGTGCTCGGCGAGCGGTTTCCGGACAGCGCC GCCGGGATCAAGTCGCTGTTCGAGGAGATCAAGGCGATCTTCGACGACATGTTCTCGACCGCCGCGAGCCGCGGCGGCGT TCCCGGGCCGCCGTCGACGATCGACGAATTGCTGGCGTTTCCGCGCGCGCATCCGCATGCCTACAAATGGATGAACAGGC CGTTCGCCGAGCTGGTCGCGGCGCATGTCCGCGATCCCGCTGTGGTCGCAGTGATCGACGGGCTGGCGGGCTATATCGGC GACGGCAGCGAGCCGCCGAGCTGCGCCCGGATGGTGCCGATCTTCGGCTACTACTTCCACGGCGGCTATTATCCGCGCGG CGGCTCGGGCGTGGTCGCCGATGCGCTGGTCGAGGCGATCGAAGCGCGCGATGGTGAGGTGCGGCTGAAGACCGCCGTGA AACAGATCATCGTCGAGAACGGAAGCGCCGCCGGCGTGGTGCTCGCCGATGGCGAGAGGGTGCGCGCCCGTGCGGTGGTG TCGAATGCCGACTTCAAGCGCACGCTGCTCGAACTGGTGCCCGCGGCGGCGCTGCCGCGCGGCGTTCGCGACGATCTCGC CGCGGCGGCGCCGGCGAATTCTTGCTTCAGCGTGCATCTCGGCGTCGATTTCGTCCCGGACCTCGGTCCCTCGACGCATC TGCACGCGCCAATGCCGCTCGGCATCGCGATGATGTCGAAATGCGACCCCACCGCCGCGCCGCCGGGACATGCGATCCTC TCCCTGATCGCGCTGGTGCCGCATGACGAGGCGAAGAGCTGGTTTCCGCAGGAGAGTGGCGGCAACGACTGGAAGGAGTG GCGGCGCTCCGAAGATTACTTGCGGCGCAAGGAGGAATTCGGCGACCGCATGATCGCCGCCGCAGAGACGGCGATCCCCG GGCTGTCGCAGCACATCGTGTATCGCACCGACGCCAGCCCGGTGACCTATGCGCGCTACGACTGGGCGAGCTTCGGCGCG 
ATCTACGGGATGTCGACCGCGGGACAGCTCAAAGGCTCGAAAACCGCGCTGCGCCATCTGGTGATCGCCGGCGGCGGCAA TATCGGCGCCGGGGTCGAAGCGGTGGTGATCTCCGGCGCCAATGCCGCCGAAGCGCTGGTGCCGGGATTGCTGGCGCGGG CTGGAAGCGAGGCGACAGCGTCGATCAGTTTACTTACGCCAAAGTCCCATGCCCCGGACAAAGCGGACCGCGAAGCGGTG CGCTGCTGA
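The GC content field can in principle be recomputed from the gene sequence as listed. The sketch below assumes the layout used in this record (a header token beginning with ">" followed by space-separated sequence chunks); the helper names are hypothetical, and the record does not say how its own value was calculated or rounded.

```python
def clean_sequence(text: str) -> str:
    """Drop any token starting with '>' and join the remaining chunks."""
    return "".join(tok for tok in text.split() if not tok.startswith(">")).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases, rounded to 2 decimals."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return round(100.0 * gc / len(seq), 2)


# Toy input in the same layout as the record, not the real gene.
print(gc_content(">6_bases GGCC AT"))  # 66.67
```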
Upstream 100 bases:
>100_bases TTGGTACCCGATGGTACCGGAAGCGCCGGCGGCGTCAATGTTGCGGCCGCTCAATGGCTCCGGCGCAGCCACCGGCGATG ATTCGACCCCGGAGGCCGCG
Downstream 100 bases:
>100_bases TCCGGGCCCCGGTTATTTCGGTGGCAGATCGGGGTCCCGGATCTGCGCCGCAGCGCTGACGCGCTGCCGCTTGTCCGGGA CACAGTCTCTCATTCTCAGG
Product: FAD dependent oxidoreductase
Products: NA
Alternate protein names: Carotenoid Isomerase; Amine Oxidase; Carotene Isomerase; Phytoene Dehydrogenase; Phytoene Desaturase; All-Trans-Retinol; Phytoene Dehydrogenase-Related Protein; Phytoene Dehydrogenase And Related Proteins; Amine Oxydase Deshydrogenase; Oxidoreductase; Phytoene Dehydrogenase And Related Protein; FAD-Dependent Pyridine Nucleotide-Disulfide Oxidoreductase; Carotenoid Cis-Trans Isomerase; Dehydrogenase-Related Protein; Diapophytoene Dehydrogenase Crtn; Amine Oxidase Flavin-Containing; Zeta-Phytoene Desaturase; Phytoene Dehydrogenase Or
Number of amino acids: Translated: 722; Mature: 722
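The residue count follows from the gene length: 2169 bases make 723 codons, and the last codon (TGA) is a stop, leaving 722 residues. As a rough illustration, the sketch below translates a DNA string with the standard codon assignments and stops at the first stop codon. It is not the pipeline that produced this record; the names are illustrative, and ambiguous bases such as N are not handled.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGAAGACCTTCCTGACCGTGAAACTGGTG"))  # MKTFLTVKLV
print(translate("CGCTGCTGA"))                       # RC
```

Both outputs match the start (MKTFLTVKLV) and end (RC) of the protein sequence listed below.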
Protein sequence:
>722_residues MKTFLTVKLVLVPFALFWALLAMGHVDWAIGAGLVLALIGNAWRAWRRELFVLEVGGLVLFLGLGGLLLVVPDLAAPTAL WLSFAGLSAISIASLVVRRPWTSDYARAAYPDNAATPQFFVINAAITALWAVLFAAIAACRYAGAPGGVIAALVIGGALI SIFGPRLAIRFVLQRLAASGETYRWPAPSFARDNAADCDVAVIGAGIGGLTAAALLADAGLKVKVFDQHVVAGGYCHTYL RKAHHLNKPVLYRFDAGPHDFSGVQPGGPFATLLLRLGVADRITWERVTQSFHTARGAIDVPPDWRDYVRVLGERFPDSA AGIKSLFEEIKAIFDDMFSTAASRGGVPGPPSTIDELLAFPRAHPHAYKWMNRPFAELVAAHVRDPAVVAVIDGLAGYIG DGSEPPSCARMVPIFGYYFHGGYYPRGGSGVVADALVEAIEARDGEVRLKTAVKQIIVENGSAAGVVLADGERVRARAVV SNADFKRTLLELVPAAALPRGVRDDLAAAAPANSCFSVHLGVDFVPDLGPSTHLHAPMPLGIAMMSKCDPTAAPPGHAIL SLIALVPHDEAKSWFPQESGGNDWKEWRRSEDYLRRKEEFGDRMIAAAETAIPGLSQHIVYRTDASPVTYARYDWASFGA IYGMSTAGQLKGSKTALRHLVIAGGGNIGAGVEAVVISGANAAEALVPGLLARAGSEATASISLLTPKSHAPDKADREAV RC
Sequences:
>Translated_722_residues MKTFLTVKLVLVPFALFWALLAMGHVDWAIGAGLVLALIGNAWRAWRRELFVLEVGGLVLFLGLGGLLLVVPDLAAPTAL WLSFAGLSAISIASLVVRRPWTSDYARAAYPDNAATPQFFVINAAITALWAVLFAAIAACRYAGAPGGVIAALVIGGALI SIFGPRLAIRFVLQRLAASGETYRWPAPSFARDNAADCDVAVIGAGIGGLTAAALLADAGLKVKVFDQHVVAGGYCHTYL RKAHHLNKPVLYRFDAGPHDFSGVQPGGPFATLLLRLGVADRITWERVTQSFHTARGAIDVPPDWRDYVRVLGERFPDSA AGIKSLFEEIKAIFDDMFSTAASRGGVPGPPSTIDELLAFPRAHPHAYKWMNRPFAELVAAHVRDPAVVAVIDGLAGYIG DGSEPPSCARMVPIFGYYFHGGYYPRGGSGVVADALVEAIEARDGEVRLKTAVKQIIVENGSAAGVVLADGERVRARAVV SNADFKRTLLELVPAAALPRGVRDDLAAAAPANSCFSVHLGVDFVPDLGPSTHLHAPMPLGIAMMSKCDPTAAPPGHAIL SLIALVPHDEAKSWFPQESGGNDWKEWRRSEDYLRRKEEFGDRMIAAAETAIPGLSQHIVYRTDASPVTYARYDWASFGA IYGMSTAGQLKGSKTALRHLVIAGGGNIGAGVEAVVISGANAAEALVPGLLARAGSEATASISLLTPKSHAPDKADREAV RC >Mature_722_residues MKTFLTVKLVLVPFALFWALLAMGHVDWAIGAGLVLALIGNAWRAWRRELFVLEVGGLVLFLGLGGLLLVVPDLAAPTAL WLSFAGLSAISIASLVVRRPWTSDYARAAYPDNAATPQFFVINAAITALWAVLFAAIAACRYAGAPGGVIAALVIGGALI SIFGPRLAIRFVLQRLAASGETYRWPAPSFARDNAADCDVAVIGAGIGGLTAAALLADAGLKVKVFDQHVVAGGYCHTYL RKAHHLNKPVLYRFDAGPHDFSGVQPGGPFATLLLRLGVADRITWERVTQSFHTARGAIDVPPDWRDYVRVLGERFPDSA AGIKSLFEEIKAIFDDMFSTAASRGGVPGPPSTIDELLAFPRAHPHAYKWMNRPFAELVAAHVRDPAVVAVIDGLAGYIG DGSEPPSCARMVPIFGYYFHGGYYPRGGSGVVADALVEAIEARDGEVRLKTAVKQIIVENGSAAGVVLADGERVRARAVV SNADFKRTLLELVPAAALPRGVRDDLAAAAPANSCFSVHLGVDFVPDLGPSTHLHAPMPLGIAMMSKCDPTAAPPGHAIL SLIALVPHDEAKSWFPQESGGNDWKEWRRSEDYLRRKEEFGDRMIAAAETAIPGLSQHIVYRTDASPVTYARYDWASFGA IYGMSTAGQLKGSKTALRHLVIAGGGNIGAGVEAVVISGANAAEALVPGLLARAGSEATASISLLTPKSHAPDKADREAV RC
Specific function: Unknown
COG id: COG1233
COG function: function code Q; Phytoene dehydrogenase and related proteins
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism=Homo sapiens, GI203098013, Length=494, Percent_Identity=26.3157894736842, Blast_Score=88, Evalue=3e-17
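Homologue entries in this record are comma-separated fields, mostly of the form key=value, with the GI number given as a bare GI-prefixed token. A hedged parsing sketch follows; the function name and the choice to store the GI under a "GI" key are assumptions made for illustration.

```python
def parse_homologue(line: str) -> dict:
    """Split a homologue line into a dict of string fields."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # tolerate a trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare token such as GI203098013
    return record


hit = parse_homologue(
    "Organism=Homo sapiens, GI203098013, Length=494, "
    "Percent_Identity=26.3157894736842, Blast_Score=88, Evalue=3e-17,"
)
print(hit["Organism"], hit["GI"], float(hit["Evalue"]))
```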
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 76529; Mature: 76529
Theoretical pI: Translated: 7.70; Mature: 7.70
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.0 %Cys (Translated Protein)
1.4 %Met (Translated Protein)
2.4 %Cys+Met (Translated Protein)
1.0 %Cys (Mature Protein)
1.4 %Met (Mature Protein)
2.4 %Cys+Met (Mature Protein)
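The translated and mature figures are identical because both forms have the same 722 residues. Percentages of this kind are residue counts divided by sequence length; the sketch below shows the calculation on a toy sequence. The function name and the rounding to one decimal are assumptions chosen to match how the values are displayed here.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the sequence made up of any of the given residues."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


toy = "MKTFLTVKLV"  # first ten residues of the protein in this record
print(residue_percent(toy, "C"))   # 0.0
print(residue_percent(toy, "M"))   # 10.0
print(residue_percent(toy, "CM"))  # 10.0
```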
Secondary structure:
>Translated Secondary Structure MKTFLTVKLVLVPFALFWALLAMGHVDWAIGAGLVLALIGNAWRAWRRELFVLEVGGLVL CCCHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHEEEEEEHHHHHH FLGLGGLLLVVPDLAAPTALWLSFAGLSAISIASLVVRRPWTSDYARAAYPDNAATPQFF HHHHCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCCCCCEE VINAAITALWAVLFAAIAACRYAGAPGGVIAALVIGGALISIFGPRLAIRFVLQRLAASG EHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC ETYRWPAPSFARDNAADCDVAVIGAGIGGLTAAALLADAGLKVKVFDQHVVAGGYCHTYL CCEECCCCCCCCCCCCCCCEEEEECCHHHHHHHHHHHCCCCEEEEECCHHHCCHHHHHHH RKAHHLNKPVLYRFDAGPHDFSGVQPGGPFATLLLRLGVADRITWERVTQSFHTARGAID HHHHHCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCC VPPDWRDYVRVLGERFPDSAAGIKSLFEEIKAIFDDMFSTAASRGGVPGPPSTIDELLAF CCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHC PRAHPHAYKWMNRPFAELVAAHVRDPAVVAVIDGLAGYIGDGSEPPSCARMVPIFGYYFH CCCCCHHHHHHCCCHHHHHHHHCCCCHHEEHHHHHHHHCCCCCCCCHHHHHHHHHHHHCC GGYYPRGGSGVVADALVEAIEARDGEVRLKTAVKQIIVENGSAAGVVLADGERVRARAVV CCCCCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHH SNADFKRTLLELVPAAALPRGVRDDLAAAAPANSCFSVHLGVDFVPDLGPSTHLHAPMPL CCCHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCEEEEECCEECCCCCCCCCCCCCCCH GIAMMSKCDPTAAPPGHAILSLIALVPHDEAKSWFPQESGGNDWKEWRRSEDYLRRKEEF HHHHHHCCCCCCCCCHHHHHHHHHHCCCCHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHH GDRMIAAAETAIPGLSQHIVYRTDASPVTYARYDWASFGAIYGMSTAGQLKGSKTALRHL HHHHHHHHHHHCCCCCCCEEEEECCCCCEEEECCHHHHHHHHCCCCCCCCCCHHHHHHEE VIAGGGNIGAGVEAVVISGANAAEALVPGLLARAGSEATASISLLTPKSHAPDKADREAV EEECCCCCCCCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCHHHHHC RC CC >Mature Secondary Structure MKTFLTVKLVLVPFALFWALLAMGHVDWAIGAGLVLALIGNAWRAWRRELFVLEVGGLVL CCCHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHEEEEEEHHHHHH FLGLGGLLLVVPDLAAPTALWLSFAGLSAISIASLVVRRPWTSDYARAAYPDNAATPQFF HHHHCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCCCCCEE VINAAITALWAVLFAAIAACRYAGAPGGVIAALVIGGALISIFGPRLAIRFVLQRLAASG EHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC ETYRWPAPSFARDNAADCDVAVIGAGIGGLTAAALLADAGLKVKVFDQHVVAGGYCHTYL 
CCEECCCCCCCCCCCCCCCEEEEECCHHHHHHHHHHHCCCCEEEEECCHHHCCHHHHHHH RKAHHLNKPVLYRFDAGPHDFSGVQPGGPFATLLLRLGVADRITWERVTQSFHTARGAID HHHHHCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCC VPPDWRDYVRVLGERFPDSAAGIKSLFEEIKAIFDDMFSTAASRGGVPGPPSTIDELLAF CCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHC PRAHPHAYKWMNRPFAELVAAHVRDPAVVAVIDGLAGYIGDGSEPPSCARMVPIFGYYFH CCCCCHHHHHHCCCHHHHHHHHCCCCHHEEHHHHHHHHCCCCCCCCHHHHHHHHHHHHCC GGYYPRGGSGVVADALVEAIEARDGEVRLKTAVKQIIVENGSAAGVVLADGERVRARAVV CCCCCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHH SNADFKRTLLELVPAAALPRGVRDDLAAAAPANSCFSVHLGVDFVPDLGPSTHLHAPMPL CCCHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCEEEEECCEECCCCCCCCCCCCCCCH GIAMMSKCDPTAAPPGHAILSLIALVPHDEAKSWFPQESGGNDWKEWRRSEDYLRRKEEF HHHHHHCCCCCCCCCHHHHHHHHHHCCCCHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHH GDRMIAAAETAIPGLSQHIVYRTDASPVTYARYDWASFGAIYGMSTAGQLKGSKTALRHL HHHHHHHHHHHCCCCCCCEEEEECCCCCEEEECCHHHHHHHHCCCCCCCCCCHHHHHHEE VIAGGGNIGAGVEAVVISGANAAEALVPGLLARAGSEATASISLLTPKSHAPDKADREAV EEECCCCCCCCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCHHHHHC RC CC
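In the secondary structure blocks above, chunks of amino-acid sequence alternate with chunks of predicted states of the same length. A small sketch for separating the two tracks and summarising the states follows. It assumes the header text (for example ">Translated Secondary Structure") has already been removed, and that H, E and C denote helix, strand and coil, which is the usual convention but is not stated in the record. The function names are illustrative.

```python
from collections import Counter


def split_structure(block: str) -> tuple:
    """Separate alternating sequence/state chunks into two strings."""
    tokens = block.split()
    sequence = "".join(tokens[0::2])  # even-numbered chunks: residues
    states = "".join(tokens[1::2])    # odd-numbered chunks: H/E/C states
    if len(sequence) != len(states):
        raise ValueError("sequence and state tracks differ in length")
    return sequence, states


def state_fractions(states: str) -> dict:
    """Percentage of positions in each state, rounded to 1 decimal."""
    counts = Counter(states)
    return {s: round(100.0 * counts[s] / len(states), 1) for s in "HEC"}


# Toy block in the same alternating layout, not the real prediction.
seq, ss = split_structure("MKTF CCCH LTV HHH")
print(seq, ss)              # MKTFLTV CCCHHHH
print(state_fractions(ss))  # {'H': 57.1, 'E': 0.0, 'C': 42.9}
```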
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA