| Definition | Rhodopseudomonas palustris HaA2, complete genome. |
|---|---|
| Accession | NC_007778 |
| Length | 5,331,656 |
The map label for this gene is 86749060
Identifier: 86749060
GI number: 86749060
Start: 2205910
End: 2206569
Strand: Reverse
Name: 86749060
Synonym: RPB_1937
Alternate gene names: NA
Gene position: 2206569-2205910 (Counterclockwise)
Preceding gene: 86749061
Following gene: 86749058
Centisome position: 41.39
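The centisome value is consistent with the gene's 5' coordinate expressed as a percentage of the genome length listed above. The sketch below reproduces that arithmetic. The function name `centisome` and the choice of coordinate (2206569, the first base of this reverse-strand gene) are assumptions inferred from the numbers in this record, not a documented definition.

```python
def centisome(position: int, genome_length: int) -> float:
    """Map position as a percentage of genome length (assumed definition)."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return round(100.0 * position / genome_length, 2)


# Values copied from this record.
GENOME_LENGTH = 5_331_656   # "Length" field
GENE_5PRIME = 2_206_569     # first coordinate of "Gene position" (reverse strand)

print(centisome(GENE_5PRIME, GENOME_LENGTH))  # 41.39
```

Using the other coordinate (2205910) would give 41.37, so the listed 41.39 appears to be measured from the gene's 5' end.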
GC content: 72.88
Gene sequence:
>660_bases
TTGGGGGCGCGTGTGGCCGATACAGAAGTAACCCGCCGTGGACGTCCGGGACCGCAAGGACCGCGTGGACGTCCGGGCGA
ACCGGGACGTCCCGGACCGCAGGGACATCCGGGACGGCCGGGACCGGAAGGCCCGCGCGGCAAGCCGGGGCCGGTCGGCA
AACCCGGTCCGCAAGGCAAGGCCGGCCCGCAGGGCAAGCCCGGCGCCGACGGCAAACCGGGCCCGCAGGGCCCGCAGGGC
CCGCAAGGCAAGCCCGGCCCCGACGGCAAGGCTGGTCCAGTTGGGCCCGAAGGCAAGCCGGGACCGCAGGGTCCCCGCGG
CGACCAGGGCCCCCGTGGCGAACAGGGACTGCGCGGCGATCAGGGCCCGCGCGGCGAGCCCGGCCCTGCCGGCGCCCTCC
CCTCGATCGAGCAGGTGATGCCCTGGCTGCACCTGATCTTCGACGCCTATGAGGACTACAAGGCGAAGCGCGAACAGGAG
GCGCGCGAACGCGCCGAGCGCGAGGCCACGGAGCTGCGCGAAGCGATCGAACGCGAAGCCGCCGAGCGCGAGGTCGCAGC
TGCGCTCGACGAGTCGGAGCACGCCGACGACGATGAACGCGACGACGACGACGAGCGGCCCGACGGCAAGAAAAAGAAGA
AGCGCAAGCACAAGGATTGA
Upstream 100 bases:
>100_bases
CGTTTCGCGGATTGCTCAATCTGCCGGTCGTCTGGAACAGCGCAGCGCCGAATTGAAGCCGTCACCTCACCTCTGCTAAA
GGTGGGGGGATAGGAATCGT
Downstream 100 bases:
>100_bases
GTGAAGCCGCGCGCGCTAACCGAATCGCGCGCGCGAGCGCTCTCTTGCCTTCTCTGCCTCGACGTCCCGGTTGCGCGGCT
CGGCCTGCGTCTGCAGCGAC
Product: triple helix repeat-containing collagen
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 219; Mature: 218
Protein sequence:
>219_residues
MGARVADTEVTRRGRPGPQGPRGRPGEPGRPGPQGHPGRPGPEGPRGKPGPVGKPGPQGKAGPQGKPGADGKPGPQGPQG
PQGKPGPDGKAGPVGPEGKPGPQGPRGDQGPRGEQGLRGDQGPRGEPGPAGALPSIEQVMPWLHLIFDAYEDYKAKREQE
ARERAEREATELREAIEREAAEREVAAALDESEHADDDERDDDDERPDGKKKKKRKHKD
Sequences:
>Translated_219_residues
MGARVADTEVTRRGRPGPQGPRGRPGEPGRPGPQGHPGRPGPEGPRGKPGPVGKPGPQGKAGPQGKPGADGKPGPQGPQG
PQGKPGPDGKAGPVGPEGKPGPQGPRGDQGPRGEQGLRGDQGPRGEPGPAGALPSIEQVMPWLHLIFDAYEDYKAKREQE
ARERAEREATELREAIEREAAEREVAAALDESEHADDDERDDDDERPDGKKKKKRKHKD
>Mature_218_residues
GARVADTEVTRRGRPGPQGPRGRPGEPGRPGPQGHPGRPGPEGPRGKPGPVGKPGPQGKAGPQGKPGADGKPGPQGPQGP
QGKPGPDGKAGPVGPEGKPGPQGPRGDQGPRGEQGLRGDQGPRGEPGPAGALPSIEQVMPWLHLIFDAYEDYKAKREQEA
RERAEREATELREAIEREAAEREVAAALDESEHADDDERDDDDERPDGKKKKKRKHKD
Specific function: Unknown
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism=Homo sapiens, GI183583553, Length=133, Percent_Identity=42.8571428571429, Blast_Score=99, Evalue=3e-21
Organism=Homo sapiens, GI65301115, Length=249, Percent_Identity=29.718875502008, Blast_Score=84, Evalue=1e-16
Organism=Homo sapiens, GI56847616, Length=213, Percent_Identity=30.9859154929577, Blast_Score=82, Evalue=5e-16
Organism=Homo sapiens, GI115527062, Length=114, Percent_Identity=47.3684210526316, Blast_Score=75, Evalue=4e-14
Organism=Homo sapiens, GI115527066, Length=114, Percent_Identity=47.3684210526316, Blast_Score=74, Evalue=7e-14
Organism=Homo sapiens, GI115527070, Length=114, Percent_Identity=47.3684210526316, Blast_Score=74, Evalue=1e-13
Organism=Homo sapiens, GI5803080, Length=114, Percent_Identity=46.4912280701754, Blast_Score=72, Evalue=3e-13
Organism=Homo sapiens, GI115392133, Length=116, Percent_Identity=49.1379310344828, Blast_Score=72, Evalue=3e-13
Organism=Homo sapiens, GI61699226, Length=126, Percent_Identity=51.5873015873016, Blast_Score=72, Evalue=3e-13
Organism=Homo sapiens, GI48762934, Length=108, Percent_Identity=54.6296296296296, Blast_Score=71, Evalue=8e-13
Organism=Homo sapiens, GI32140760, Length=159, Percent_Identity=39.622641509434, Blast_Score=70, Evalue=2e-12
Organism=Homo sapiens, GI18780273, Length=126, Percent_Identity=41.2698412698413, Blast_Score=69, Evalue=4e-12
Organism=Homo sapiens, GI89363017, Length=121, Percent_Identity=49.5867768595041, Blast_Score=67, Evalue=2e-11
Organism=Homo sapiens, GI11386161, Length=119, Percent_Identity=51.2605042016807, Blast_Score=66, Evalue=2e-11
Organism=Homo sapiens, GI111118976, Length=112, Percent_Identity=51.7857142857143, Blast_Score=65, Evalue=5e-11
Organism=Homo sapiens, GI110735435, Length=147, Percent_Identity=44.2176870748299, Blast_Score=65, Evalue=6e-11
Organism=Homo sapiens, GI98985806, Length=100, Percent_Identity=49, Blast_Score=64, Evalue=8e-11
Organism=Homo sapiens, GI98985810, Length=100, Percent_Identity=49, Blast_Score=64, Evalue=9e-11
Organism=Homo sapiens, GI119829187, Length=93, Percent_Identity=49.4623655913978, Blast_Score=64, Evalue=1e-10
Organism=Homo sapiens, GI73486666, Length=139, Percent_Identity=46.0431654676259, Blast_Score=64, Evalue=1e-10
Organism=Caenorhabditis elegans, GI17569903, Length=121, Percent_Identity=46.2809917355372, Blast_Score=74, Evalue=4e-14
Organism=Caenorhabditis elegans, GI17535735, Length=156, Percent_Identity=42.3076923076923, Blast_Score=74, Evalue=8e-14
Organism=Caenorhabditis elegans, GI17507107, Length=134, Percent_Identity=46.2686567164179, Blast_Score=73, Evalue=9e-14
Organism=Caenorhabditis elegans, GI17506747, Length=95, Percent_Identity=52.6315789473684, Blast_Score=67, Evalue=5e-12
Organism=Caenorhabditis elegans, GI193203538, Length=150, Percent_Identity=44.6666666666667, Blast_Score=65, Evalue=3e-11
Organism=Caenorhabditis elegans, GI17551704, Length=110, Percent_Identity=49.0909090909091, Blast_Score=65, Evalue=4e-11
Organism=Caenorhabditis elegans, GI17566918, Length=122, Percent_Identity=47.5409836065574, Blast_Score=64, Evalue=4e-11
Organism=Drosophila melanogaster, GI45549584, Length=111, Percent_Identity=51.3513513513513, Blast_Score=80, Evalue=1e-15
Organism=Drosophila melanogaster, GI221379525, Length=116, Percent_Identity=47.4137931034483, Blast_Score=64, Evalue=5e-11
Organism=Drosophila melanogaster, GI221379533, Length=159, Percent_Identity=40.251572327044, Blast_Score=64, Evalue=6e-11
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 23089; Mature: 22957
Theoretical pI: Translated: 7.84; Mature: 7.84
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.0 %Cys (Translated Protein)
0.9 %Met (Translated Protein)
0.9 %Cys+Met (Translated Protein)
0.0 %Cys (Mature Protein)
0.5 %Met (Mature Protein)
0.5 %Cys+Met (Mature Protein)
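The percentages above can be read as residue counts divided by sequence length, rounded to one decimal place. The sketch below illustrates that calculation on a toy peptide. The helper name `residue_percent` and the rounding rule are assumptions, not the database's documented method.

```python
def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` holding any one-letter code in `residues`."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    hits = sum(1 for aa in sequence.upper() if aa in residues)
    return round(100.0 * hits / len(sequence), 1)


# Toy peptide for illustration only (not from this record):
# 10 residues containing 1 Cys and 2 Met.
toy = "MACDEFGHIM"
print(residue_percent(toy, "C"))   # 10.0
print(residue_percent(toy, "M"))   # 20.0
print(residue_percent(toy, "CM"))  # 30.0
```

Applied to this record, the translated protein has 2 Met and no Cys in 219 residues (2/219, about 0.9 %), and the mature protein has 1 Met in 218 residues (1/218, about 0.5 %), which matches the listed values.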
Secondary structure:
>Translated Secondary Structure
MGARVADTEVTRRGRPGPQGPRGRPGEPGRPGPQGHPGRPGPEGPRGKPGPVGKPGPQGK
CCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
AGPQGKPGADGKPGPQGPQGPQGKPGPDGKAGPVGPEGKPGPQGPRGDQGPRGEQGLRGD
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
QGPRGEPGPAGALPSIEQVMPWLHLIFDAYEDYKAKREQEARERAEREATELREAIEREA
CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
AEREVAAALDESEHADDDERDDDDERPDGKKKKKRKHKD
HHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHCCCC
>Mature Secondary Structure
GARVADTEVTRRGRPGPQGPRGRPGEPGRPGPQGHPGRPGPEGPRGKPGPVGKPGPQGK
CCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
AGPQGKPGADGKPGPQGPQGPQGKPGPDGKAGPVGPEGKPGPQGPRGDQGPRGEQGLRGD
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
QGPRGEPGPAGALPSIEQVMPWLHLIFDAYEDYKAKREQEARERAEREATELREAIEREA
CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
AEREVAAALDESEHADDDERDDDDERPDGKKKKKRKHKD
HHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: NA