Definition | Sphingopyxis alaskensis RB2256, complete genome. |
---|---|
Accession | NC_008048 |
Length | 3,345,170 |
The map label for this gene is 103485949
Identifier: 103485949
GI number: 103485949
Start: 467931
End: 469769
Strand: Direct
Name: 103485949
Synonym: Sala_0456
Alternate gene names: NA
Gene position: 467931-469769 (Clockwise)
Preceding gene: 103485948
Following gene: 103485951
Centisome position: 13.99
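The gene length and centisome position can be re-derived from the coordinates above. The following is a minimal sketch, assuming the centisome value is the start coordinate expressed as a percentage of the genome length (the record does not state its formula; this assumption reproduces the listed value). The function names are illustrative only.

```python
# Coordinates and genome length copied from this record.
GENOME_LENGTH = 3_345_170
START, END = 467_931, 469_769


def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome length (assumed formula)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                    # 1839
print(round(centisome(START, GENOME_LENGTH), 2))  # 13.99
```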
GC content: 63.13
Gene sequence:
>1839_bases ATGAAAGCCATGATCTCGACGCTGTCGCTTGCCCTGTCGCTCGCGGTGGCGAGCCCCGCCTTTGCGCAGGCGGCGCCCGC CGCGACTCCCGCCGCCGCCCCCACCGCCGCCGAGGCCGAAGCCTTTATCGCCGCGGCGGAGAAAGACCTGTTCGACTATA CCGTCGAGGCCGCCCAAGTGAACTGGGTCAATGCCACCTATATCACCGAGGACACCGACGCGATGGCGGCGCGGATCAAC GCGGTCGGCACCGAAAAGGCGGTGAAATATGCACTGGAAGCCGCGAAATATAGCGATGTTCCGGGGCTGAGCGCCGAAAC GAAGCGCAAGCTCAACATCCTGCGCACGGGCCTCGTGCTGCCCGCGCCGACGACGCCGGGCGCCGCGACCGAACTCAACC GGATCGCGACCGACCTGCAGTCGCAATATGGCAAGGGCCGCGGCACGCTGAACGGCAAGGAAATCGCCGGTTCGGATATC GAGGCGGAGATGGGCAATCTGGAACGCACCCCTGTCGAACTCGCCGAAATGTGGACGAGCTGGCACGACAATGTCGGCGC GCCGATGAAGCAGGATTATGCCCGCATGGTCGCCATCGCCAACGCGGGCGCGAAGGAACTGGGCTTTGCCGACACCGGTG CGATGTGGCGGTCGGGCTATGACATGCCGCCCGAAGAGTTTGCCAGGCTGACCGAAAAAATCTGGCAGGACATGAAGCCG CTCTATGTCGCGCTCCACACCTATGTCCGCTGGAAGCTCAACGAGAAATATGGCGACGCGGTGCAGCCCAAGACGGGACC GATCCGCGCCGACCTGCTCGGCAATATGTGGGCGCAGGAATGGGGCAATATCTATCCGCTCGTCGCGCCGCCGGGAACGG GCGATCTGGGCTATGATATCGGCGAGCTGCTCGCGGCGCAGGGCAAGACGCCGCTCGACATGGTCAAGATCGGCGAGAAT TTCTATTCGTCGCTGGGCATGGCGCCGCTGCCCGATACATTCTGGAAGCGGAGCATGTTCACCAAGCCCGCCGACCGCGA AGTCGTCTGCCACGCCTCGGCGTGGAACATCGACAACAAGGACGATATTCGCATCAAGATGTGCATCAAGGTGAATGCCG ACGATTTCGTCACCATCCACCACGAGCTGGGCCACAATTATTACCAGCGCGCCTATAACCAGCAGCCGTTCCTGTATCTG AACGGCGCCAACGATGGCTTTCACGAAGCGATCGGCGATTTTGTCGCGCTGTCGATCACGCCGCAATATCTGGTCGACAT CGGCCTGCTCGACAAGGCGAAGGTGCCGAGCGCCGACAAGGACATCGGCCTCCTGCTGCGGCAGGCGATGGACAAGGTCG CCTTCCTGCCGTTCGGCCTGCTCATCGACCGCTGGCGCTGGGGCGTGTTCGACGGGACGATCCAGCCCGCCGATTACAAC AAGGCGTGGACCGAGATGCGCACCCGATATCAGGGCATCGTTCCCCCGGCGGCCCGCCCCGCCGATGCATTCGATGCGGG GGCGAAATATCACATTCCTGGCAACACCCCCTATACGCGCTATTTCCTCGCGCGCATCCTGCAGTTCCAGTTTTACGAGG CGGCGTGCAGGCAGGCGGGGTGGAAGGGGCCGCTTCACCGCTGTTCCTTCTATGGCAACAGGGAGGTCGGCGCGAAGCTC AACGCGATGCTGGAGATGGGGGCGTCGAAGCCGTGGCCCGATGCGCTCGAAGCCTTCACCGGCAAGCGCGAGATGTCGGG CAAGGCGATGGCCGATTATTTCGCACCGCTGAAAAAATGGCTCGACGAGCAGAACAAGGGCAAGCCGCAGGGGTGGTGA
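The GC content reported for this gene can be recomputed from the sequence. A minimal sketch, assuming GC content means the percentage of G and C bases over all bases; the helper name `gc_content` is hypothetical, and whitespace is stripped because the sequence above is wrapped with spaces.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = "".join(seq.split()).upper()  # drop spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 12 bases of the gene sequence, used here only as a small example.
print(round(gc_content("ATGAAAGCC ATG"), 2))  # 41.67
```

Passing the full 1839-base sequence to the same function gives the gene-wide value.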
Upstream 100 bases:
>100_bases TGCGCCGACCGGTGATGAGTTCCCGCCTTCGCGGGCGCGCATGGCTTTTTCGTTCGGTGAAGCTCGTGGCGGGAACATCC ATAACAGTCAGAGGAACCCC
Downstream 100 bases:
>100_bases GGCGCCGCGGATAAGGAATGAAGGGGCCGGGGAGGGGACGTCCCGGCCCTTTTTTCGTTGGTGGGCGGGACAGGGGATAC AGTTTTCGAGGGTGGTGAGC
Product: peptidyl-dipeptidase A
Products: NA
Alternate protein names: Zinc-Dependent Metallopeptidase; Angiotensin-Converting; Dipeptidyl Carboxypeptidase; Angiotensin-Converting Family Protein; Dipeptidyl Carboxydipeptidase Family; Zinc Metallopeptidase Family Protein; Angiotensin-Converting Peptidyl Dipeptidase Protein; Peptidyl-Dipeptidase Dcp; Dipeptidyl Carboxydipeptidase Family Protein
Number of amino acids: Translated: 612; Mature: 612
Protein sequence:
>612_residues MKAMISTLSLALSLAVASPAFAQAAPAATPAAAPTAAEAEAFIAAAEKDLFDYTVEAAQVNWVNATYITEDTDAMAARIN AVGTEKAVKYALEAAKYSDVPGLSAETKRKLNILRTGLVLPAPTTPGAATELNRIATDLQSQYGKGRGTLNGKEIAGSDI EAEMGNLERTPVELAEMWTSWHDNVGAPMKQDYARMVAIANAGAKELGFADTGAMWRSGYDMPPEEFARLTEKIWQDMKP LYVALHTYVRWKLNEKYGDAVQPKTGPIRADLLGNMWAQEWGNIYPLVAPPGTGDLGYDIGELLAAQGKTPLDMVKIGEN FYSSLGMAPLPDTFWKRSMFTKPADREVVCHASAWNIDNKDDIRIKMCIKVNADDFVTIHHELGHNYYQRAYNQQPFLYL NGANDGFHEAIGDFVALSITPQYLVDIGLLDKAKVPSADKDIGLLLRQAMDKVAFLPFGLLIDRWRWGVFDGTIQPADYN KAWTEMRTRYQGIVPPAARPADAFDAGAKYHIPGNTPYTRYFLARILQFQFYEAACRQAGWKGPLHRCSFYGNREVGAKL NAMLEMGASKPWPDALEAFTGKREMSGKAMADYFAPLKKWLDEQNKGKPQGW
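The relationship between the 1839-base gene and the 612-residue protein can be checked by translation: 1839 / 3 = 613 codons, of which the last is a stop codon. The sketch below assumes the standard codon assignments and is not the database's own pipeline; `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight and last four codons of the gene sequence in this record.
print(translate("ATGAAAGCCATGATCTCGACGCTG"))  # MKAMISTL
print(translate("CAGGGGTGGTGA"))              # QGW
```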
Sequences:
>Translated_612_residues MKAMISTLSLALSLAVASPAFAQAAPAATPAAAPTAAEAEAFIAAAEKDLFDYTVEAAQVNWVNATYITEDTDAMAARIN AVGTEKAVKYALEAAKYSDVPGLSAETKRKLNILRTGLVLPAPTTPGAATELNRIATDLQSQYGKGRGTLNGKEIAGSDI EAEMGNLERTPVELAEMWTSWHDNVGAPMKQDYARMVAIANAGAKELGFADTGAMWRSGYDMPPEEFARLTEKIWQDMKP LYVALHTYVRWKLNEKYGDAVQPKTGPIRADLLGNMWAQEWGNIYPLVAPPGTGDLGYDIGELLAAQGKTPLDMVKIGEN FYSSLGMAPLPDTFWKRSMFTKPADREVVCHASAWNIDNKDDIRIKMCIKVNADDFVTIHHELGHNYYQRAYNQQPFLYL NGANDGFHEAIGDFVALSITPQYLVDIGLLDKAKVPSADKDIGLLLRQAMDKVAFLPFGLLIDRWRWGVFDGTIQPADYN KAWTEMRTRYQGIVPPAARPADAFDAGAKYHIPGNTPYTRYFLARILQFQFYEAACRQAGWKGPLHRCSFYGNREVGAKL NAMLEMGASKPWPDALEAFTGKREMSGKAMADYFAPLKKWLDEQNKGKPQGW >Mature_612_residues MKAMISTLSLALSLAVASPAFAQAAPAATPAAAPTAAEAEAFIAAAEKDLFDYTVEAAQVNWVNATYITEDTDAMAARIN AVGTEKAVKYALEAAKYSDVPGLSAETKRKLNILRTGLVLPAPTTPGAATELNRIATDLQSQYGKGRGTLNGKEIAGSDI EAEMGNLERTPVELAEMWTSWHDNVGAPMKQDYARMVAIANAGAKELGFADTGAMWRSGYDMPPEEFARLTEKIWQDMKP LYVALHTYVRWKLNEKYGDAVQPKTGPIRADLLGNMWAQEWGNIYPLVAPPGTGDLGYDIGELLAAQGKTPLDMVKIGEN FYSSLGMAPLPDTFWKRSMFTKPADREVVCHASAWNIDNKDDIRIKMCIKVNADDFVTIHHELGHNYYQRAYNQQPFLYL NGANDGFHEAIGDFVALSITPQYLVDIGLLDKAKVPSADKDIGLLLRQAMDKVAFLPFGLLIDRWRWGVFDGTIQPADYN KAWTEMRTRYQGIVPPAARPADAFDAGAKYHIPGNTPYTRYFLARILQFQFYEAACRQAGWKGPLHRCSFYGNREVGAKL NAMLEMGASKPWPDALEAFTGKREMSGKAMADYFAPLKKWLDEQNKGKPQGW
Specific function: Unknown
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism=Homo sapiens, GI23238214, Length=601, Percent_Identity=39.9334442595674, Blast_Score=444, Evalue=1e-125
Organism=Homo sapiens, GI4503273, Length=584, Percent_Identity=40.4109589041096, Blast_Score=439, Evalue=1e-123
Organism=Homo sapiens, GI295844837, Length=601, Percent_Identity=36.4392678868552, Blast_Score=374, Evalue=1e-103
Organism=Homo sapiens, GI11225609, Length=606, Percent_Identity=32.6732673267327, Blast_Score=329, Evalue=4e-90
Organism=Caenorhabditis elegans, GI71985287, Length=451, Percent_Identity=28.8248337028825, Blast_Score=207, Evalue=1e-53
Organism=Drosophila melanogaster, GI24584232, Length=591, Percent_Identity=38.2402707275804, Blast_Score=409, Evalue=1e-114
Organism=Drosophila melanogaster, GI17137008, Length=591, Percent_Identity=38.2402707275804, Blast_Score=409, Evalue=1e-114
Organism=Drosophila melanogaster, GI17137262, Length=453, Percent_Identity=39.514348785872, Blast_Score=363, Evalue=1e-100
Organism=Drosophila melanogaster, GI85724942, Length=441, Percent_Identity=36.5079365079365, Blast_Score=335, Evalue=4e-92
Organism=Drosophila melanogaster, GI24762773, Length=430, Percent_Identity=29.3023255813954, Blast_Score=175, Evalue=9e-44
Organism=Drosophila melanogaster, GI22026846, Length=597, Percent_Identity=25.7956448911223, Blast_Score=163, Evalue=2e-40
Organism=Drosophila melanogaster, GI28574153, Length=447, Percent_Identity=23.9373601789709, Blast_Score=138, Evalue=9e-33
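The homologue entries follow a fixed `Organism=..., GI..., Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...` layout, so they can be parsed into records for sorting or filtering. A minimal sketch, assuming that field order holds for every entry; `parse_homologues` and the dictionary keys are hypothetical names.

```python
import re

# One homologue entry; tolerant of entries separated by commas or newlines.
HOMOLOGUE = re.compile(
    r"Organism=([^,]+),\s*GI(\d+),\s*Length=(\d+),\s*"
    r"Percent_Identity=([\d.]+),\s*Blast_Score=(\d+),\s*Evalue=([0-9.eE+-]+)"
)


def parse_homologues(text: str) -> list[dict]:
    """Parse homologue entries into dictionaries with typed values."""
    return [
        {
            "organism": m.group(1).strip(),
            "gi": m.group(2),
            "length": int(m.group(3)),
            "percent_identity": float(m.group(4)),
            "blast_score": int(m.group(5)),
            "evalue": float(m.group(6)),
        }
        for m in HOMOLOGUE.finditer(text)
    ]


sample = (
    "Organism=Homo sapiens, GI23238214, Length=601, "
    "Percent_Identity=39.9334442595674, Blast_Score=444, Evalue=1e-125, "
    "Organism=Caenorhabditis elegans, GI71985287, Length=451, "
    "Percent_Identity=28.8248337028825, Blast_Score=207, Evalue=1e-53,"
)
for hit in parse_homologues(sample):
    print(hit["organism"], hit["gi"], hit["blast_score"])
```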
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 67584; Mature: 67584
Theoretical pI: Translated: 6.10; Mature: 6.10
Prosite motif: PS00142 ZINC_PROTEASE
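Zinc metallopeptidases of this kind typically carry the HEXXH zinc-binding motif. The sketch below searches for that simplified core motif only; it is an assumption-laden stand-in and not the full PS00142 pattern, and `find_hexxh` is a hypothetical helper.

```python
import re

# Simplified HEXXH core motif (His-Glu-any-any-His), not the full PROSITE pattern.
HEXXH = re.compile(r"HE..H")


def find_hexxh(protein: str) -> list[tuple[int, str]]:
    """Return (1-based start, matched text) for each HEXXH occurrence."""
    seq = "".join(protein.split()).upper()
    return [(m.start() + 1, m.group()) for m in HEXXH.finditer(seq)]


# Fragment taken from the protein sequence in this record.
print(find_hexxh("DFVTIHHELGHNYYQ"))  # [(7, 'HELGH')]
```

Running the same function on the full 612-residue sequence reports positions relative to the first methionine.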
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.7 %Cys (Translated Protein)
3.4 %Met (Translated Protein)
4.1 %Cys+Met (Translated Protein)
0.7 %Cys (Mature Protein)
3.4 %Met (Mature Protein)
4.1 %Cys+Met (Mature Protein)
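The Cys/Met figures are residue counts expressed as a percentage of protein length. A minimal sketch, assuming one-decimal rounding as shown in the record; `residue_percent` is a hypothetical helper.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`."""
    seq = "".join(protein.split()).upper()  # drop spaces and line breaks
    if not seq:
        raise ValueError("empty sequence")
    count = sum(1 for aa in seq if aa in residues)
    return round(100.0 * count / len(seq), 1)


# Short fragments from the protein sequence, used only as small examples.
print(residue_percent("MKAMISTL", "M"))   # 25.0
print(residue_percent("VVCHASAW", "CM"))  # 12.5
```

Calling `residue_percent(sequence, "C")`, `residue_percent(sequence, "M")`, and `residue_percent(sequence, "CM")` on the full sequence yields the three values in the same order as listed.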
Secondary structure:
>Translated Secondary Structure MKAMISTLSLALSLAVASPAFAQAAPAATPAAAPTAAEAEAFIAAAEKDLFDYTVEAAQV CCHHHHHHHHHHHHHHCCCHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHEEE NWVNATYITEDTDAMAARINAVGTEKAVKYALEAAKYSDVPGLSAETKRKLNILRTGLVL EEEEEEEEECCHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCEE PAPTTPGAATELNRIATDLQSQYGKGRGTLNGKEIAGSDIEAEMGNLERTPVELAEMWTS CCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCCCHHHHCCCCCCHHHHHHHHHH WHDNVGAPMKQDYARMVAIANAGAKELGFADTGAMWRSGYDMPPEEFARLTEKIWQDMKP HHCCCCCCHHHHHHHHHHHHCCCHHHCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHH LYVALHTYVRWKLNEKYGDAVQPKTGPIRADLLGNMWAQEWGNIYPLVAPPGTGDLGYDI HHHHHHHHHHEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCH GELLAAQGKTPLDMVKIGENFYSSLGMAPLPDTFWKRSMFTKPADREVVCHASAWNIDNK HHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCCCCEEEEEEECCCCCC DDIRIKMCIKVNADDFVTIHHELGHNYYQRAYNQQPFLYLNGANDGFHEAIGDFVALSIT CCEEEEEEEEECCCCEEEEHHHHCCHHHHHHCCCCCEEEEECCCCCHHHHHHHHEEEEEC PQYLVDIGLLDKAKVPSADKDIGLLLRQAMDKVAFLPFGLLIDRWRWGVFDGTIQPADYN HHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC KAWTEMRTRYQGIVPPAARPADAFDAGAKYHIPGNTPYTRYFLARILQFQFYEAACRQAG HHHHHHHHHHCCCCCCCCCCCCHHCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCC WKGPLHRCSFYGNREVGAKLNAMLEMGASKPWPDALEAFTGKREMSGKAMADYFAPLKKW CCCCHHHHHCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHCCCHHCCCHHHHHHHHHHHHH LDEQNKGKPQGW HHHCCCCCCCCC >Mature Secondary Structure MKAMISTLSLALSLAVASPAFAQAAPAATPAAAPTAAEAEAFIAAAEKDLFDYTVEAAQV CCHHHHHHHHHHHHHHCCCHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHEEE NWVNATYITEDTDAMAARINAVGTEKAVKYALEAAKYSDVPGLSAETKRKLNILRTGLVL EEEEEEEEECCHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCEE PAPTTPGAATELNRIATDLQSQYGKGRGTLNGKEIAGSDIEAEMGNLERTPVELAEMWTS CCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCCCHHHHCCCCCCHHHHHHHHHH WHDNVGAPMKQDYARMVAIANAGAKELGFADTGAMWRSGYDMPPEEFARLTEKIWQDMKP HHCCCCCCHHHHHHHHHHHHCCCHHHCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHH LYVALHTYVRWKLNEKYGDAVQPKTGPIRADLLGNMWAQEWGNIYPLVAPPGTGDLGYDI HHHHHHHHHHEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCH GELLAAQGKTPLDMVKIGENFYSSLGMAPLPDTFWKRSMFTKPADREVVCHASAWNIDNK 
HHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCCCCEEEEEEECCCCCC DDIRIKMCIKVNADDFVTIHHELGHNYYQRAYNQQPFLYLNGANDGFHEAIGDFVALSIT CCEEEEEEEEECCCCEEEEHHHHCCHHHHHHCCCCCEEEEECCCCCHHHHHHHHEEEEEC PQYLVDIGLLDKAKVPSADKDIGLLLRQAMDKVAFLPFGLLIDRWRWGVFDGTIQPADYN HHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC KAWTEMRTRYQGIVPPAARPADAFDAGAKYHIPGNTPYTRYFLARILQFQFYEAACRQAG HHHHHHHHHHCCCCCCCCCCCCHHCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCC WKGPLHRCSFYGNREVGAKLNAMLEMGASKPWPDALEAFTGKREMSGKAMADYFAPLKKW CCCCHHHHHCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHCCCHHCCCHHHHHHHHHHHHH LDEQNKGKPQGW HHHCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA