Definition | Shewanella pealeana ATCC 700345 chromosome, complete genome. |
---|---|
Accession | NC_009901 |
Length | 5,174,581 |
The map label for this gene is 157962161
Identifier: 157962161
GI number: 157962161
Start: 2851374
End: 2853212
Strand: Direct
Name: 157962161
Synonym: Spea_2340
Alternate gene names: NA
Gene position: 2851374-2853212 (Clockwise)
Preceding gene: 157962158
Following gene: 157962163
Centisome position: 55.1
GC content: 45.62
Gene sequence:
>1839_bases ATGAGTAAAAGCCTAGTTAAGGTTTCTAAACTTTCGGCAGTGATTGCGCTAACACTTGGCGTTTCAGCTTGTAGTAAAAT AGAGGCTGAGAGTAAGCCTGTCACAGTTGATAAAACTATAGAAGCTAAACAGTTTATCAAGGATGCTGAATCTAAAATGG CCGATCTCTCTATTGAGATTAATCGCTCTGAGTGGATCTATAGCAATTTTATCACCCAAGATACTGCGGCACTGGCAGCC AGTGTTTCTGAAAAATATACCGCTGCATCGGTAGAGCTCGCGACTACATCGGCGCAATTTGCCAATCTTCCATTAAGTGA CGCTGAAAAACGTAAACTCAATATACTTCGTAGTTCTATCGTACTCCCAGCTCCACTCGATGTGGTAAAAAATGCCGAAC TGGCCAATATTAGTGCCGAGCTAAACGGTCTATACGGTAAAGGAAAATATTGTTTCGAAGATGGTCACTGCCTAACTCAG CCTGAACTGTCTGCCATTATGGGAACATCAACTAATCCAGATGAGCTTTTAGAGGTGTGGCAAGGCTGGCGTGAAATAGC CAAACCTATGCGTCCGCTTTTTGAAAGAGAAGTGGAGCTTGCAAATGAAGGGGCCAAAGATCTGGGTTACGCAGATCTCT CTGAGCTTTGGCGTAGTAACTACGATATGAAGCCTGACGAGTTTTCCACTGAACTTGACCGATTGTGGGGGCAAGTTAAA CCTTTATATGATTCGCTACACTGCTATGTTCGTGGTGAGCTAAATGAGCAGTATGGTGATGATGTGGCACCAGCATCTGG AGCTATACCAGCCCATCTACTCGGCAATATGTGGGCACAAAGCTGGGGCAATATTTATGAGCAAGTCGCGCCAGAAAATG CCGATCCTGGATATGATGTCACTGAGCTGTTAGCCAAGCATAACTTTGATGAGATAAAAATGGTTGAGCAGGCCGAGACC TTCTTTACTTCTCTTGGCTTCGAGCCACTACCTGACACATTCTGGGAACGTTCTTTATTCGTCCAGCCTGAGGATCGCGA TGTGGTTTGTCACGCATCGGCCTGGGATCTCGATAGTCGCGACGATATCCGCATAAAGATGTGTATTCAGAAGACCGCCG AAGACTTTACCGTAATTCATCATGAGCTTGGGCATAACTTCTATCAACGCGCTTATAAAGACCAACCGTTTATCTTTAAA AATAGCGCCAACGATGGTTTCCACGAGGCGATTGGCGATACTGTTGCCTTATCGATTACGCCAGATTACTTGAAGCAAAT TGGATTACTTGACGAAGTCCCTGATGCGTCCAAAGATATTGGGCTACTACTCAAGCAAGCATTAGATAAGGTGGCGTTTT TGCCATTTGGTTTGATGATTGACCAATGGCGCTGGAAGGTCTTTAGCGGTGAAATCACACCTGAACAATATAATAAAGCA TGGTGGGAACTTCGCGAAAAATACCAAGGTGTGAAAGCTCCTATCGCCCGCAGCGAAGCAGACTTCGACCCAGGTGCTAA GTATCACGTGCCCGGTAATGTGCCCTACACGCGCTATTTCTTAGCGCATATTCTGCAATTCCAATTCCATCAATCCCTAT GTGAGATTGCTGGTGATAACGGCCCAGTTCATAGATGCAGTATTTATGGCAACAAGGAAGCCGGAGCAAAACTCAACACC ATGCTAGAAATGGGGCAAAGCAAGCCTTGGCCTGAGGCGTTAGCGGCAGTAACTGGCACTAAAGAGATGGATGCCAAAGC CGTACTCGATTACTTTGCGCCACTGCAAACTTGGTTAGATGAGCAAAACTCCACTGCAAATCGTCAATGTGGCTGGTAA
Upstream 100 bases:
>100_bases AGCAAACAGGTCAATATTTCAGTAAGCAATCTTAATCCCCTATTAATCGGCAACAAAATAAGTAAAATATGCAACACAAT AATAACAGGGGATATTGAGG
Downstream 100 bases:
>100_bases ATGCTAAATAATGCAGTGGACTGCTATTTTGTGTGAGTAATGCAAAATCCAAAAGGCTATTTTACTTAAAGTAAAATGGC CTTTTTAAAACCTAAATAAA
Product: peptidyl-dipeptidase A
Products: NA
Alternate protein names: Zinc-Dependent Metallopeptidase; Angiotensin-Converting; Dipeptidyl Carboxypeptidase; Angiotensin-Converting Family Protein; Dipeptidyl Carboxydipeptidase Family; Zinc Metallopeptidase Family Protein; Angiotensin-Converting Peptidyl Dipeptidase Protein; Peptidyl-Dipeptidase Dcp; Dipeptidyl Carboxydipeptidase Family Protein
Number of amino acids: Translated: 612; Mature: 611
Protein sequence:
>612_residues MSKSLVKVSKLSAVIALTLGVSACSKIEAESKPVTVDKTIEAKQFIKDAESKMADLSIEINRSEWIYSNFITQDTAALAA SVSEKYTAASVELATTSAQFANLPLSDAEKRKLNILRSSIVLPAPLDVVKNAELANISAELNGLYGKGKYCFEDGHCLTQ PELSAIMGTSTNPDELLEVWQGWREIAKPMRPLFEREVELANEGAKDLGYADLSELWRSNYDMKPDEFSTELDRLWGQVK PLYDSLHCYVRGELNEQYGDDVAPASGAIPAHLLGNMWAQSWGNIYEQVAPENADPGYDVTELLAKHNFDEIKMVEQAET FFTSLGFEPLPDTFWERSLFVQPEDRDVVCHASAWDLDSRDDIRIKMCIQKTAEDFTVIHHELGHNFYQRAYKDQPFIFK NSANDGFHEAIGDTVALSITPDYLKQIGLLDEVPDASKDIGLLLKQALDKVAFLPFGLMIDQWRWKVFSGEITPEQYNKA WWELREKYQGVKAPIARSEADFDPGAKYHVPGNVPYTRYFLAHILQFQFHQSLCEIAGDNGPVHRCSIYGNKEAGAKLNT MLEMGQSKPWPEALAAVTGTKEMDAKAVLDYFAPLQTWLDEQNSTANRQCGW
Sequences:
>Translated_612_residues MSKSLVKVSKLSAVIALTLGVSACSKIEAESKPVTVDKTIEAKQFIKDAESKMADLSIEINRSEWIYSNFITQDTAALAA SVSEKYTAASVELATTSAQFANLPLSDAEKRKLNILRSSIVLPAPLDVVKNAELANISAELNGLYGKGKYCFEDGHCLTQ PELSAIMGTSTNPDELLEVWQGWREIAKPMRPLFEREVELANEGAKDLGYADLSELWRSNYDMKPDEFSTELDRLWGQVK PLYDSLHCYVRGELNEQYGDDVAPASGAIPAHLLGNMWAQSWGNIYEQVAPENADPGYDVTELLAKHNFDEIKMVEQAET FFTSLGFEPLPDTFWERSLFVQPEDRDVVCHASAWDLDSRDDIRIKMCIQKTAEDFTVIHHELGHNFYQRAYKDQPFIFK NSANDGFHEAIGDTVALSITPDYLKQIGLLDEVPDASKDIGLLLKQALDKVAFLPFGLMIDQWRWKVFSGEITPEQYNKA WWELREKYQGVKAPIARSEADFDPGAKYHVPGNVPYTRYFLAHILQFQFHQSLCEIAGDNGPVHRCSIYGNKEAGAKLNT MLEMGQSKPWPEALAAVTGTKEMDAKAVLDYFAPLQTWLDEQNSTANRQCGW >Mature_611_residues SKSLVKVSKLSAVIALTLGVSACSKIEAESKPVTVDKTIEAKQFIKDAESKMADLSIEINRSEWIYSNFITQDTAALAAS VSEKYTAASVELATTSAQFANLPLSDAEKRKLNILRSSIVLPAPLDVVKNAELANISAELNGLYGKGKYCFEDGHCLTQP ELSAIMGTSTNPDELLEVWQGWREIAKPMRPLFEREVELANEGAKDLGYADLSELWRSNYDMKPDEFSTELDRLWGQVKP LYDSLHCYVRGELNEQYGDDVAPASGAIPAHLLGNMWAQSWGNIYEQVAPENADPGYDVTELLAKHNFDEIKMVEQAETF FTSLGFEPLPDTFWERSLFVQPEDRDVVCHASAWDLDSRDDIRIKMCIQKTAEDFTVIHHELGHNFYQRAYKDQPFIFKN SANDGFHEAIGDTVALSITPDYLKQIGLLDEVPDASKDIGLLLKQALDKVAFLPFGLMIDQWRWKVFSGEITPEQYNKAW WELREKYQGVKAPIARSEADFDPGAKYHVPGNVPYTRYFLAHILQFQFHQSLCEIAGDNGPVHRCSIYGNKEAGAKLNTM LEMGQSKPWPEALAAVTGTKEMDAKAVLDYFAPLQTWLDEQNSTANRQCGW
Specific function: Unknown
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
Organism=Homo sapiens, GI23238214, Length=595, Percent_Identity=41.0084033613445, Blast_Score=484, Evalue=1e-137
Organism=Homo sapiens, GI4503273, Length=582, Percent_Identity=41.5807560137457, Blast_Score=481, Evalue=1e-136
Organism=Homo sapiens, GI295844837, Length=595, Percent_Identity=37.1428571428571, Blast_Score=407, Evalue=1e-113
Organism=Homo sapiens, GI11225609, Length=590, Percent_Identity=36.9491525423729, Blast_Score=403, Evalue=1e-112
Organism=Caenorhabditis elegans, GI71985287, Length=470, Percent_Identity=29.7872340425532, Blast_Score=241, Evalue=9e-64
Organism=Drosophila melanogaster, GI24584232, Length=592, Percent_Identity=38.1756756756757, Blast_Score=417, Evalue=1e-116
Organism=Drosophila melanogaster, GI17137008, Length=592, Percent_Identity=38.1756756756757, Blast_Score=417, Evalue=1e-116
Organism=Drosophila melanogaster, GI17137262, Length=462, Percent_Identity=41.991341991342, Blast_Score=387, Evalue=1e-107
Organism=Drosophila melanogaster, GI85724942, Length=566, Percent_Identity=36.2190812720848, Blast_Score=373, Evalue=1e-103
Organism=Drosophila melanogaster, GI22026846, Length=461, Percent_Identity=28.1995661605206, Blast_Score=199, Evalue=5e-51
Organism=Drosophila melanogaster, GI24762773, Length=540, Percent_Identity=26.8518518518519, Blast_Score=187, Evalue=2e-47
Organism=Drosophila melanogaster, GI28574153, Length=543, Percent_Identity=26.3351749539595, Blast_Score=181, Evalue=2e-45
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 68714; Mature: 68583
Theoretical pI: Translated: 4.59; Mature: 4.59
Prosite motif: PS00013 PROKAR_LIPOPROTEIN ; PS00142 ZINC_PROTEASE
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.5 %Cys (Translated Protein)
2.0 %Met (Translated Protein)
3.4 %Cys+Met (Translated Protein)
1.5 %Cys (Mature Protein)
1.8 %Met (Mature Protein)
3.3 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MSKSLVKVSKLSAVIALTLGVSACSKIEAESKPVTVDKTIEAKQFIKDAESKMADLSIEI CCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCHHHHHHHHHHHHHHHHEEEEEE NRSEWIYSNFITQDTAALAASVSEKYTAASVELATTSAQFANLPLSDAEKRKLNILRSSI CCCCCHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECCHHHHCCCCCCHHHHHHHHHHHHC VLPAPLDVVKNAELANISAELNGLYGKGKYCFEDGHCLTQPELSAIMGTSTNPDELLEVW CCCCCHHHHCCCHHHCCHHHHCCCCCCCCEEEECCCCCCCCHHHHHHCCCCCHHHHHHHH QGWREIAKPMRPLFEREVELANEGAKDLGYADLSELWRSNYDMKPDEFSTELDRLWGQVK HHHHHHHHHHHHHHHHHHHHHCCCCHHCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH PLYDSLHCYVRGELNEQYGDDVAPASGAIPAHLLGNMWAQSWGNIYEQVAPENADPGYDV HHHHCEEEEEEECCCHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCH TELLAKHNFDEIKMVEQAETFFTSLGFEPLPDTFWERSLFVQPEDRDVVCHASAWDLDSR HHHHHHCCHHHHHHHHHHHHHHHHCCCCCCCHHHHCCCEEECCCCCCEEEEECCCCCCCC DDIRIKMCIQKTAEDFTVIHHELGHNFYQRAYKDQPFIFKNSANDGFHEAIGDTVALSIT CCCEEEHHHHHHHHHHHHHHHHHCHHHHHHHHCCCCEEEECCCCCCHHHHCCCEEEEEEC PDYLKQIGLLDEVPDASKDIGLLLKQALDKVAFLPFGLMIDQWRWKVFSGEITPEQYNKA HHHHHHCCCHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEECCCCCHHHHHHH WWELREKYQGVKAPIARSEADFDPGAKYHVPGNVPYTRYFLAHILQFQFHQSLCEIAGDN HHHHHHHHCCCCCCCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCC GPVHRCSIYGNKEAGAKLNTMLEMGQSKPWPEALAAVTGTKEMDAKAVLDYFAPLQTWLD CCEEEEEEECCCCCCHHHHHHHHHCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHC EQNSTANRQCGW CCCCCCCCCCCC >Mature Secondary Structure SKSLVKVSKLSAVIALTLGVSACSKIEAESKPVTVDKTIEAKQFIKDAESKMADLSIEI CCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCHHHHHHHHHHHHHHHHEEEEEE NRSEWIYSNFITQDTAALAASVSEKYTAASVELATTSAQFANLPLSDAEKRKLNILRSSI CCCCCHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECCHHHHCCCCCCHHHHHHHHHHHHC VLPAPLDVVKNAELANISAELNGLYGKGKYCFEDGHCLTQPELSAIMGTSTNPDELLEVW CCCCCHHHHCCCHHHCCHHHHCCCCCCCCEEEECCCCCCCCHHHHHHCCCCCHHHHHHHH QGWREIAKPMRPLFEREVELANEGAKDLGYADLSELWRSNYDMKPDEFSTELDRLWGQVK HHHHHHHHHHHHHHHHHHHHHCCCCHHCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH PLYDSLHCYVRGELNEQYGDDVAPASGAIPAHLLGNMWAQSWGNIYEQVAPENADPGYDV HHHHCEEEEEEECCCHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCH TELLAKHNFDEIKMVEQAETFFTSLGFEPLPDTFWERSLFVQPEDRDVVCHASAWDLDSR 
HHHHHHCCHHHHHHHHHHHHHHHHCCCCCCCHHHHCCCEEECCCCCCEEEEECCCCCCCC DDIRIKMCIQKTAEDFTVIHHELGHNFYQRAYKDQPFIFKNSANDGFHEAIGDTVALSIT CCCEEEHHHHHHHHHHHHHHHHHCHHHHHHHHCCCCEEEECCCCCCHHHHCCCEEEEEEC PDYLKQIGLLDEVPDASKDIGLLLKQALDKVAFLPFGLMIDQWRWKVFSGEITPEQYNKA HHHHHHCCCHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEECCCCCHHHHHHH WWELREKYQGVKAPIARSEADFDPGAKYHVPGNVPYTRYFLAHILQFQFHQSLCEIAGDN HHHHHHHHCCCCCCCCCCCCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCC GPVHRCSIYGNKEAGAKLNTMLEMGQSKPWPEALAAVTGTKEMDAKAVLDYFAPLQTWLD CCEEEEEEECCCCCCHHHHHHHHHCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHC EQNSTANRQCGW CCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA
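
Derived fields: the gene length, residue count, and centisome position reported in this record follow from the Start, End, and genome Length fields. The sketch below recomputes them as a consistency check. It assumes 1-based inclusive coordinates, a centisome defined as the start coordinate expressed as a percentage of genome length, and a terminal stop codon. These definitions are inferred from the values in the record, not stated by the source database, and the function names are hypothetical.

```python
# Values copied from the record fields (Length, Start, End).
GENOME_LENGTH = 5_174_581
GENE_START = 2_851_374
GENE_END = 2_853_212


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


length = gene_length(GENE_START, GENE_END)
codons = length // 3
residues = codons - 1  # assumes the final codon is a stop codon
position = round(centisome(GENE_START, GENOME_LENGTH), 1)

print(length, residues, position)
```

Under these assumptions the computed values agree with the record: 1839 bases, 612 translated residues, and a centisome position of 55.1.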