Definition Shewanella halifaxensis HAW-EB4 chromosome, complete genome.
Accession NC_010334
Length 5,226,917

The map label for this gene is 167623667

Identifier: 167623667

GI number: 167623667

Start: 2081112

End: 2083664

Strand: Direct

Name: 167623667

Synonym: Shal_1736

Alternate gene names: NA

Gene position: 2081112-2083664 (Clockwise)

Preceding gene: 167623666

Following gene: 167623668

Centisome position: 39.82

GC content: 44.97
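
The positional fields above are related by simple arithmetic. The following is a minimal sketch, assuming inclusive 1-based coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the chromosome length. The record does not state that definition, but it reproduces the listed value. The function names are illustrative only.

```python
# Values copied from the record above.
GENOME_LENGTH = 5_226_917
START = 2_081_112
END = 2_083_664


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                    # 2553
print(round(centisome(START, GENOME_LENGTH), 2))  # 39.82
```

The computed length matches the ">2553_bases" header of the gene sequence.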

Gene sequence:

>2553_bases
ATGGCTGCGTATTTATTAGAGCGTGGTTTAGCGAATGGTCTAGAGCGTATCGCTAACCTTAGTCATCGCCAGGGAGGCTC
ATATTTTGGCGCTCTTATGGCAAAGAAAGTACCTATTGTGACAGTGATGCTATCGCTGCTGAATTTATCCGTCACTCGTT
TATCAGTGCTTAGTTTATCAGTGCTTAGTTTATCACTACTTAGCTTATTAGTGCCGCAAAGTGCGATGGCGTCATCGTCA
ACAGTGCCCATTTCCATGGCTGAAATGAAGCAAGCAGGCTTGATATTTGAAAGTGAACAGGGAGAGCTGACCATAGCGCT
GCCAATGAAAACCGATGTCAGTATGCATGTCTCTGGTTGGGTTAACCGAGTATCTGTCCGCCACGAATTCAAAAACATGT
CGAGTGAGTGGGTAAATGGTCAGTATCTATTTCCTTTACCGAATGAAGCGGCGGTTGACCAACTTAAGCTACATATTGGC
GCTAGAGTGATTGAAGGGCAGATCCAGCCTAAAGCAAAGGCAAAAGCGATATATGAGCAAGCTAAGGTAGAAGGTAAAAA
GGCTAGTTTACTGGAGCAAAAGCGGGCCAATATTTTTAGTGCTCAGGTGGCAAATTTGGCACCAAATGAAATGTTGATTG
TCGAGCTTACCTATCAAGAAACGCTAGACTATAAAGATGGTGCATTTAGCCTACGCTTCCCTATGGTGATCGCGCCAAGA
TATGCCCCTAGGCAAGAAGCCGACAGCTATAACAAACTTAATAAACCTCAGGCCCTTAGCCAGCAGATTATTAACGGTAC
TAAGCTAAATTATAAGCAAAGTAATGAGCTAATTGATATTAACAAAAGTGTCTATGCGCATAGCGCTGTCGTTAAGGCCG
AAGATGAAGCGTTAGAATCTGAAGCATTAGAATCTAGAGAACGGCAAAATCGAGTCTCGATGACAGTTACTTTTGATGCG
GCAATGCCAATAGAGAATATTGTCAGCCCTTATCATGGTATTAGCATTAATATGGTCGAAAATGCTGCGGCTCAAGTGTC
GTTAGATAACTATGCCGTTGCGAATCGTGATTTTGTGCTGACTTGGCAACCCGTGCAAGGTAGTGAGCCTACGGCAGCCG
TATTTTCTCAACAAGGCAAAACTCATGCTGAGTTAGCCTCACAAGTTACCGCAGGCGATACCTCGTTCAATCAGGGGGGC
GCTAAAAGTAAGCTAAGCCCCCAATCTCAACCAGAACCACAGTCGCAATTACAAGTACAAGATAGCAAGCAACAGACACT
GTCGAAAAAAGCATTAGAAAAATATGCCTTGGTGATGTTAATGCCACCTCAAGGGAGTGACGATGAGTCATCATCGATTG
CACGAGAGTTGGTCTTGGTCATCGATACATCGGGTTCGATGTCGGGGGATGCCATCATTCAGGCAAAATCAGCGCTGAAA
TATGCATTAGCAGGGCTGCGCCCCCAAGATAGTTTCAATGTATTGCAGTTCAACTCCACAGTTGAGCGGTGGTCTAGGCA
TGTAATGCCTGCAACGGCAATTAATCTTGGCCGAGCACAAAATTATATCAATGGTTTACAAGCTGATGGTGGCACTGAGA
TGTCTTTAGCGCTCGATGCCGCACTAACTAAGCTTGACAATGATCGCGGCCATAATAGTAAGCCTGTTCATGACGATGAC
AGATATCAGAGCAGCAATGAGACCCTTGAACAAAGCGCTGCGACACCATTACGGCAAGTGTTATTTATTACCGACGGAGC
CGTGGCTAATGAGTCTAGGTTATTTGAGCAGATAAAAAATCAATTAGGTGAAAGCCGCTTATTTACTATCGGAATAGGCT
CGGCACCCAATGCGCATTTTATGCAAAGAGCGGCAGAGGTTGGCAGGGGAACTTATACCTATATTGGTAAACTTGATGAG
GTAAACCAAAAAGTGGTGTCGCTATTGGAGAAAATAGAGAAGCCTCAAGTTACCGATGTCGAACTTCATTTTAGTGATGG
CAGTGTACCGGACTATTGGCCAGTCCGTATTCCAGATCTTTATGCTCACGAGCCAGTACTGGTCGCCCTGCGTATTCCAA
GCTATGTCAGTGATGACTTATTGATCCAGGGGCAATTAGCGGGGCAATTTTGGCAGCGACGCTTACCACTTAATAGCGCA
GCTCAAGTTAACGACTTAGAGCAAGCTAAAGGCTTAGACTTAATATGGGCAAGAAAGCAGATCGCCGCCCTAGAGCTTAG
TAAGCAAACAGCGAATAAAGAGAGGATTGAAAAACAGATCACGGCGATAGCGATGAAGTTTCATATCATGAGTGCTTATA
CAAGCCTAGTTGCGGTTGATGTCACGCCGGTAAAACCGACAAGTATTGCAGCAAAAGAGGCTAGGGTGATCCAGCATCTG
CCAAGCGGATGGCAAAGGCTATCACAAACACTTCCACAAACAGGAACGAATAGTTTTGTGTTCATGATTTTTGGTACGAC
TCTGTTGCTATTAGCGGCGTTATATCGGCTGTCTTTTAGGCGTACTGACATAAGTGAACCTAGCTGTGGGTAG
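
A GC content value such as the one listed above can be derived from a sequence block like this one. The sketch below assumes GC content means the count of G and C bases divided by the total number of bases, times 100. The function name is hypothetical and the input shown is a toy string, not the gene itself.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace and case are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy input, not the gene above: 2 of 4 bases are G or C.
print(gc_percent("ATGC"))  # 50.0
```

To apply it to the record, pass the sequence lines without the ">2553_bases" header; line breaks are stripped by the function.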

Upstream 100 bases:

>100_bases
ACCGTAAGCACCTGTTTGGCCATGTGAGCTAGGCAGGTTAGCACTGCCACAGAGTCTATCTAATCGAGCCTTTGTGGTAG
ATAAGTAAAAGGAGTAACAG

Downstream 100 bases:

>100_bases
CATTTTAAAGCAAGGCGATAATTTTTATGATGATGCAAGGCGTGATAAACGCCTTGGCAGACGGTTTCATCAGTATATTC
CGTGGTTGTTATTGGCTGCA

Product: cell wall anchor domain-containing protein

Products: NA

Alternate protein names: Vault Protein Inter-Alpha-Trypsin; Von Willebrand Factor Type A Domain-Containing Protein; Inter-Alpha-Trypsin Inhibitor Domain-Containing Protein; Vault Protein Inter-Alpha-Trypsin Domain Protein; Von Willebrand Factor Type A; Cell Wall Anchor Domain-Containing Protein; Vault Protein Inter-Alpha-Trypsin Domain-Containing Protein; Von Willebrand Factor Type A Domain Protein; LPXTG-Motif Cell Wall Anchor Domain-Containing Protein; Vault Protein Inter-Alpha-Trypsin Domain; Protein Containing a Von Willebrand Factor Type A Domain; Transmembrane Protein; LPXTG-Motif Cell Wall Anchor Domain Protein; Inter-Alpha-Trypsin Inhibitor Domain Protein; Von Willebrand Factor

Number of amino acids: Translated: 850; Mature: 849
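
Both lengths are consistent with the 2553-base gene: 2553 / 3 gives 851 codons, of which the last (TAG) is a stop codon, leaving 850 residues. The one-residue difference for the mature form is consistent with removal of the initiator methionine, although the record does not state how the mature form was derived. The sketch below encodes those two assumptions; the function names are illustrative.

```python
def translated_length(cds_bases: int) -> int:
    """Residue count for a CDS that includes its stop codon."""
    if cds_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bases // 3 - 1  # drop the stop codon


def mature_sequence(protein: str) -> str:
    """Remove a leading initiator Met, if present (assumed meaning of 'mature')."""
    return protein[1:] if protein.startswith("M") else protein


print(translated_length(2553))    # 850
print(mature_sequence("MAAYLL"))  # AAYLL
```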

Protein sequence:

>850_residues
MAAYLLERGLANGLERIANLSHRQGGSYFGALMAKKVPIVTVMLSLLNLSVTRLSVLSLSVLSLSLLSLLVPQSAMASSS
TVPISMAEMKQAGLIFESEQGELTIALPMKTDVSMHVSGWVNRVSVRHEFKNMSSEWVNGQYLFPLPNEAAVDQLKLHIG
ARVIEGQIQPKAKAKAIYEQAKVEGKKASLLEQKRANIFSAQVANLAPNEMLIVELTYQETLDYKDGAFSLRFPMVIAPR
YAPRQEADSYNKLNKPQALSQQIINGTKLNYKQSNELIDINKSVYAHSAVVKAEDEALESEALESRERQNRVSMTVTFDA
AMPIENIVSPYHGISINMVENAAAQVSLDNYAVANRDFVLTWQPVQGSEPTAAVFSQQGKTHAELASQVTAGDTSFNQGG
AKSKLSPQSQPEPQSQLQVQDSKQQTLSKKALEKYALVMLMPPQGSDDESSSIARELVLVIDTSGSMSGDAIIQAKSALK
YALAGLRPQDSFNVLQFNSTVERWSRHVMPATAINLGRAQNYINGLQADGGTEMSLALDAALTKLDNDRGHNSKPVHDDD
RYQSSNETLEQSAATPLRQVLFITDGAVANESRLFEQIKNQLGESRLFTIGIGSAPNAHFMQRAAEVGRGTYTYIGKLDE
VNQKVVSLLEKIEKPQVTDVELHFSDGSVPDYWPVRIPDLYAHEPVLVALRIPSYVSDDLLIQGQLAGQFWQRRLPLNSA
AQVNDLEQAKGLDLIWARKQIAALELSKQTANKERIEKQITAIAMKFHIMSAYTSLVAVDVTPVKPTSIAAKEARVIQHL
PSGWQRLSQTLPQTGTNSFVFMIFGTTLLLLAALYRLSFRRTDISEPSCG

Sequences:

>Translated_850_residues
MAAYLLERGLANGLERIANLSHRQGGSYFGALMAKKVPIVTVMLSLLNLSVTRLSVLSLSVLSLSLLSLLVPQSAMASSS
TVPISMAEMKQAGLIFESEQGELTIALPMKTDVSMHVSGWVNRVSVRHEFKNMSSEWVNGQYLFPLPNEAAVDQLKLHIG
ARVIEGQIQPKAKAKAIYEQAKVEGKKASLLEQKRANIFSAQVANLAPNEMLIVELTYQETLDYKDGAFSLRFPMVIAPR
YAPRQEADSYNKLNKPQALSQQIINGTKLNYKQSNELIDINKSVYAHSAVVKAEDEALESEALESRERQNRVSMTVTFDA
AMPIENIVSPYHGISINMVENAAAQVSLDNYAVANRDFVLTWQPVQGSEPTAAVFSQQGKTHAELASQVTAGDTSFNQGG
AKSKLSPQSQPEPQSQLQVQDSKQQTLSKKALEKYALVMLMPPQGSDDESSSIARELVLVIDTSGSMSGDAIIQAKSALK
YALAGLRPQDSFNVLQFNSTVERWSRHVMPATAINLGRAQNYINGLQADGGTEMSLALDAALTKLDNDRGHNSKPVHDDD
RYQSSNETLEQSAATPLRQVLFITDGAVANESRLFEQIKNQLGESRLFTIGIGSAPNAHFMQRAAEVGRGTYTYIGKLDE
VNQKVVSLLEKIEKPQVTDVELHFSDGSVPDYWPVRIPDLYAHEPVLVALRIPSYVSDDLLIQGQLAGQFWQRRLPLNSA
AQVNDLEQAKGLDLIWARKQIAALELSKQTANKERIEKQITAIAMKFHIMSAYTSLVAVDVTPVKPTSIAAKEARVIQHL
PSGWQRLSQTLPQTGTNSFVFMIFGTTLLLLAALYRLSFRRTDISEPSCG
>Mature_849_residues
AAYLLERGLANGLERIANLSHRQGGSYFGALMAKKVPIVTVMLSLLNLSVTRLSVLSLSVLSLSLLSLLVPQSAMASSST
VPISMAEMKQAGLIFESEQGELTIALPMKTDVSMHVSGWVNRVSVRHEFKNMSSEWVNGQYLFPLPNEAAVDQLKLHIGA
RVIEGQIQPKAKAKAIYEQAKVEGKKASLLEQKRANIFSAQVANLAPNEMLIVELTYQETLDYKDGAFSLRFPMVIAPRY
APRQEADSYNKLNKPQALSQQIINGTKLNYKQSNELIDINKSVYAHSAVVKAEDEALESEALESRERQNRVSMTVTFDAA
MPIENIVSPYHGISINMVENAAAQVSLDNYAVANRDFVLTWQPVQGSEPTAAVFSQQGKTHAELASQVTAGDTSFNQGGA
KSKLSPQSQPEPQSQLQVQDSKQQTLSKKALEKYALVMLMPPQGSDDESSSIARELVLVIDTSGSMSGDAIIQAKSALKY
ALAGLRPQDSFNVLQFNSTVERWSRHVMPATAINLGRAQNYINGLQADGGTEMSLALDAALTKLDNDRGHNSKPVHDDDR
YQSSNETLEQSAATPLRQVLFITDGAVANESRLFEQIKNQLGESRLFTIGIGSAPNAHFMQRAAEVGRGTYTYIGKLDEV
NQKVVSLLEKIEKPQVTDVELHFSDGSVPDYWPVRIPDLYAHEPVLVALRIPSYVSDDLLIQGQLAGQFWQRRLPLNSAA
QVNDLEQAKGLDLIWARKQIAALELSKQTANKERIEKQITAIAMKFHIMSAYTSLVAVDVTPVKPTSIAAKEARVIQHLP
SGWQRLSQTLPQTGTNSFVFMIFGTTLLLLAALYRLSFRRTDISEPSCG

Specific function: Unknown

COG id: COG2304

COG function: function code R; Uncharacterized protein containing a von Willebrand factor type A (vWA) domain

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Homo sapiens, GI133925809, Length=359, Percent_Identity=23.1197771587744, Blast_Score=91, Evalue=5e-18,
Organism=Homo sapiens, GI70778918, Length=348, Percent_Identity=23.8505747126437, Blast_Score=91, Evalue=7e-18,
Organism=Homo sapiens, GI153945780, Length=354, Percent_Identity=20.0564971751412, Blast_Score=85, Evalue=4e-16,
Organism=Homo sapiens, GI153945711, Length=354, Percent_Identity=20.0564971751412, Blast_Score=84, Evalue=4e-16,
Organism=Homo sapiens, GI49355778, Length=348, Percent_Identity=19.2528735632184, Blast_Score=84, Evalue=5e-16,
Organism=Homo sapiens, GI262050538, Length=437, Percent_Identity=23.3409610983982, Blast_Score=81, Evalue=4e-15,
Organism=Homo sapiens, GI31542984, Length=361, Percent_Identity=22.7146814404432, Blast_Score=81, Evalue=5e-15,
Organism=Homo sapiens, GI38348336, Length=350, Percent_Identity=23.7142857142857, Blast_Score=81, Evalue=5e-15,
Organism=Homo sapiens, GI261878614, Length=247, Percent_Identity=23.8866396761134, Blast_Score=77, Evalue=6e-14,
Organism=Homo sapiens, GI156119625, Length=247, Percent_Identity=23.8866396761134, Blast_Score=77, Evalue=6e-14,
Organism=Homo sapiens, GI261878618, Length=243, Percent_Identity=24.2798353909465, Blast_Score=76, Evalue=2e-13,
Organism=Homo sapiens, GI261878616, Length=243, Percent_Identity=24.2798353909465, Blast_Score=76, Evalue=2e-13,
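
Each homologue line above is a comma-separated list of key=value fields plus a bare GI identifier. The following is a minimal parsing sketch, assuming that layout holds for every line. The function name and the choice of numeric types are assumptions, not part of the record.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # keep the identifier as text
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = ("Organism=Homo sapiens, GI133925809, Length=359, "
        "Percent_Identity=23.1197771587744, Blast_Score=91, Evalue=5e-18,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], hit["Length"], hit["Evalue"])
```

The long Percent_Identity decimals appear to be ratios of identical positions to the Length field; for the first hit, 23.1197771587744 % of 359 is 83 positions.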

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 93441; Mature: 93309

Theoretical pI: Translated: 6.81; Mature: 6.81

Prosite motif: PS50234 VWFA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.1 %Cys     (Translated Protein)
2.7 %Met     (Translated Protein)
2.8 %Cys+Met (Translated Protein)
0.1 %Cys     (Mature Protein)
2.6 %Met     (Mature Protein)
2.7 %Cys+Met (Mature Protein)
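
The percentages above can be reproduced by counting residues in the protein sequences. The sketch below assumes each value is the residue count divided by the sequence length, rounded to one decimal place. The function name is hypothetical, and the demonstration input is only the final 50-residue line of the protein sequence, not the whole protein.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the given one-letter residues, rounded to 0.1."""
    if not protein:
        raise ValueError("empty sequence")
    count = sum(protein.count(residue) for residue in residues)
    return round(100.0 * count / len(protein), 1)


# Final 50 residues of the protein sequence listed in the record.
fragment = "PSGWQRLSQTLPQTGTNSFVFMIFGTTLLLLAALYRLSFRRTDISEPSCG"
print(residue_percent(fragment, "C"))   # 2.0
print(residue_percent(fragment, "M"))   # 2.0
print(residue_percent(fragment, "CM"))  # 4.0
```

For the full sequences, the listed values correspond to 1 Cys and 23 Met in 850 residues (translated) and 1 Cys and 22 Met in 849 residues (mature).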

Secondary structure:

>Translated Secondary Structure
MAAYLLERGLANGLERIANLSHRQGGSYFGALMAKKVPIVTVMLSLLNLSVTRLSVLSLS
CHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHH
VLSLSLLSLLVPQSAMASSSTVPISMAEMKQAGLIFESEQGELTIALPMKTDVSMHVSGW
HHHHHHHHHHCCCHHHCCCCCCCEEHHHHHHCCCEEECCCCCEEEEEECCCCCEEEHHHH
VNRVSVRHEFKNMSSEWVNGQYLFPLPNEAAVDQLKLHIGARVIEGQIQPKAKAKAIYEQ
HHHHHHHHHHHCCHHHCCCCEEEECCCCHHHHHHHHHHHCCEEEECCCCCHHHHHHHHHH
AKVEGKKASLLEQKRANIFSAQVANLAPNEMLIVELTYQETLDYKDGAFSLRFPMVIAPR
HHCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEHHHHCCCCCCCEEEECCEEEECC
YAPRQEADSYNKLNKPQALSQQIINGTKLNYKQSNELIDINKSVYAHSAVVKAEDEALES
CCCCHHHHHHHHCCCCHHHHHHHHCCCCCCCCCCCCEEECCCHHHHHHHEEECHHHHHHH
EALESRERQNRVSMTVTFDAAMPIENIVSPYHGISINMVENAAAQVSLDNYAVANRDFVL
HHHHHHHHCCCEEEEEEEECCCCHHHHCCCCCCCEEEEECCCHHEEEECCEEEECCCEEE
TWQPVQGSEPTAAVFSQQGKTHAELASQVTAGDTSFNQGGAKSKLSPQSQPEPQSQLQVQ
EEECCCCCCCHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHEEH
DSKQQTLSKKALEKYALVMLMPPQGSDDESSSIARELVLVIDTSGSMSGDAIIQAKSALK
HHHHHHHHHHHHHHEEEEEEECCCCCCCHHHHHHEEEEEEEECCCCCCCCHHHHHHHHHH
YALAGLRPQDSFNVLQFNSTVERWSRHVMPATAINLGRAQNYINGLQADGGTEMSLALDA
HHHHCCCCCCCCCEEEECHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCEEHHHHHH
ALTKLDNDRGHNSKPVHDDDRYQSSNETLEQSAATPLRQVLFITDGAVANESRLFEQIKN
HHHHHCCCCCCCCCCCCCCHHHCCCHHHHHHHHCCHHHHEEEEECCCCCCHHHHHHHHHH
QLGESRLFTIGIGSAPNAHFMQRAAEVGRGTYTYIGKLDEVNQKVVSLLEKIEKPQVTDV
HCCCCEEEEEECCCCCCHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHCCCCCEEE
ELHFSDGSVPDYWPVRIPDLYAHEPVLVALRIPSYVSDDLLIQGQLAGQFWQRRLPLNSA
EEEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCEEEEEHHHHHHHHHCCCCCCC
AQVNDLEQAKGLDLIWARKQIAALELSKQTANKERIEKQITAIAMKFHIMSAYTSLVAVD
HHCCHHHHHCCCCEEEHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHEEEEE
VTPVKPTSIAAKEARVIQHLPSGWQRLSQTLPQTGTNSFVFMIFGTTLLLLAALYRLSFR
ECCCCCCHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCCEEEHHHHHHHHHHHHHHHHHHH
RTDISEPSCG
CCCCCCCCCC
>Mature Secondary Structure
AAYLLERGLANGLERIANLSHRQGGSYFGALMAKKVPIVTVMLSLLNLSVTRLSVLSLS
HHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHH
VLSLSLLSLLVPQSAMASSSTVPISMAEMKQAGLIFESEQGELTIALPMKTDVSMHVSGW
HHHHHHHHHHCCCHHHCCCCCCCEEHHHHHHCCCEEECCCCCEEEEEECCCCCEEEHHHH
VNRVSVRHEFKNMSSEWVNGQYLFPLPNEAAVDQLKLHIGARVIEGQIQPKAKAKAIYEQ
HHHHHHHHHHHCCHHHCCCCEEEECCCCHHHHHHHHHHHCCEEEECCCCCHHHHHHHHHH
AKVEGKKASLLEQKRANIFSAQVANLAPNEMLIVELTYQETLDYKDGAFSLRFPMVIAPR
HHCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEHHHHCCCCCCCEEEECCEEEECC
YAPRQEADSYNKLNKPQALSQQIINGTKLNYKQSNELIDINKSVYAHSAVVKAEDEALES
CCCCHHHHHHHHCCCCHHHHHHHHCCCCCCCCCCCCEEECCCHHHHHHHEEECHHHHHHH
EALESRERQNRVSMTVTFDAAMPIENIVSPYHGISINMVENAAAQVSLDNYAVANRDFVL
HHHHHHHHCCCEEEEEEEECCCCHHHHCCCCCCCEEEEECCCHHEEEECCEEEECCCEEE
TWQPVQGSEPTAAVFSQQGKTHAELASQVTAGDTSFNQGGAKSKLSPQSQPEPQSQLQVQ
EEECCCCCCCHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHEEH
DSKQQTLSKKALEKYALVMLMPPQGSDDESSSIARELVLVIDTSGSMSGDAIIQAKSALK
HHHHHHHHHHHHHHEEEEEEECCCCCCCHHHHHHEEEEEEEECCCCCCCCHHHHHHHHHH
YALAGLRPQDSFNVLQFNSTVERWSRHVMPATAINLGRAQNYINGLQADGGTEMSLALDA
HHHHCCCCCCCCCEEEECHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCEEHHHHHH
ALTKLDNDRGHNSKPVHDDDRYQSSNETLEQSAATPLRQVLFITDGAVANESRLFEQIKN
HHHHHCCCCCCCCCCCCCCHHHCCCHHHHHHHHCCHHHHEEEEECCCCCCHHHHHHHHHH
QLGESRLFTIGIGSAPNAHFMQRAAEVGRGTYTYIGKLDEVNQKVVSLLEKIEKPQVTDV
HCCCCEEEEEECCCCCCHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHCCCCCEEE
ELHFSDGSVPDYWPVRIPDLYAHEPVLVALRIPSYVSDDLLIQGQLAGQFWQRRLPLNSA
EEEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCEEEEEHHHHHHHHHCCCCCCC
AQVNDLEQAKGLDLIWARKQIAALELSKQTANKERIEKQITAIAMKFHIMSAYTSLVAVD
HHCCHHHHHCCCCEEEHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHEEEEE
VTPVKPTSIAAKEARVIQHLPSGWQRLSQTLPQTGTNSFVFMIFGTTLLLLAALYRLSFR
ECCCCCCHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCCEEEHHHHHHHHHHHHHHHHHHH
RTDISEPSCG
CCCCCCCCCC
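
The secondary structure blocks above alternate a line of residues with a line of per-residue states. The following is a minimal sketch for summarising such a block, assuming the conventional reading H = helix, E = strand, C = coil (the record does not define the letters). The function name is illustrative.

```python
from collections import Counter


def state_fractions(block_lines: list[str]) -> dict[str, float]:
    """Fraction of H, E and C states in alternating residue/state lines."""
    states = "".join(line.strip() for line in block_lines[1::2])
    if not states:
        raise ValueError("no state lines found")
    counts = Counter(states)
    return {state: counts[state] / len(states) for state in "HEC"}


# Last residue/state pair of the block above.
tail = ["RTDISEPSCG", "CCCCCCCCCC"]
print(state_fractions(tail))  # {'H': 0.0, 'E': 0.0, 'C': 1.0}
```

Pass the lines of one block without its ">" header line, so that residue lines sit at even indices and state lines at odd indices.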

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA