Definition Shewanella halifaxensis HAW-EB4 chromosome, complete genome.
Accession NC_010334
Length 5,226,917

The map label for this gene is 167623621

Identifier: 167623621

GI number: 167623621

Start: 2025376

End: 2026311

Strand: Direct

Name: 167623621

Synonym: Shal_1690

Alternate gene names: NA

Gene position: 2025376-2026311 (Clockwise)

Preceding gene: 167623620

Following gene: 167623622

Centisome position: 38.75
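
The coordinates and the centisome value above are linked by simple arithmetic. The sketch below assumes that the centisome position is the gene start expressed as a percentage of the chromosome length; the record does not state this definition, and the function names are illustrative only.

```python
GENOME_LENGTH = 5_226_917           # chromosome length from the record header
START, END = 2_025_376, 2_026_311   # gene coordinates from the record


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based interval."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                    # 936
print(round(centisome(START, GENOME_LENGTH), 2))  # 38.75
```

Both results agree with the 936-base gene sequence and the reported centisome position of 38.75.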

GC content: 45.09

Gene sequence:

>936_bases
GTGGCGGCTAAATCAGATTTAACCCTTCCGGTTGCAATTTCAGCTGGAATTCATATTGGCGTGATAATTATTCTCATTTT
AGGCGTGGATTTTTCAGAAAAGCCTAAAGTGCAGCCACAAGCAAGCGCCCCCGCTATGCAGGCTGTGGTTGTGGATCAAA
AAAAGGTCGCGCAGCACGTTGAAAGGCTTAAAGCGGATAAACGTGAAGCTGAGCGTAAAGAAAAAGCCCGTCAAGATGAA
GCTGATAGACGAGTGCGTGAAGCACGTAAAGAGCGTGAACGCGAACAGGCACAAATTAAGAAACTTGAGCAAGAGCGTAA
ACAAAAAGAGATTGAAACTAAAAATGCAGCTGACGCAGCAAAAGCGGCGCAGTTAAAGCAGAAACAGGAAAAAGAGAAAG
CGGATAAGGCTGAAGCAGATCGCAAGCAGAAAGAGAAAGAACGTAAAGCCTCTGAAGAAGCAGCGAAAAAAGCAGCGGAT
AAACGTAAAGCTGAAGAAGCGGCAGCTAAAAAAGCCGAAGACGAGCGAAAACGAAAAGCCGAAGCTGAACGTAAGCGAAA
AGCGGAAGAAAAAGCCAGACGTGAGCAAGAGCAGATGATGCAAGATGCATTGGCTGCTGAGCAAGCGGCACTTTCTCAGA
CCCGTAATAAGCAAGTCATGAATGAAGTACAACGTTATACCTCGATGATCAGAGCGACTATTCAACGTAACTTAGTGGTT
GATGAGTCTATGCGAGGTAAAAGTTGTCGAGTCTTCATCCGCTTAGCTAATGATGGTTTTGTGACCGCGAGCCAAACGCT
CGATGGAGACAGTGTAGTTTGTCGCGCAACAAAAGCGGCGATAAATAAAGCGGGTAGGTTACCTGTATCGAATGAGCCTG
ACGTTTATAACAAGCTCAAAGAAATCAATTTAACAGTTCAACCCGAGTTCAATTAA
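
The GC content field can be recomputed from a nucleotide sequence such as the one above. This is a minimal sketch, assuming the value is the percentage of G and C bases over the full gene length; `gc_content` is a hypothetical helper, not part of any tool named in this record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input, not taken from the record: 2 of 4 bases are G or C.
print(gc_content("GATC"))  # 50.0

# 422 G/C bases out of 936 would reproduce the reported value.
print(round(100.0 * 422 / 936, 2))  # 45.09
```

Passing the 936-base gene sequence (with line breaks removed) to `gc_content` should give a value close to the reported 45.09 if the assumed definition matches the one used for this record.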

Upstream 100 bases:

>100_bases
CTCAATCCCTTATGAAAAAGTGATCCAGTTGATGGTGACACTGCAAGGTGCTGGTGTGCCGTCAGTGGGGTTAATGACTG
ATTCGCCGGAGGATAAATAA

Downstream 100 bases:

>100_bases
AGGATCCATATGAAAATTTTTGGGAAATGGTTGCTGGTAACCCTGCTTATTTGCAGTATGCCGGTAAAGGCTGCGTTAGA
TATTGTGATTACAGAAGGTA

Product: Tol-Pal system TolA

Products: NA

Alternate protein names: TolA Family Protein; Protein TolA; TolA-Like Protein; Membrane Anchored Protein In TolA-TolQ-TolR Complex; Outer Membrane Integrity Protein TolA; And Transport-Associated Protein TolA; TolA Protein Membrane Component

Number of amino acids: Translated: 311; Mature: 310
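
The residue counts follow from the gene length. The sketch below assumes that the final codon (TAA in the gene sequence) is a stop codon that is not translated, and that the mature form differs from the translated form only by removal of the initiator methionine; the second assumption is inferred from the mature sequence lacking the leading M and is not stated in the record.

```python
GENE_BASES = 936
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard stop codons

# First and last three bases of the gene sequence in this record.
first_codon, last_codon = "GTG", "TAA"

codons = GENE_BASES // 3
# The stop codon does not encode a residue.
translated_residues = codons - (1 if last_codon in STOP_CODONS else 0)
# Assumed: the mature protein lacks only the initiator Met.
mature_residues = translated_residues - 1

print(codons, translated_residues, mature_residues)  # 312 311 310
```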

Protein sequence:

>311_residues
MAAKSDLTLPVAISAGIHIGVIIILILGVDFSEKPKVQPQASAPAMQAVVVDQKKVAQHVERLKADKREAERKEKARQDE
ADRRVREARKEREREQAQIKKLEQERKQKEIETKNAADAAKAAQLKQKQEKEKADKAEADRKQKEKERKASEEAAKKAAD
KRKAEEAAAKKAEDERKRKAEAERKRKAEEKARREQEQMMQDALAAEQAALSQTRNKQVMNEVQRYTSMIRATIQRNLVV
DESMRGKSCRVFIRLANDGFVTASQTLDGDSVVCRATKAAINKAGRLPVSNEPDVYNKLKEINLTVQPEFN

Sequences:

>Translated_311_residues
MAAKSDLTLPVAISAGIHIGVIIILILGVDFSEKPKVQPQASAPAMQAVVVDQKKVAQHVERLKADKREAERKEKARQDE
ADRRVREARKEREREQAQIKKLEQERKQKEIETKNAADAAKAAQLKQKQEKEKADKAEADRKQKEKERKASEEAAKKAAD
KRKAEEAAAKKAEDERKRKAEAERKRKAEEKARREQEQMMQDALAAEQAALSQTRNKQVMNEVQRYTSMIRATIQRNLVV
DESMRGKSCRVFIRLANDGFVTASQTLDGDSVVCRATKAAINKAGRLPVSNEPDVYNKLKEINLTVQPEFN
>Mature_310_residues
AAKSDLTLPVAISAGIHIGVIIILILGVDFSEKPKVQPQASAPAMQAVVVDQKKVAQHVERLKADKREAERKEKARQDEA
DRRVREARKEREREQAQIKKLEQERKQKEIETKNAADAAKAAQLKQKQEKEKADKAEADRKQKEKERKASEEAAKKAADK
RKAEEAAAKKAEDERKRKAEAERKRKAEEKARREQEQMMQDALAAEQAALSQTRNKQVMNEVQRYTSMIRATIQRNLVVD
ESMRGKSCRVFIRLANDGFVTASQTLDGDSVVCRATKAAINKAGRLPVSNEPDVYNKLKEINLTVQPEFN

Specific function: Unknown

COG id: COG3064

COG function: function code M; Membrane protein involved in colicin uptake

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 35146; Mature: 35015

Theoretical pI: Translated: 10.42; Mature: 10.42

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
2.3 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
2.6 %Cys+Met (Mature Protein)
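
The percentages above can be reproduced by counting residues in the protein sequence. This is a sketch, assuming each value is the residue count divided by the sequence length and rounded to one decimal place; `residue_percent` is an illustrative name.

```python
# Translated protein sequence, copied from the record.
TRANSLATED = (
    "MAAKSDLTLPVAISAGIHIGVIIILILGVDFSEKPKVQPQASAPAMQAVVVDQKKVAQHVERLKADKREAERKEKARQDE"
    "ADRRVREARKEREREQAQIKKLEQERKQKEIETKNAADAAKAAQLKQKQEKEKADKAEADRKQKEKERKASEEAAKKAAD"
    "KRKAEEAAAKKAEDERKRKAEAERKRKAEEKARREQEQMMQDALAAEQAALSQTRNKQVMNEVQRYTSMIRATIQRNLVV"
    "DESMRGKSCRVFIRLANDGFVTASQTLDGDSVVCRATKAAINKAGRLPVSNEPDVYNKLKEINLTVQPEFN"
)
# Mature form as listed in the record: the translated sequence without its first residue.
MATURE = TRANSLATED[1:]


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    hits = sum(protein.count(r) for r in residues)
    return 100.0 * hits / len(protein)


for label, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
    print(
        label,
        round(residue_percent(seq, "C"), 1),
        round(residue_percent(seq, "M"), 1),
        round(residue_percent(seq, "CM"), 1),
    )
# Translated 0.6 2.3 2.9
# Mature 0.6 1.9 2.6
```

The translated sequence contains 2 Cys and 7 Met residues out of 311; the mature sequence loses one Met and has 310 residues.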

Secondary structure:

>Translated Secondary Structure
MAAKSDLTLPVAISAGIHIGVIIILILGVDFSEKPKVQPQASAPAMQAVVVDQKKVAQHV
CCCCCCCCEEHHHHCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH
ERLKADKREAERKEKARQDEADRRVREARKEREREQAQIKKLEQERKQKEIETKNAADAA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
KAAQLKQKQEKEKADKAEADRKQKEKERKASEEAAKKAADKRKAEEAAAKKAEDERKRKA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
EAERKRKAEEKARREQEQMMQDALAAEQAALSQTRNKQVMNEVQRYTSMIRATIQRNLVV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
DESMRGKSCRVFIRLANDGFVTASQTLDGDSVVCRATKAAINKAGRLPVSNEPDVYNKLK
CCCCCCCCEEEEEEECCCCCEEECCCCCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHH
EINLTVQPEFN
HCCCEECCCCC
>Mature Secondary Structure 
AAKSDLTLPVAISAGIHIGVIIILILGVDFSEKPKVQPQASAPAMQAVVVDQKKVAQHV
CCCCCCCEEHHHHCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH
ERLKADKREAERKEKARQDEADRRVREARKEREREQAQIKKLEQERKQKEIETKNAADAA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
KAAQLKQKQEKEKADKAEADRKQKEKERKASEEAAKKAADKRKAEEAAAKKAEDERKRKA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
EAERKRKAEEKARREQEQMMQDALAAEQAALSQTRNKQVMNEVQRYTSMIRATIQRNLVV
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
DESMRGKSCRVFIRLANDGFVTASQTLDGDSVVCRATKAAINKAGRLPVSNEPDVYNKLK
CCCCCCCCEEEEEEECCCCCEEECCCCCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHH
EINLTVQPEFN
HCCCEECCCCC
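
The prediction strings above can be summarised as the fraction of residues in each state. The sketch below assumes the usual three-state reading of the letters (H = helix, E = strand, C = coil), which the record does not spell out; `state_fractions` is an illustrative name.

```python
from collections import Counter

# Predicted states for the translated protein, copied from the record.
SS = (
    "CCCCCCCCEEHHHHCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH"
    "HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH"
    "HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH"
    "HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH"
    "CCCCCCCCEEEEEEECCCCCEEECCCCCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHH"
    "HCCCEECCCCC"
)


def state_fractions(ss: str) -> dict[str, float]:
    """Fraction of positions in each state (H, E, C) of a prediction string."""
    counts = Counter(ss)
    return {state: counts[state] / len(ss) for state in "HEC"}


fractions = state_fractions(SS)
for state, value in fractions.items():
    print(state, round(value, 2))
```

A helix fraction well above the other two states is consistent with a mostly helical prediction for this protein.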

PDB accession: NA

Resolution: NA

Structure class: Alpha

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA