Definition Herpetosiphon aurantiacus ATCC 23779 chromosome, complete genome.
Accession NC_009972
Length 6,346,587

Click here to switch to the map view.

The map label for this gene is olpB [H]

Identifier: 159897625

GI number: 159897625

Start: 1255070

End: 1256815

Strand: Direct

Name: olpB [H]

Synonym: Haur_1096

Alternate gene names: 159897625

Gene position: 1255070-1256815 (Clockwise)

Preceding gene: 159897624

Following gene: 159897627

Centisome position: 19.78

GC content: 50.86

Gene sequence:

>1746_bases
ATGCAATTGCGCTCATTGCGGTCGTTGTTGACTGTTGATCGTGCTCGGGTCGGCTCATTATTAAGTGTTTTATTGGTAGG
CTTGTTGGCTTGGATGCTGCCGCAGACCAATGCTCAGCCTGCTTTCGCTGCATCAAGCACCGTCGTGATTAGCCAAGTTT
ATGGGGGTGGCGGCTCTGCTACGGCTACTTACAAGAGTGATTACGTAGAATTGTTCAATTTGAGTGGTTCTGCTGTCTCT
TTAAATGGCTTGTCGATTCAATATGCTTCAAGCACAGGGAACTTTAATGGTGTTTTCGCTTTACCAAATGCTACAATTCT
ACCTGGCAAATACTATCTCGTACAGCTATCTCTGGGTACAGGTCTAGGCGATATCCCAACTCCAGATGCAGCTTCTGGAA
CTAATATCGCTATGTCTGCAACTGCAGGTAAGGTTATTATTGCGAATACCACTACAGCATTAGGGTGTTCCACAAGCGCG
ACTTGTACTCCTGCTCAACAAGCCCAAATTATTGATCTCGTTGGTTATGGCACAGCTGCTAATTACTTCGAAGGTAGTGG
GCCAACAGGTGCGCCAAGCAATACGACGAGCGTTATTCGTACCAATCCTTGTGTTGATGCCGATAATAATGCAACTGAAT
TTAGCGTGGGTACGCCAAACCCACGTAATACTGCCAGCCCTACGTTGAGCTGTTCAGCGGCCACCAATACACCAACGAAC
ACGCCGACTAATACCGCGACCAACACACCAACCAGCACTCCAATTGTGCTTGGGGGCGATAATAATATCCTGTGGGATCA
GCTCTATCACAGCGCCACTGCTGTAAATCCTCAACTTGAGCTTGTGCCAAACGAGAGCTACAGCTTTTTGCATAGTGCTA
GTGGCACAATCGACGAAACCACGGCTGTGACGATTTCGGCATTAACTGATGCGCTTGATGTGCAAACGGTTAGCCTGCGC
TACTGGGATGGAGCGAATTCGACTACAATTCCAATGACGAGGATTAAATCGTTGAGCGCTAGCTTTCGCAGCCAGCCAAT
CCATAGCTACGATTTGTGGCAGGCTAGCATTCCAGCTCAGCCAATCGGCACAAGCATTTTCTATCGGGTGATTGCTCAAG
ATGGTTCGGCCTCAGCCTATTTGAAGCACAATAATGGCCAATATGTGAATCCGCTTGGCCAACATGTGCGGGGCTTCAAT
GATGATCCCGATGATTATAGCTACACGGTTTTAGCGGCAAACCCAACTGCTACCCCAACGAATACCCCAACTAACACGCC
AACGGATACCGCTACGCCGACGGCGACCAATACGCCAACCAATACACCAACCGATACGGCAACGCCAACGGCGAGCAACA
CGCCAACCAATACGCCGACCGATACGGCAACACCAACCAACACGCCAGTGGCTCCAACGGCAACCGATACCGCTACGCCA
ACGGCGACGAACACGCCAACCAATACGCCGACCGATACGGCAACGCCAACCAACACGCCAGTGGCTCCAACGGCAACCGA
TACCGCTACGCCAACGGCGAGCAACACACCAACCAATACGGCTACGCCAACGATCACGGTGACGAGAACACCGACACATA
CGCCAACTAATACAGCAACGCCAACGCGCACGGCGACCAACACGCCAACCAATACGGCGACATCGACGGCGACGAATACG
CCAACCGTCACCAATACGCCAATTGCTCAGCAGCATAAAGTGTTCTTACCATGGGCCAGCAAATAG

Upstream 100 bases:

>100_bases
ATTGATAGGACTAAATAGCATTGACGTTGGGCTAGGCCGCGTGGCAAGATTTGGCTAATCCTTACGTTTTAGCCAAATCT
GTGTCTTGGGGAGTGTCGGT

Downstream 100 bases:

>100_bases
CCCGTTGGTAGCAGCTTAAATGTGATGTTTAATCAAGGGGTTGTGTAATGGCGAAGCACAAGAATGAGGCTGGAACGGTG
TATAATGCACGTTACACAAG

Product: hypothetical protein

Products: NA

Alternate protein names: Outer layer protein B; S-layer protein 1 [H]

Number of amino acids: Translated: 581; Mature: 581

Protein sequence:

>581_residues
MQLRSLRSLLTVDRARVGSLLSVLLVGLLAWMLPQTNAQPAFAASSTVVISQVYGGGGSATATYKSDYVELFNLSGSAVS
LNGLSIQYASSTGNFNGVFALPNATILPGKYYLVQLSLGTGLGDIPTPDAASGTNIAMSATAGKVIIANTTTALGCSTSA
TCTPAQQAQIIDLVGYGTAANYFEGSGPTGAPSNTTSVIRTNPCVDADNNATEFSVGTPNPRNTASPTLSCSAATNTPTN
TPTNTATNTPTSTPIVLGGDNNILWDQLYHSATAVNPQLELVPNESYSFLHSASGTIDETTAVTISALTDALDVQTVSLR
YWDGANSTTIPMTRIKSLSASFRSQPIHSYDLWQASIPAQPIGTSIFYRVIAQDGSASAYLKHNNGQYVNPLGQHVRGFN
DDPDDYSYTVLAANPTATPTNTPTNTPTDTATPTATNTPTNTPTDTATPTASNTPTNTPTDTATPTNTPVAPTATDTATP
TATNTPTNTPTDTATPTNTPVAPTATDTATPTASNTPTNTATPTITVTRTPTHTPTNTATPTRTATNTPTNTATSTATNT
PTVTNTPIAQQHKVFLPWASK

Sequences:

>Translated_581_residues
MQLRSLRSLLTVDRARVGSLLSVLLVGLLAWMLPQTNAQPAFAASSTVVISQVYGGGGSATATYKSDYVELFNLSGSAVS
LNGLSIQYASSTGNFNGVFALPNATILPGKYYLVQLSLGTGLGDIPTPDAASGTNIAMSATAGKVIIANTTTALGCSTSA
TCTPAQQAQIIDLVGYGTAANYFEGSGPTGAPSNTTSVIRTNPCVDADNNATEFSVGTPNPRNTASPTLSCSAATNTPTN
TPTNTATNTPTSTPIVLGGDNNILWDQLYHSATAVNPQLELVPNESYSFLHSASGTIDETTAVTISALTDALDVQTVSLR
YWDGANSTTIPMTRIKSLSASFRSQPIHSYDLWQASIPAQPIGTSIFYRVIAQDGSASAYLKHNNGQYVNPLGQHVRGFN
DDPDDYSYTVLAANPTATPTNTPTNTPTDTATPTATNTPTNTPTDTATPTASNTPTNTPTDTATPTNTPVAPTATDTATP
TATNTPTNTPTDTATPTNTPVAPTATDTATPTASNTPTNTATPTITVTRTPTHTPTNTATPTRTATNTPTNTATSTATNT
PTVTNTPIAQQHKVFLPWASK
>Mature_581_residues
MQLRSLRSLLTVDRARVGSLLSVLLVGLLAWMLPQTNAQPAFAASSTVVISQVYGGGGSATATYKSDYVELFNLSGSAVS
LNGLSIQYASSTGNFNGVFALPNATILPGKYYLVQLSLGTGLGDIPTPDAASGTNIAMSATAGKVIIANTTTALGCSTSA
TCTPAQQAQIIDLVGYGTAANYFEGSGPTGAPSNTTSVIRTNPCVDADNNATEFSVGTPNPRNTASPTLSCSAATNTPTN
TPTNTATNTPTSTPIVLGGDNNILWDQLYHSATAVNPQLELVPNESYSFLHSASGTIDETTAVTISALTDALDVQTVSLR
YWDGANSTTIPMTRIKSLSASFRSQPIHSYDLWQASIPAQPIGTSIFYRVIAQDGSASAYLKHNNGQYVNPLGQHVRGFN
DDPDDYSYTVLAANPTATPTNTPTNTPTDTATPTATNTPTNTPTDTATPTASNTPTNTPTDTATPTNTPVAPTATDTATP
TATNTPTNTPTDTATPTNTPVAPTATDTATPTASNTPTNTATPTITVTRTPTHTPTNTATPTRTATNTPTNTATSTATNT
PTVTNTPIAQQHKVFLPWASK

Specific function: Unknown

COG id: COG2374

COG function: function code R; Predicted extracellular nuclease

Gene ontology:

Cell location: Secreted, cell wall, S-layer [H]

Metaboloic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Contains 3 SLH (S-layer homology) domains [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR008965
- InterPro:   IPR002102
- InterPro:   IPR018452
- InterPro:   IPR001119 [H]

Pfam domain/function: PF00963 Cohesin; PF00395 SLH [H]

EC number: NA

Molecular weight: Translated: 60008; Mature: 60008

Theoretical pI: Translated: 4.77; Mature: 4.77

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.7 %Cys     (Translated Protein)
0.7 %Met     (Translated Protein)
1.4 %Cys+Met (Translated Protein)
0.7 %Cys     (Mature Protein)
0.7 %Met     (Mature Protein)
1.4 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MQLRSLRSLLTVDRARVGSLLSVLLVGLLAWMLPQTNAQPAFAASSTVVISQVYGGGGSA
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEEECCCCCE
TATYKSDYVELFNLSGSAVSLNGLSIQYASSTGNFNGVFALPNATILPGKYYLVQLSLGT
EEEECCCEEEEEECCCCEEEECCEEEEEECCCCCCCEEEECCCCEECCCCEEEEEEEECC
GLGDIPTPDAASGTNIAMSATAGKVIIANTTTALGCSTSATCTPAQQAQIIDLVGYGTAA
CCCCCCCCCCCCCCCEEEEECCCEEEEEECCEEECCCCCCCCCCCCCCEEEEEEECCCCC
NYFEGSGPTGAPSNTTSVIRTNPCVDADNNATEFSVGTPNPRNTASPTLSCSAATNTPTN
CCCCCCCCCCCCCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCCEEEEECCCCCCCC
TPTNTATNTPTSTPIVLGGDNNILWDQLYHSATAVNPQLELVPNESYSFLHSASGTIDET
CCCCCCCCCCCCCCEEECCCCCEEHHHHHHHHHCCCCEEEEECCCCCHHHHCCCCCCCCC
TAVTISALTDALDVQTVSLRYWDGANSTTIPMTRIKSLSASFRSQPIHSYDLWQASIPAQ
CEEEEEHHHCCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCCEECEECCCCCC
PIGTSIFYRVIAQDGSASAYLKHNNGQYVNPLGQHVRGFNDDPDDYSYTVLAANPTATPT
CCCHHEEEEEEECCCCCEEEEECCCCCEECHHHHHHCCCCCCCCCCEEEEEEECCCCCCC
NTPTNTPTDTATPTATNTPTNTPTDTATPTASNTPTNTPTDTATPTNTPVAPTATDTATP
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
TATNTPTNTPTDTATPTNTPVAPTATDTATPTASNTPTNTATPTITVTRTPTHTPTNTAT
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCC
PTRTATNTPTNTATSTATNTPTVTNTPIAQQHKVFLPWASK
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCC
>Mature Secondary Structure
MQLRSLRSLLTVDRARVGSLLSVLLVGLLAWMLPQTNAQPAFAASSTVVISQVYGGGGSA
CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEEECCCCCE
TATYKSDYVELFNLSGSAVSLNGLSIQYASSTGNFNGVFALPNATILPGKYYLVQLSLGT
EEEECCCEEEEEECCCCEEEECCEEEEEECCCCCCCEEEECCCCEECCCCEEEEEEEECC
GLGDIPTPDAASGTNIAMSATAGKVIIANTTTALGCSTSATCTPAQQAQIIDLVGYGTAA
CCCCCCCCCCCCCCCEEEEECCCEEEEEECCEEECCCCCCCCCCCCCCEEEEEEECCCCC
NYFEGSGPTGAPSNTTSVIRTNPCVDADNNATEFSVGTPNPRNTASPTLSCSAATNTPTN
CCCCCCCCCCCCCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCCEEEEECCCCCCCC
TPTNTATNTPTSTPIVLGGDNNILWDQLYHSATAVNPQLELVPNESYSFLHSASGTIDET
CCCCCCCCCCCCCCEEECCCCCEEHHHHHHHHHCCCCEEEEECCCCCHHHHCCCCCCCCC
TAVTISALTDALDVQTVSLRYWDGANSTTIPMTRIKSLSASFRSQPIHSYDLWQASIPAQ
CEEEEEHHHCCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCCEECEECCCCCC
PIGTSIFYRVIAQDGSASAYLKHNNGQYVNPLGQHVRGFNDDPDDYSYTVLAANPTATPT
CCCHHEEEEEEECCCCCEEEEECCCCCEECHHHHHHCCCCCCCCCCEEEEEEECCCCCCC
NTPTNTPTDTATPTATNTPTNTPTDTATPTASNTPTNTPTDTATPTNTPVAPTATDTATP
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
TATNTPTNTPTDTATPTNTPVAPTATDTATPTASNTPTNTATPTITVTRTPTHTPTNTAT
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCC
PTRTATNTPTNTATSTATNTPTVTNTPIAQQHKVFLPWASK
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 8458832 [H]