Definition: Escherichia coli IAI39 chromosome, complete genome.
Accession: NC_011750
Length: 5,132,068


The map label for this gene is ydbA

Identifier: 218700078

GI number: 218700078

Start: 1785061

End: 1790940

Strand: Direct

Name: ydbA

Synonym: ECIAI39_1709

Alternate gene names: 218700078

Gene position: 1785061-1790940 (Clockwise)

Preceding gene: 218700077

Following gene: 218700079

Centisome position: 34.78
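
The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the centisome value is the start coordinate expressed as a percentage of the genome length. That definition is not stated in this record; it is an assumption that happens to reproduce the listed figures. The function names are illustrative only.

```python
GENOME_LENGTH = 5_132_068  # Length field of NC_011750
GENE_START = 1_785_061     # Start field
GENE_END = 1_790_940       # End field


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(GENE_START, GENE_END))               # 5880
print(round(centisome(GENE_START, GENOME_LENGTH), 2))  # 34.78
```

The computed length matches the 5880-base header of the gene sequence given in this record.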

GC content: 51.04
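
A GC percentage like the one above is usually the share of G and C among all called bases. The following is a minimal sketch under that assumption; the record does not say how its value was computed, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / acgt


# Toy input only; substitute the full gene sequence to check the listed value.
print(round(gc_content("ATGCGC"), 2))
```

Line breaks and other non-base characters in a pasted sequence are ignored because only A, C, G and T are counted.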

Gene sequence:

>5880_bases
ATGCAAAAGAAAACGTTATTATCCGCCTGTATCGCGCTGGCTTTAAGCGGATCGGGCTGGGCTGCTGACATTGATGATAC
AGATAGTGCAACTCGTCAGCGTAAGGAGACAAGGATACCCTGTCCGACCGCTCACTCCTCTGAAAAACTAAGTCCGCAAC
AACTAAAATCGCTCCCGTCTGAATGTTCTACAACCAATGACAACAACCTCTATTCCTTGATTGCCGTTGGCGCTACTTCA
CTAATCACCACTCTTGCAGTCCTTGAACTAAACCACGATGACGGTAACCACGCTCATTCCTCTGACAATCCTCCAGTACC
GCCTGATGACGATAATGGCGGCAACACACCGGATGATGGTGGCAATACCCCTGACGATGGCGGCAATACTCCTGACGATG
GCGGCAACACACCGGACGATGGCGGCAACACACCGGATGATGGTGGCAATACCCCTGACGATGGTGGGAATACTCCTGAC
GATGGCGGCAACACACCGGATGATGGTGGGAATACTCCTGACGATGGTGGGAATACTCCTGACGATGGCGGCAACGTCAC
CCCGCCCAAAGAGCCTAAAATCTTCAATAACAATGTCACGTTCGACGAAGATAAAGGCACGCTGAAAATTCGTAACGCCA
CCTTTACCTACAGCAAAAACACCGATGGAACTTATACCCTGACGGCTGGAGATGGCCGGACCACGGTTGTACAAGGCTGG
GATGTCGACACGGCTGCCAATACTGTAGAAATTACTGGCGTGAATACCCAGGGCGGTATGACGTGGCGTTACGGTAAAGA
CGGCATTATCTATATCACCAAAACCGTCGGCGCAACGGTGGATGACCCCGCGAACAGTAACGTATTTAACCTCAGCGATG
CGGTGCTCACTGACCAGGGCGGTAATGCCGCGCGAAATGGCGCAACGGTGATTGAAATTAACGGCAGCAGGATCGTCCTT
AATAACGACGGTGACATTTCTGCTACGGGAAAAGATTCTGTGGTTGTGGCGATGACCGGAAACGACATTACCGTTAACAA
TAACGGCCATATGGTTGTCGATGGCGGCACCGCGGGCGTGGTTAACGGCGATCGTGCGATCCTGAATAACCGTGGTGATG
CGGTGATCACTAACGGGGGCGCGGGTGTCATTGTGACGGGCGACAACGCGGTCATCAACAACACGGGGCAAAGCGATATT
GATGGCGACAACTCGGTATCGGTAAAAGTCGCAGGAAACGCAACCAGAATCAAAATGGAAGGCGGGCTCAACGTCAGCGG
TGGCGCGCATGGCATTGATGCCACCGGTGACAATAACGAAGTCAGCAACAAGGGCAACATTTCGGTGGTGGATGCGCATT
CCATGGGCGTGCTGCTGAACGGCGATCGCGCGTCATTCGTTAATATGGGTGATATTAACGTTAGCGGAGGTGCAGCGGAC
GATCACGCTATTGGTGTGCAGATTAACGGCGATAACAGTACCTTTATCAACGTTGGAGATTTAAATGCCGACGACACGGC
GACCGGGGTGAAAATTACCGGTGACGCCAGCGACATTGCCCTGGCCGGGGCGATGCATGTCGGGAATTTTGCCTCCGGGC
TGGAGGTGACAGGTAAGAACAATGACCTGTCGTTGTCCACCAATATGATGGATGTCACCGGGAAACAGTCCACCGGGGTG
ACCATTACAGGTGATGATAATACCATTGATATCACCGGTGATATGGTCGTTGATCAAAACTCTGTCGGGGCGAAAATTGC
CGGCGATCGCGTTTCGTTGCAACAGAAAGGTGATATTACCGTTAACGGTGCAGGGCACGGCGTTGAAGTCAGCGGTAGTA
AGGCGGCAATCAGTAATCAGGGCAAACTGACCGTAAAAGATCAAGACTCAATCGGTATCGCTATCATCGGGGATGACGCG
CAATTTACCACCGTGGGTGAGATTGATGTCTCGCTGAACGGTACGGGTGTTGCGATCAGCGGCGATCGTGAGCAAGTGAA
TTTGAGCGGTGATATAAATGTCATTCAGGAGCGTGACGGCAGCGGTACTTTCCAGGGCGGAACGGGCATCAGCATAATGG
GTAACGACAGCAGCATGCTGTTGGCAGGAAACATTAATGTGACGTCCAGTATGGGGGACCAACCGTCCACCTCACCCCAG
TCGTTAACCGGTGTTACCATCGGGGGAGAAAACAACACTGTCGATCTGCAAGGCGACATCAATATCACTGTTGATTCGGA
TTTCCTGGAGCCTCAGAATGACCTTTACGGCGTGATGGTCTCGGGTTCAAATAACATCATCAACCTCGATGGTGGTATCA
ATATTTCCGGCGATAGTGGCGGACATTTTATCAAAGGCGTGCAGGTAACCGGGAATAATTCCGTCAATATTAGTGGTCAT
TCTGTAATGAATACCCGACAGGTATTAGGAACCTTCTCGCTGATTTCGGTGGCTGATGGCGGGAATGTCGTATTTGATGA
AAGCGCTGTCACCGATATTCAGAGTTCAACAAGAGATTTCCCAAATTACTTTATCGGTTCGGTAATCATTGCAATGGGAA
GTCAGTCTGCTATTCGAAATAATGGCATTGTTAACACAACCGATGCTCAGGAATTAATGATGGCGAATTCGGGGGGGCAG
GTTGTCAATGCCGGGGAGATTAATATTAGGCCAGACGCTGAATCACATAGTTTCTTTGCTGGAATGGCGGCAAGGGGTGA
TGACTCGCAGGCAAAAAATGTCTCCGGAGGAACAATCAATCTGACATCAAATACACAACCTTATCGGGGGTCAGGAATTG
GCGAGTACCCGGTAAAATGGTACAGCAACACGGGTTATGCCTTGTTAGCCAGTAATTACGGAACGGTAATCAATGAAACA
GGAGCAACCATTAATTTGCATGGTGCGGGAACCTATGGTGTTTCTGCTTCAAAAGGAACGGCGACCAATGCAGGCGAAAT
AAATGTCGACGGTTTTGTACCAACAGTTGATGAAAATGGCTTCATTATTGATGAAACTTACTGGCAGACCAATTCCATAT
ATCTGATGGGCGGTGGGATGTTGGCCGGTTCAACGGATGCCGGCAACGGTGATGCAAAAGCGGTGAATACCGGTACCATC
AACGTTAATAATGAAGGCTTCGGCATGCTGGCAATGAGCGGCGGGACCGTTGTAAACCAGGGGACAATCAATCTCACCAC
CGATGAGGGCGTGACGAAGCAGCAAGATAACCAACTGTTCGCGATGGGTGCAGTTCAAGGTGGTCTGGCGATTAATGACC
AGGATGGGGTTATTAATATTAACACTGATATTGGCCAGGCGTTTTATAAAGATAGCACTGGCACCATTCTCAACTACGGC
AAAATTAATCTTTTCGGTAATCCAATGGATGAAAGTGATTCCCATATGGGCGTTACGCCGGATGACAAAGATATTCTCAG
TGAACTGTCTGGTAGCGGCGAGAGCATCAGCAAAACCACAACTGGCGACGGTTTTATCGCCGTTAATAACCAGGCAAACT
ATGGCGACGAAACGCTTAATGGCGACGTTACAGCAAACGGCTGGATATTCAACCAACCTGATGCCAGCCTGACAATCAAC
GGCGAACTGAGCGTAAACCAGGGGCTGGAAAATAGTGGTCACCTGGATGTTGACAGCATTAACAGCAAAACCACTATCTA
CAACCGTGAAACAGGCAGCATAACAACCGATCTGCTGACGTTGAATGGCGCGGTTTCCTTCTTTAATGAAGGCGAATTTA
GTGGTTCAATTACCGGAAACAGCTATCAACAGAATGTGGTGAATACCGGTGAAATGACGGTGACCGAAGATGGCCACTCA
CTCGTCAACGGAAGTTTCCTGTTCTTCAATGAAGCTGGCGCGACCCTGACAAATAGCGGCAATGCGGTAACGGGCGGGGA
AAACGCGATTATTCATGTAACCAGAACCAGTGATTCTGTTTCGCAGGTTAACCGTGGCACCATCACGGCCACAAATGGCT
ATAGCGCAATCAAAACTGAAAATACTGGCTCGAACAGTAACGGGAAATGGATTTGGAATACGGAAACGGGAGTGATAAAT
GGTATTAATCCTGTTGCTCCGTTGGTTGATTTAAGACGAGGTTATAATTTTTCTAACGCTGGCGTTATTAATGTTCAGGG
CGATAATGCGGTGGGCATCAGTGGCGGCACAACCAGCTATACTGTCAAGTTGGTGAACAGTGGCACCATTAATGTGGGTA
CTGAACAGGGGCAACTGGATGGAACCAACGGCGAAGGCTTAATTGGTATTAAGGGCAATGGTAAGGACACGACAATTAAT
AACACGCAAACGGGTGTGATTAATGTCTATGCCGATAACTCCTGGGCGTTTGGTGGGCAAACGAAAGCCATCATTAACAA
CGGTGAAATTAATCTGCTGTGTGACACCGGATGCGATATTTATGCTCCAGGAACGACAGGGACGCAAAAAGATCACAACG
GTACTGCGGATATCACCGTACCAGAGGCATCAACCACGCCGTCGCAAGGTAATGTTCCAACGCCGCCTGCCGATCCAAAC
GCACCGCAGTTGCTGAGCAACTATACCATCGGTACCAACAGCGACGGGAGTTCGGGTACGCTCAGCGCGAATAATCTGGT
TATTGGCGACAACGTGAGTGTTAACGCCGGGTTTAGTGCCGGAACAGCGGACACCACCGTTGTCGTTAACGATGTATTCA
AAGGCGAAAACATCAGCGGTGTAGATAACATTGTTTCCTCTACGGTCGTCTGGACTGCCAAAGGCAGTACCGACGCCAGC
GGCAACGTTGACGTGACCATGAGCAAAAACGCCTACACCGATGTAGCGACCGATGGCTCAGTGAGTGACGTGGCGAAGGC
ACTTGATGCGGGTTATACCAACAACGAGCTGTACACCAGTCTGAACGTGGGCACCACCGCCGAACTGAACAGCGCGCTGA
AACAAATCAGTGGTAGCCAGGCGACCACCGTATTCCGTGAGGCGCGCGTGTTAAGCAACCGCTTCAGCATGCTGGCGGAT
GCAGCGCCGAAGATGGGCAACGGTCTGGCGTTCAACGTGGTGGCGAAAGGTGATCCGCGTGCCGAACTGGGTAACAATAC
CGAGTACGACATGCTGGCGCTGCGTAAAACAGTTGACCTGAGCGAAAGCCAGACCATGAGCCTGGAATACGGTATCGCGC
GTCTGGATGGTGACGGTGCGCAAAAAGCAGGCGACAACGGAATAACCGGCGGCTACAGCCAGTTCTTTGGCCTGAAACAT
CAGATGTCCTTCGACAATGGCATGAACTGGAATAACGCGCTGCGTTACGATATTCATCAACTGGACAGCAGCCGCTCGGT
GGCTTACGGCGACGTCAGTAAGACGGCGGATACCAACGTGAAACAGCAGTACCTGGAGTTCCGTAGCGAAGGGGCGAAAA
CCTTTGAACCGCGCGAAGGGCTGAAAATCACGCCATATGCGGGACTGAAACTGCGTCACACGCTGGAAGGCGGTTATCAG
GAGCGTAATGCCGGAGACTTTAACCTGAGCATGAACAGCGGCAGCGAAACGGCGATGGACAGCATCGTCGGGCTGAAACT
GGACTACGCAGGCAAAGACGGCTGGAGTGCGAACGCCACTCTGGAAGGTGGGCCGAACCTGAGCTACGCGAAGAGCCAGC
GCACGGCAAGCCTGGCAGGCGCAGGCAGCCAGCACTTTAATGTCGATGATGGACAGAAGGGCGGCAGTATCAACAGCCTG
GCAAGCGTCGGCGTGAAGTACAGCAGCAAAGAGAGCTCGCTGAATCTGGATGCCTATCACTGGAAAGAAGACGGCGTCAG
CGATAAAGGCGTGATGCTCAATTTCAAGAAAACGTTCTAA

Upstream 100 bases:

>100_bases
GCTTCTTAAATAATATATAATCATGACCACTCTTCGTTTATGACGTTTCTGTTACACCCATTTTACTTTTTAAACTGTCT
CTGATTGAGAGGGATTTCTC

Downstream 100 bases:

>100_bases
TTTTTAGTATGTGATGCATAAACCGCAACGCTGATACAGGTTGCGGTTTTTTATTGCCGGATGCGGCGCTTGACGCGTTT
TGTTTTGGTGTCTTCCAGGA
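
Flanking regions like the two above can be sliced out of a genome string once the gene coordinates are known. This sketch assumes a direct-strand gene, 1-based inclusive coordinates, and a linear treatment of the chromosome (no wrap-around at the origin). The function `flanks` and the toy genome are hypothetical.

```python
def flanks(genome: str, start: int, end: int, width: int = 100) -> tuple[str, str]:
    """Return (upstream, downstream) flanks of a direct-strand gene.

    start and end are 1-based inclusive; the genome is treated as linear.
    """
    upstream = genome[max(0, start - 1 - width):start - 1]
    downstream = genome[end:end + width]
    return upstream, downstream


# Toy genome: the "gene" is the run of C's at positions 6-10.
toy = "AAAAACCCCCGGGGGTTTTT"
print(flanks(toy, 6, 10, width=3))  # ('AAA', 'GGG')
```

For a gene on the complementary strand, the two flanks would need to be swapped and reverse-complemented, which this sketch does not do.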

Product: putative cell adhesion autotransported outer membrane protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 1959; Mature: 1959
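
The residue count follows from the gene length: 5880 bases give 1960 codons, and the final codon (TAA) is a stop, leaving 1959 residues. As a hedged illustration, the sketch below translates the first ten codons of the gene sequence with the standard codon assignments. The table construction and the `translate` helper are illustrative and are not the tool used to build this record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCAAAAGAAAACGTTATTATCCGCCTGT"))  # MQKKTLLSAC
print(5880 // 3 - 1)                                # 1959
```

The translated prefix agrees with the first ten residues of the protein sequence listed in this record.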

Protein sequence:

>1959_residues
MQKKTLLSACIALALSGSGWAADIDDTDSATRQRKETRIPCPTAHSSEKLSPQQLKSLPSECSTTNDNNLYSLIAVGATS
LITTLAVLELNHDDGNHAHSSDNPPVPPDDDNGGNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTPD
DGGNTPDDGGNTPDDGGNTPDDGGNVTPPKEPKIFNNNVTFDEDKGTLKIRNATFTYSKNTDGTYTLTAGDGRTTVVQGW
DVDTAANTVEITGVNTQGGMTWRYGKDGIIYITKTVGATVDDPANSNVFNLSDAVLTDQGGNAARNGATVIEINGSRIVL
NNDGDISATGKDSVVVAMTGNDITVNNNGHMVVDGGTAGVVNGDRAILNNRGDAVITNGGAGVIVTGDNAVINNTGQSDI
DGDNSVSVKVAGNATRIKMEGGLNVSGGAHGIDATGDNNEVSNKGNISVVDAHSMGVLLNGDRASFVNMGDINVSGGAAD
DHAIGVQINGDNSTFINVGDLNADDTATGVKITGDASDIALAGAMHVGNFASGLEVTGKNNDLSLSTNMMDVTGKQSTGV
TITGDDNTIDITGDMVVDQNSVGAKIAGDRVSLQQKGDITVNGAGHGVEVSGSKAAISNQGKLTVKDQDSIGIAIIGDDA
QFTTVGEIDVSLNGTGVAISGDREQVNLSGDINVIQERDGSGTFQGGTGISIMGNDSSMLLAGNINVTSSMGDQPSTSPQ
SLTGVTIGGENNTVDLQGDINITVDSDFLEPQNDLYGVMVSGSNNIINLDGGINISGDSGGHFIKGVQVTGNNSVNISGH
SVMNTRQVLGTFSLISVADGGNVVFDESAVTDIQSSTRDFPNYFIGSVIIAMGSQSAIRNNGIVNTTDAQELMMANSGGQ
VVNAGEINIRPDAESHSFFAGMAARGDDSQAKNVSGGTINLTSNTQPYRGSGIGEYPVKWYSNTGYALLASNYGTVINET
GATINLHGAGTYGVSASKGTATNAGEINVDGFVPTVDENGFIIDETYWQTNSIYLMGGGMLAGSTDAGNGDAKAVNTGTI
NVNNEGFGMLAMSGGTVVNQGTINLTTDEGVTKQQDNQLFAMGAVQGGLAINDQDGVININTDIGQAFYKDSTGTILNYG
KINLFGNPMDESDSHMGVTPDDKDILSELSGSGESISKTTTGDGFIAVNNQANYGDETLNGDVTANGWIFNQPDASLTIN
GELSVNQGLENSGHLDVDSINSKTTIYNRETGSITTDLLTLNGAVSFFNEGEFSGSITGNSYQQNVVNTGEMTVTEDGHS
LVNGSFLFFNEAGATLTNSGNAVTGGENAIIHVTRTSDSVSQVNRGTITATNGYSAIKTENTGSNSNGKWIWNTETGVIN
GINPVAPLVDLRRGYNFSNAGVINVQGDNAVGISGGTTSYTVKLVNSGTINVGTEQGQLDGTNGEGLIGIKGNGKDTTIN
NTQTGVINVYADNSWAFGGQTKAIINNGEINLLCDTGCDIYAPGTTGTQKDHNGTADITVPEASTTPSQGNVPTPPADPN
APQLLSNYTIGTNSDGSSGTLSANNLVIGDNVSVNAGFSAGTADTTVVVNDVFKGENISGVDNIVSSTVVWTAKGSTDAS
GNVDVTMSKNAYTDVATDGSVSDVAKALDAGYTNNELYTSLNVGTTAELNSALKQISGSQATTVFREARVLSNRFSMLAD
AAPKMGNGLAFNVVAKGDPRAELGNNTEYDMLALRKTVDLSESQTMSLEYGIARLDGDGAQKAGDNGITGGYSQFFGLKH
QMSFDNGMNWNNALRYDIHQLDSSRSVAYGDVSKTADTNVKQQYLEFRSEGAKTFEPREGLKITPYAGLKLRHTLEGGYQ
ERNAGDFNLSMNSGSETAMDSIVGLKLDYAGKDGWSANATLEGGPNLSYAKSQRTASLAGAGSQHFNVDDGQKGGSINSL
ASVGVKYSSKESSLNLDAYHWKEDGVSDKGVMLNFKKTF

Sequences:

>Translated_1959_residues
MQKKTLLSACIALALSGSGWAADIDDTDSATRQRKETRIPCPTAHSSEKLSPQQLKSLPSECSTTNDNNLYSLIAVGATS
LITTLAVLELNHDDGNHAHSSDNPPVPPDDDNGGNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTPD
DGGNTPDDGGNTPDDGGNTPDDGGNVTPPKEPKIFNNNVTFDEDKGTLKIRNATFTYSKNTDGTYTLTAGDGRTTVVQGW
DVDTAANTVEITGVNTQGGMTWRYGKDGIIYITKTVGATVDDPANSNVFNLSDAVLTDQGGNAARNGATVIEINGSRIVL
NNDGDISATGKDSVVVAMTGNDITVNNNGHMVVDGGTAGVVNGDRAILNNRGDAVITNGGAGVIVTGDNAVINNTGQSDI
DGDNSVSVKVAGNATRIKMEGGLNVSGGAHGIDATGDNNEVSNKGNISVVDAHSMGVLLNGDRASFVNMGDINVSGGAAD
DHAIGVQINGDNSTFINVGDLNADDTATGVKITGDASDIALAGAMHVGNFASGLEVTGKNNDLSLSTNMMDVTGKQSTGV
TITGDDNTIDITGDMVVDQNSVGAKIAGDRVSLQQKGDITVNGAGHGVEVSGSKAAISNQGKLTVKDQDSIGIAIIGDDA
QFTTVGEIDVSLNGTGVAISGDREQVNLSGDINVIQERDGSGTFQGGTGISIMGNDSSMLLAGNINVTSSMGDQPSTSPQ
SLTGVTIGGENNTVDLQGDINITVDSDFLEPQNDLYGVMVSGSNNIINLDGGINISGDSGGHFIKGVQVTGNNSVNISGH
SVMNTRQVLGTFSLISVADGGNVVFDESAVTDIQSSTRDFPNYFIGSVIIAMGSQSAIRNNGIVNTTDAQELMMANSGGQ
VVNAGEINIRPDAESHSFFAGMAARGDDSQAKNVSGGTINLTSNTQPYRGSGIGEYPVKWYSNTGYALLASNYGTVINET
GATINLHGAGTYGVSASKGTATNAGEINVDGFVPTVDENGFIIDETYWQTNSIYLMGGGMLAGSTDAGNGDAKAVNTGTI
NVNNEGFGMLAMSGGTVVNQGTINLTTDEGVTKQQDNQLFAMGAVQGGLAINDQDGVININTDIGQAFYKDSTGTILNYG
KINLFGNPMDESDSHMGVTPDDKDILSELSGSGESISKTTTGDGFIAVNNQANYGDETLNGDVTANGWIFNQPDASLTIN
GELSVNQGLENSGHLDVDSINSKTTIYNRETGSITTDLLTLNGAVSFFNEGEFSGSITGNSYQQNVVNTGEMTVTEDGHS
LVNGSFLFFNEAGATLTNSGNAVTGGENAIIHVTRTSDSVSQVNRGTITATNGYSAIKTENTGSNSNGKWIWNTETGVIN
GINPVAPLVDLRRGYNFSNAGVINVQGDNAVGISGGTTSYTVKLVNSGTINVGTEQGQLDGTNGEGLIGIKGNGKDTTIN
NTQTGVINVYADNSWAFGGQTKAIINNGEINLLCDTGCDIYAPGTTGTQKDHNGTADITVPEASTTPSQGNVPTPPADPN
APQLLSNYTIGTNSDGSSGTLSANNLVIGDNVSVNAGFSAGTADTTVVVNDVFKGENISGVDNIVSSTVVWTAKGSTDAS
GNVDVTMSKNAYTDVATDGSVSDVAKALDAGYTNNELYTSLNVGTTAELNSALKQISGSQATTVFREARVLSNRFSMLAD
AAPKMGNGLAFNVVAKGDPRAELGNNTEYDMLALRKTVDLSESQTMSLEYGIARLDGDGAQKAGDNGITGGYSQFFGLKH
QMSFDNGMNWNNALRYDIHQLDSSRSVAYGDVSKTADTNVKQQYLEFRSEGAKTFEPREGLKITPYAGLKLRHTLEGGYQ
ERNAGDFNLSMNSGSETAMDSIVGLKLDYAGKDGWSANATLEGGPNLSYAKSQRTASLAGAGSQHFNVDDGQKGGSINSL
ASVGVKYSSKESSLNLDAYHWKEDGVSDKGVMLNFKKTF
>Mature_1959_residues
MQKKTLLSACIALALSGSGWAADIDDTDSATRQRKETRIPCPTAHSSEKLSPQQLKSLPSECSTTNDNNLYSLIAVGATS
LITTLAVLELNHDDGNHAHSSDNPPVPPDDDNGGNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTPD
DGGNTPDDGGNTPDDGGNTPDDGGNVTPPKEPKIFNNNVTFDEDKGTLKIRNATFTYSKNTDGTYTLTAGDGRTTVVQGW
DVDTAANTVEITGVNTQGGMTWRYGKDGIIYITKTVGATVDDPANSNVFNLSDAVLTDQGGNAARNGATVIEINGSRIVL
NNDGDISATGKDSVVVAMTGNDITVNNNGHMVVDGGTAGVVNGDRAILNNRGDAVITNGGAGVIVTGDNAVINNTGQSDI
DGDNSVSVKVAGNATRIKMEGGLNVSGGAHGIDATGDNNEVSNKGNISVVDAHSMGVLLNGDRASFVNMGDINVSGGAAD
DHAIGVQINGDNSTFINVGDLNADDTATGVKITGDASDIALAGAMHVGNFASGLEVTGKNNDLSLSTNMMDVTGKQSTGV
TITGDDNTIDITGDMVVDQNSVGAKIAGDRVSLQQKGDITVNGAGHGVEVSGSKAAISNQGKLTVKDQDSIGIAIIGDDA
QFTTVGEIDVSLNGTGVAISGDREQVNLSGDINVIQERDGSGTFQGGTGISIMGNDSSMLLAGNINVTSSMGDQPSTSPQ
SLTGVTIGGENNTVDLQGDINITVDSDFLEPQNDLYGVMVSGSNNIINLDGGINISGDSGGHFIKGVQVTGNNSVNISGH
SVMNTRQVLGTFSLISVADGGNVVFDESAVTDIQSSTRDFPNYFIGSVIIAMGSQSAIRNNGIVNTTDAQELMMANSGGQ
VVNAGEINIRPDAESHSFFAGMAARGDDSQAKNVSGGTINLTSNTQPYRGSGIGEYPVKWYSNTGYALLASNYGTVINET
GATINLHGAGTYGVSASKGTATNAGEINVDGFVPTVDENGFIIDETYWQTNSIYLMGGGMLAGSTDAGNGDAKAVNTGTI
NVNNEGFGMLAMSGGTVVNQGTINLTTDEGVTKQQDNQLFAMGAVQGGLAINDQDGVININTDIGQAFYKDSTGTILNYG
KINLFGNPMDESDSHMGVTPDDKDILSELSGSGESISKTTTGDGFIAVNNQANYGDETLNGDVTANGWIFNQPDASLTIN
GELSVNQGLENSGHLDVDSINSKTTIYNRETGSITTDLLTLNGAVSFFNEGEFSGSITGNSYQQNVVNTGEMTVTEDGHS
LVNGSFLFFNEAGATLTNSGNAVTGGENAIIHVTRTSDSVSQVNRGTITATNGYSAIKTENTGSNSNGKWIWNTETGVIN
GINPVAPLVDLRRGYNFSNAGVINVQGDNAVGISGGTTSYTVKLVNSGTINVGTEQGQLDGTNGEGLIGIKGNGKDTTIN
NTQTGVINVYADNSWAFGGQTKAIINNGEINLLCDTGCDIYAPGTTGTQKDHNGTADITVPEASTTPSQGNVPTPPADPN
APQLLSNYTIGTNSDGSSGTLSANNLVIGDNVSVNAGFSAGTADTTVVVNDVFKGENISGVDNIVSSTVVWTAKGSTDAS
GNVDVTMSKNAYTDVATDGSVSDVAKALDAGYTNNELYTSLNVGTTAELNSALKQISGSQATTVFREARVLSNRFSMLAD
AAPKMGNGLAFNVVAKGDPRAELGNNTEYDMLALRKTVDLSESQTMSLEYGIARLDGDGAQKAGDNGITGGYSQFFGLKH
QMSFDNGMNWNNALRYDIHQLDSSRSVAYGDVSKTADTNVKQQYLEFRSEGAKTFEPREGLKITPYAGLKLRHTLEGGYQ
ERNAGDFNLSMNSGSETAMDSIVGLKLDYAGKDGWSANATLEGGPNLSYAKSQRTASLAGAGSQHFNVDDGQKGGSINSL
ASVGVKYSSKESSLNLDAYHWKEDGVSDKGVMLNFKKTF

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Contains 1 autotransporter (TC 1.B.12) domain [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR005546 [H]

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 202283; Mature: 202283

Theoretical pI: Translated: 4.06; Mature: 4.06

Prosite motif: PS00639 THIOL_PROTEASE_HIS

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
1.9 %Met     (Translated Protein)
2.2 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
1.9 %Met     (Mature Protein)
2.2 %Cys+Met (Mature Protein)
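
Percentages like these are simple residue frequencies. The sketch below assumes each value is the count of one residue divided by the protein length; `residue_percent` is a hypothetical helper and the input is a toy sequence, not the protein from this record.

```python
def residue_percent(protein: str, residue: str) -> float:
    """Percentage of positions in protein occupied by the given residue."""
    if not protein:
        raise ValueError("empty protein sequence")
    return 100.0 * protein.upper().count(residue.upper()) / len(protein)


toy = "MKCAAAAAAA"  # toy 10-residue sequence with one Met and one Cys
cys = residue_percent(toy, "C")
met = residue_percent(toy, "M")
print(cys, met, cys + met)  # 10.0 10.0 20.0
```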

Secondary structure:

>Translated Secondary Structure
MQKKTLLSACIALALSGSGWAADIDDTDSATRQRKETRIPCPTAHSSEKLSPQQLKSLPS
CCHHHHHHHHHHHEECCCCCEECCCCCCHHHHHHHHCCCCCCCCCCCCCCCHHHHHCCCH
ECSTTNDNNLYSLIAVGATSLITTLAVLELNHDDGNHAHSSDNPPVPPDDDNGGNTPDDG
HHCCCCCCCEEEEEEHHHHHHHEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC
GNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTP
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
DDGGNVTPPKEPKIFNNNVTFDEDKGTLKIRNATFTYSKNTDGTYTLTAGDGRTTVVQGW
CCCCCCCCCCCCEEECCCCEEECCCCEEEEEEEEEEEECCCCCEEEEEECCCCEEEEECC
DVDTAANTVEITGVNTQGGMTWRYGKDGIIYITKTVGATVDDPANSNVFNLSDAVLTDQG
CCCCCCCEEEEEEECCCCCEEEEECCCCEEEEEEECCCEECCCCCCCEEECCCEEEECCC
GNAARNGATVIEINGSRIVLNNDGDISATGKDSVVVAMTGNDITVNNNGHMVVDGGTAGV
CCCCCCCCEEEEECCCEEEECCCCCEEECCCCCEEEEEECCEEEECCCCEEEEECCCEEE
VNGDRAILNNRGDAVITNGGAGVIVTGDNAVINNTGQSDIDGDNSVSVKVAGNATRIKME
ECCCEEEECCCCCEEEECCCCEEEEECCCEEEECCCCCCCCCCCEEEEEECCCEEEEEEC
GGLNVSGGAHGIDATGDNNEVSNKGNISVVDAHSMGVLLNGDRASFVNMGDINVSGGAAD
CCEEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCEEEEECCCCEEEEECCEEECCCCCC
DHAIGVQINGDNSTFINVGDLNADDTATGVKITGDASDIALAGAMHVGNFASGLEVTGKN
CCEEEEEEECCCCEEEEECCCCCCCCCCCEEEECCCCCEEEEEEEECCCCCCCCEEECCC
NDLSLSTNMMDVTGKQSTGVTITGDDNTIDITGDMVVDQNSVGAKIAGDRVSLQQKGDIT
CCEEEEEEEEEECCCCCCCEEEECCCCEEEEECCEEEECCCCCCEEECCEEEEEECCCEE
VNGAGHGVEVSGSKAAISNQGKLTVKDQDSIGIAIIGDDAQFTTVGEIDVSLNGTGVAIS
ECCCCCCEEECCCCEEECCCCEEEEECCCCEEEEEEECCCCEEEEEEEEEEECCCEEEEE
GDREQVNLSGDINVIQERDGSGTFQGGTGISIMGNDSSMLLAGNINVTSSMGDQPSTSPQ
CCCCEEEECCCEEEEEECCCCCEECCCCEEEEEECCCCEEEEECCEEECCCCCCCCCCCC
SLTGVTIGGENNTVDLQGDINITVDSDFLEPQNDLYGVMVSGSNNIINLDGGINISGDSG
CEEEEEECCCCCEEEEECCEEEEECCCCCCCCCCEEEEEEECCCCEEECCCCEEECCCCC
GHFIKGVQVTGNNSVNISGHSVMNTRQVLGTFSLISVADGGNVVFDESAVTDIQSSTRDF
CCEEEEEEEECCCEEEECCCCHHHHHHHHEEEEEEEEECCCCEEECCHHHHHHHHHCCCC
PNYFIGSVIIAMGSQSAIRNNGIVNTTDAQELMMANSGGQVVNAGEINIRPDAESHSFFA
CHHHHHHHHEECCCCHHHHCCCCEECCCHHHHEEECCCCCEEECCEEEECCCCCCCCEEE
GMAARGDDSQAKNVSGGTINLTSNTQPYRGSGIGEYPVKWYSNTGYALLASNYGTVINET
EEECCCCCCHHCCCCCCEEEECCCCCCCCCCCCCCCCEEEECCCCEEEEECCCCCEEECC
GATINLHGAGTYGVSASKGTATNAGEINVDGFVPTVDENGFIIDETYWQTNSIYLMGGGM
CCEEEEECCCEECCCCCCCCCCCCCEEEECCEEEEECCCCEEEEECEEECCEEEEEECCE
LAGSTDAGNGDAKAVNTGTINVNNEGFGMLAMSGGTVVNQGTINLTTDEGVTKQQDNQLF
EECCCCCCCCCCEEEEEEEEEECCCCEEEEEECCCEEEECCEEEEECCCCCCCCCCCEEE
AMGAVQGGLAINDQDGVININTDIGQAFYKDSTGTILNYGKINLFGNPMDESDSHMGVTP
EEEEECCCEEECCCCCEEEEECCCCHHEECCCCCCEEEEEEEEEECCCCCCCCCCCCCCC
DDKDILSELSGSGESISKTTTGDGFIAVNNQANYGDETLNGDVTANGWIFNQPDASLTIN
CHHHHHHHHCCCCCCCEEECCCCEEEEECCCCCCCCCCCCCCEEECCEEEECCCCCEEEE
GELSVNQGLENSGHLDVDSINSKTTIYNRETGSITTDLLTLNGAVSFFNEGEFSGSITGN
EEEEECCCCCCCCCEEEECCCCCEEEEECCCCCEEEEEEEECCEEEEECCCCEEEEECCC
SYQQNVVNTGEMTVTEDGHSLVNGSFLFFNEAGATLTNSGNAVTGGENAIIHVTRTSDSV
CHHHCCCCCCCEEEECCCCCEECCEEEEEECCCCEEECCCCEEECCCCEEEEEEECCCHH
SQVNRGTITATNGYSAIKTENTGSNSNGKWIWNTETGVINGINPVAPLVDLRRGYNFSNA
HHCCCCEEEEECCCEEEEECCCCCCCCCCEEEECCCCEEECCCCHHHHHHHHHCCCCCCC
GVINVQGDNAVGISGGTTSYTVKLVNSGTINVGTEQGQLDGTNGEGLIGIKGNGKDTTIN
EEEEECCCCEEEECCCCCEEEEEEEECCEEEECCCCCEECCCCCCEEEEEECCCCCCEEC
NTQTGVINVYADNSWAFGGQTKAIINNGEINLLCDTGCDIYAPGTTGTQKDHNGTADITV
CCCCEEEEEEECCCEEECCCEEEEEECCCEEEEECCCCEEEECCCCCCCCCCCCEEEEEE
PEASTTPSQGNVPTPPADPNAPQLLSNYTIGTNSDGSSGTLSANNLVIGDNVSVNAGFSA
CCCCCCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCEEECCCEEECCCEEEECCCCC
GTADTTVVVNDVFKGENISGVDNIVSSTVVWTAKGSTDASGNVDVTMSKNAYTDVATDGS
CCCCCEEEEEEEECCCCCCCHHHHHCCEEEEEECCCCCCCCCEEEEECCCCCEEECCCCC
VSDVAKALDAGYTNNELYTSLNVGTTAELNSALKQISGSQATTVFREARVLSNRFSMLAD
HHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHH
AAPKMGNGLAFNVVAKGDPRAELGNNTEYDMLALRKTVDLSESQTMSLEYGIARLDGDGA
CCCCCCCCEEEEEEECCCCCHHCCCCCCEEEEEEEECCCCCCCCEEEEEECEEEECCCCC
QKAGDNGITGGYSQFFGLKHQMSFDNGMNWNNALRYDIHQLDSSRSVAYGDVSKTADTNV
CCCCCCCCCCCHHHHHCEEEEECCCCCCCCCCEEEEEHHHCCCCCCEEECCCCCCCCCHH
KQQYLEFRSEGAKTFEPREGLKITPYAGLKLRHTLEGGYQERNAGDFNLSMNSGSETAMD
HHHHHHHHHCCCCCCCCCCCCEECCCCCEEEEEECCCCCCCCCCCEEEEEECCCCHHHHH
SIVGLKLDYAGKDGWSANATLEGGPNLSYAKSQRTASLAGAGSQHFNVDDGQKGGSINSL
HHEEEEEEECCCCCCCCCEEECCCCCCCCHHCCCCEEECCCCCCCCCCCCCCCCCCHHHH
ASVGVKYSSKESSLNLDAYHWKEDGVSDKGVMLNFKKTF
HHHCEEECCCCCCCCEEEEECCCCCCCCCCEEEEEEECC
>Mature Secondary Structure
MQKKTLLSACIALALSGSGWAADIDDTDSATRQRKETRIPCPTAHSSEKLSPQQLKSLPS
CCHHHHHHHHHHHEECCCCCEECCCCCCHHHHHHHHCCCCCCCCCCCCCCCHHHHHCCCH
ECSTTNDNNLYSLIAVGATSLITTLAVLELNHDDGNHAHSSDNPPVPPDDDNGGNTPDDG
HHCCCCCCCEEEEEEHHHHHHHEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC
GNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTPDDGGNTP
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
DDGGNVTPPKEPKIFNNNVTFDEDKGTLKIRNATFTYSKNTDGTYTLTAGDGRTTVVQGW
CCCCCCCCCCCCEEECCCCEEECCCCEEEEEEEEEEEECCCCCEEEEEECCCCEEEEECC
DVDTAANTVEITGVNTQGGMTWRYGKDGIIYITKTVGATVDDPANSNVFNLSDAVLTDQG
CCCCCCCEEEEEEECCCCCEEEEECCCCEEEEEEECCCEECCCCCCCEEECCCEEEECCC
GNAARNGATVIEINGSRIVLNNDGDISATGKDSVVVAMTGNDITVNNNGHMVVDGGTAGV
CCCCCCCCEEEEECCCEEEECCCCCEEECCCCCEEEEEECCEEEECCCCEEEEECCCEEE
VNGDRAILNNRGDAVITNGGAGVIVTGDNAVINNTGQSDIDGDNSVSVKVAGNATRIKME
ECCCEEEECCCCCEEEECCCCEEEEECCCEEEECCCCCCCCCCCEEEEEECCCEEEEEEC
GGLNVSGGAHGIDATGDNNEVSNKGNISVVDAHSMGVLLNGDRASFVNMGDINVSGGAAD
CCEEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCEEEEECCCCEEEEECCEEECCCCCC
DHAIGVQINGDNSTFINVGDLNADDTATGVKITGDASDIALAGAMHVGNFASGLEVTGKN
CCEEEEEEECCCCEEEEECCCCCCCCCCCEEEECCCCCEEEEEEEECCCCCCCCEEECCC
NDLSLSTNMMDVTGKQSTGVTITGDDNTIDITGDMVVDQNSVGAKIAGDRVSLQQKGDIT
CCEEEEEEEEEECCCCCCCEEEECCCCEEEEECCEEEECCCCCCEEECCEEEEEECCCEE
VNGAGHGVEVSGSKAAISNQGKLTVKDQDSIGIAIIGDDAQFTTVGEIDVSLNGTGVAIS
ECCCCCCEEECCCCEEECCCCEEEEECCCCEEEEEEECCCCEEEEEEEEEEECCCEEEEE
GDREQVNLSGDINVIQERDGSGTFQGGTGISIMGNDSSMLLAGNINVTSSMGDQPSTSPQ
CCCCEEEECCCEEEEEECCCCCEECCCCEEEEEECCCCEEEEECCEEECCCCCCCCCCCC
SLTGVTIGGENNTVDLQGDINITVDSDFLEPQNDLYGVMVSGSNNIINLDGGINISGDSG
CEEEEEECCCCCEEEEECCEEEEECCCCCCCCCCEEEEEEECCCCEEECCCCEEECCCCC
GHFIKGVQVTGNNSVNISGHSVMNTRQVLGTFSLISVADGGNVVFDESAVTDIQSSTRDF
CCEEEEEEEECCCEEEECCCCHHHHHHHHEEEEEEEEECCCCEEECCHHHHHHHHHCCCC
PNYFIGSVIIAMGSQSAIRNNGIVNTTDAQELMMANSGGQVVNAGEINIRPDAESHSFFA
CHHHHHHHHEECCCCHHHHCCCCEECCCHHHHEEECCCCCEEECCEEEECCCCCCCCEEE
GMAARGDDSQAKNVSGGTINLTSNTQPYRGSGIGEYPVKWYSNTGYALLASNYGTVINET
EEECCCCCCHHCCCCCCEEEECCCCCCCCCCCCCCCCEEEECCCCEEEEECCCCCEEECC
GATINLHGAGTYGVSASKGTATNAGEINVDGFVPTVDENGFIIDETYWQTNSIYLMGGGM
CCEEEEECCCEECCCCCCCCCCCCCEEEECCEEEEECCCCEEEEECEEECCEEEEEECCE
LAGSTDAGNGDAKAVNTGTINVNNEGFGMLAMSGGTVVNQGTINLTTDEGVTKQQDNQLF
EECCCCCCCCCCEEEEEEEEEECCCCEEEEEECCCEEEECCEEEEECCCCCCCCCCCEEE
AMGAVQGGLAINDQDGVININTDIGQAFYKDSTGTILNYGKINLFGNPMDESDSHMGVTP
EEEEECCCEEECCCCCEEEEECCCCHHEECCCCCCEEEEEEEEEECCCCCCCCCCCCCCC
DDKDILSELSGSGESISKTTTGDGFIAVNNQANYGDETLNGDVTANGWIFNQPDASLTIN
CHHHHHHHHCCCCCCCEEECCCCEEEEECCCCCCCCCCCCCCEEECCEEEECCCCCEEEE
GELSVNQGLENSGHLDVDSINSKTTIYNRETGSITTDLLTLNGAVSFFNEGEFSGSITGN
EEEEECCCCCCCCCEEEECCCCCEEEEECCCCCEEEEEEEECCEEEEECCCCEEEEECCC
SYQQNVVNTGEMTVTEDGHSLVNGSFLFFNEAGATLTNSGNAVTGGENAIIHVTRTSDSV
CHHHCCCCCCCEEEECCCCCEECCEEEEEECCCCEEECCCCEEECCCCEEEEEEECCCHH
SQVNRGTITATNGYSAIKTENTGSNSNGKWIWNTETGVINGINPVAPLVDLRRGYNFSNA
HHCCCCEEEEECCCEEEEECCCCCCCCCCEEEECCCCEEECCCCHHHHHHHHHCCCCCCC
GVINVQGDNAVGISGGTTSYTVKLVNSGTINVGTEQGQLDGTNGEGLIGIKGNGKDTTIN
EEEEECCCCEEEECCCCCEEEEEEEECCEEEECCCCCEECCCCCCEEEEEECCCCCCEEC
NTQTGVINVYADNSWAFGGQTKAIINNGEINLLCDTGCDIYAPGTTGTQKDHNGTADITV
CCCCEEEEEEECCCEEECCCEEEEEECCCEEEEECCCCEEEECCCCCCCCCCCCEEEEEE
PEASTTPSQGNVPTPPADPNAPQLLSNYTIGTNSDGSSGTLSANNLVIGDNVSVNAGFSA
CCCCCCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCEEECCCEEECCCEEEECCCCC
GTADTTVVVNDVFKGENISGVDNIVSSTVVWTAKGSTDASGNVDVTMSKNAYTDVATDGS
CCCCCEEEEEEEECCCCCCCHHHHHCCEEEEEECCCCCCCCCEEEEECCCCCEEECCCCC
VSDVAKALDAGYTNNELYTSLNVGTTAELNSALKQISGSQATTVFREARVLSNRFSMLAD
HHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHH
AAPKMGNGLAFNVVAKGDPRAELGNNTEYDMLALRKTVDLSESQTMSLEYGIARLDGDGA
CCCCCCCCEEEEEEECCCCCHHCCCCCCEEEEEEEECCCCCCCCEEEEEECEEEECCCCC
QKAGDNGITGGYSQFFGLKHQMSFDNGMNWNNALRYDIHQLDSSRSVAYGDVSKTADTNV
CCCCCCCCCCCHHHHHCEEEEECCCCCCCCCCEEEEEHHHCCCCCCEEECCCCCCCCCHH
KQQYLEFRSEGAKTFEPREGLKITPYAGLKLRHTLEGGYQERNAGDFNLSMNSGSETAMD
HHHHHHHHHCCCCCCCCCCCCEECCCCCEEEEEECCCCCCCCCCCEEEEEECCCCHHHHH
SIVGLKLDYAGKDGWSANATLEGGPNLSYAKSQRTASLAGAGSQHFNVDDGQKGGSINSL
HHEEEEEEECCCCCCCCCEEECCCCCCCCHHCCCCEEECCCCCCCCCCCCCCCCCCHHHH
ASVGVKYSSKESSLNLDAYHWKEDGVSDKGVMLNFKKTF
HHHCEEECCCCCCCCEEEEECCCCCCCCCCEEEEEEECC
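
The state strings above can be summarized as per-state percentages. This sketch assumes the common three-state convention (H = helix, E = strand, C = coil), which the record does not define explicitly. The helper name and the toy string are illustrative.

```python
from collections import Counter


def state_percentages(states: str) -> dict[str, float]:
    """Percentage of H, E and C characters in a secondary-structure string."""
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: 100.0 * counts[s] / len(states) for s in "HEC"}


print(state_percentages("CCHHHHEECC"))  # {'H': 40.0, 'E': 20.0, 'C': 40.0}
```

When applying this to the record, only the state lines (not the interleaved residue lines) should be concatenated and passed in.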

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11677609; 1987123 [H]