Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

Click here to switch to the map view.

The map label for this gene is ydbA [H]

Identifier: 157160883

GI number: 157160883

Start: 1488585

End: 1494440

Strand: Direct

Name: ydbA [H]

Synonym: EcHS_A1488

Alternate gene names: 157160883

Gene position: 1488585-1494440 (Clockwise)

Preceding gene: 157160882

Following gene: 157160884

Centisome position: 32.06

GC content: 49.73

Gene sequence:

>5856_bases
ATGCAAAGGAAAACTCTATTGTCGGCCTGTATTGCATTAGCTCTGAGTGGTCAGGGTTGGGCGGCAGATATCACAGAGGT
AGAAACCACCACAGGTGAAAAGAAAAATACCAATGTGACTTGTCCGGCAGACCCAGGAAAACTCAGTCCGGAAGAGCTTA
AACGCTTACCCTCTGAATGCTCTCCTTTAGTCGAACAAAACCTGATGCCATGGCTTGCCACAGGCGCTGCTGCGTTAATC
ACGGCCTTAGCCGTAGTGGAACTAAACGACGATGATGATCATCATCATCGCAACAATTCTCCACTCCCACCGACACCCCC
TGATGATGAATCAGACGACACTCCAGTTCCCCCAAGTCCTGGCGGAGATGAGATAATACCGGACGATCCGGATGATACGC
CTACACCTCCCAAACCGATTTCGTTTAATAATGACGTTATTCTCGATAAAGCAGAAAAAACGTTAACTATTCGCGATTCA
GTTTTTACTTATACCGAGAATGCTGACGGGACTATTTCTCTGCAAGATAGCAATGGTCGTAAGGCAACGATTAATCTTTG
GCAGATTGATGAAGCGAATAACACTGTTGCCCTTGAAGGGGTGAGCGCAGATGGCGCAACGAAGTGGCAATATAATCATA
AAGGTGAGCTTGTTATTACGGGTGATAATGCCACAGTAAACAACAATGGCAAAACCACCGTTGACGGCAAGGATTCCACC
GGTACGGAAATCAACGGTAATAACGGGAAAGTGATTCAGGACGGCGATCTGGATGTCAGCGGCGGCGGTCACGGTATTGA
TATCACCGGTGACAGCGCGACGGTAGATAACAAGGGCACCATGACCGTTACCGATCCGGAGTCCATCGGTATCCAGATTG
ACGGCGACAAGGCGGTTGTTAATAACGAAGGCGAGAGCACCATCACCAACGGCGGCACCGGCACGCAGATTAACGGTGAC
GACGCCACGGCGAATAACAGCGGCAAAACCACCGTTGACGGCAAGGATTCCACTGGCACGGAAATCAACGGTAATAACGG
GAAAGTTATCCAGGACGGCGATCTGGATGTCAGCGGCGGCGGTCACGGTATTGATATCACCGGTGACAGCGCGACGGTAG
ATAACAAGGGCACCATGACCGTTACCGATCCGGAGTCCATCGGTATCCAGATTGACGGCGACAAGGCGGTTGTTAATAAC
GAAGGCGAGAGCACCATCACCAACGGTGGCACCGGCACGCAGATTAACGGTGATGACGCCACGGCAAACAACAACGGCAA
AACCACCGTTGACGGCAAGGATTCCACCGGTACGGAAATTGCTGGCAATAACGGGAAGGTGATTCAGGACGGCGATCTGG
ATGTCAGCGGCGGCGGTCACGGTATTGATATCACCGGCGACAGCGCAACGGTGGATAACAAGGGCACCATGACCGTCACC
GATCCGGAGTCCATCGGTATCCAGGTTGACGGCGACCAGGCCATCGTCAATAACGAAGGCGAGAGCACTATCACCAATGG
CGGCACCGGCACTCAGATCAACGGTAACGACGCCACCGCGAATAACAGTGGAAAAACCACTGTTGATGGAAAAGATTCCA
CGGGTACCAAAATCGCGGGCAATATCGGCATTGTAAATCTGGATGGTAGCCTGACTGTTACAGGCGGTGCGCATGGTGTT
GAGAACATTGGTGACAACGGCACGGTTAACAACAAAGGAGATATTGTTGTTTCCGATACTGGATCGATTGGCGTGCTCAT
CAACGGTGAGGGGGCAACAGTATCCAATACGGGTGATGTTAACGTTAGCAATGAAGCGACAGGGTTCAGCATCACAACCA
ACAGTGGGAAGGTTTCGCTGGCAGGCAGTATGCAGGTTGGCGATTTCTCGACCGGGGTAGATCTTAATGGCAACAATAAC
AGCGTGACGCTGGCGGCAAAAGATCTAAAAGTGGTCGGGCAGAAAGCGACGGGCATAAACGTTTCTGGCGATGCGAATAC
AGTGAATATCACTGGTAACGTTCTGGTTGATAAGGATAAAACCGCAGACAATGCGGCGGAATATTTCTTCGATCCATCCG
TGGGTATCAACGTTTACGGCAGTGATAATAACGTGACGCTGGATGGAAAGTTAACTGTTGTATCAGACAGTGAGGTTACT
TCTCGTCAGAGTAATTTATTTGATGGCAGCGCAGAGAAAACGTCAGGTCTGGTTGTGATTGGCGATGGCAATACCGTTAA
TATGAATGGTGGACTTGAACTGATTGGAGAGAAAAACGCGCTTGCAGATGGGTCGCAGGTTGCTTCCTTGCGCACAGGAT
ATAGTTATACCAGCGTTATTGTCGTTAGTGGTGAGTCGTCGGTATATCTGAATGGAGATACGACAATCAGCGGAGAATTC
CCTCTGGGGTTTGCCGGGGTTATTCGGGTACAGGATAAAGCTTTGCTGGAAATTGGCAGTGGCGCTACGCTAACAATGCA
GGATATTGACAGTTTTGAACATCATGGGACAAGAACCCCAGAACTTACTTATGCTGATTCCGGTGCGAAAATTGTTAATA
AAGGTACTGTTGAAATTCAGAATTTAGGTTTTGCTTTTGTTACTGGTGAAAATACAACAGGTATAAATAGTGGCACGATC
TCGTTATTACAAAATGGTAAAGATCCGGCACCGTCTCCCATTGTTTTACTGGCTACTAACGGAGGGAGCGCCACTAATGC
AGGTACGATCACAGGTAAAGTGACGGAACGACATAGCGTATTTAACAAGTATTCAACGGGCACATCGAATTCATTTATTT
TTAATAACGATGTCAGTAGCATAACAGGGTTAGTCGCTCAATCGAATAGCACAATTATCAATACTGACAGCGGCATCATT
GATTTGTATGGTCGTGGTAGTGTCGGCATGCTTGCTATAGCAGATTCAACAGCAGAAAATCAGGGTAAAATTACACTGGA
TTCTATGTGGGTAGATGCAAATGACACTACCGCAATGCGAGATATAGCTAGCAACAGCGCCATTGACTTCGGTACAGGTG
TGGGAGTTGGTACTGATAGTTATAGTGGTGCAGGGAAAAATGCAACAGCAATTAACCAATTGGGCGGTGTTATAACTATT
TATAACGCCGGCGCAGGTATGGCGGCCTATGGCGCCAGCAATACAGTTATTAACCAGGGGACGATTAACCTCGAAAAAAA
TGGTAATTATGACGATAGTCTGGCAGCAAATACTCTGGTAGGGATGGCTGTTTATGAGCATGGTACTGCTATCAACGACC
AGACGGGTGTTATCAATATCAATGTTGGTACTAGTCAGGCGTTTTATAACGATGGCACAGGAACAATTGTTAACTATGGT
ACAATCTGCACTTTCGGCGTGTGCCAATCGGGGAATGAGTACAATAACACAGATGATTTCACCTCACTGATCTATACCGG
TGGCGATACGATTACACGAAGCGGAGAAACTGTAACGCTAAATAATGCCGGAGAAATGACTGCGCAAATTACCATGAATG
CTGGTGCTGATAGTTCGTTAGTGAACAACACCGGAACTATCAATAAAATCGTGCAGAACGCGGGGGTATTCAATAATAGT
GGCAGTGTAACAGGGCGGATGATGTCGGCTGGCGGGGTCTTTAATAATCAAACTGACGGGGCGATTATGAGAGGTGCTGC
GCTGACAGGTACTGCAGTGGCAAATAACGAAGGAACCTGGAACCTCGGAAGTAGCAGTGAGGGTAACAACACCGGGATGC
TGGAAGTTAATAATAATTCTGCTTTCAATAACCGCGGCGAGTTTATTCTTGATAACGACAAGAATGCTGTGCACATCAAC
CAGTCCGGTACGCTTTATAATACCGGTCACATGAACATCAGTAATTCTTCCCACAACGGAGCCGTTAATATGTGGGGCGG
AAATGGTCGTTTTATCAATGACGGAACGATTGATGTTTCTGCGAAGTCACTGGTAGTCAGCGCTAATAATGCCGGCGATC
AGAATGCCTTCTTCTGGAACCAGGATAACGGGGTCATCAACTTCGATCACGACAGCGCCAGTGCCGTGAAAGCCACCCAC
AGCAACTTTATTGCCCAGAATGACGGCATCATGAACATCAGCGGCACCGGTGCTGTGGCTATGGAAGGTGATAAGAACGC
GCAGCTGGTTAACAATGGCACCATCAACCTCGGTACCGCAGGCACTACTGACACGGATATGATCGGTATGCAACTCAATG
CCAACGCCACGGCGGATGCGGTGATCGAGAACAACGGCACCATCAATATCTTCGCTAATAACTCGTTTGCATTTAGCGTA
CTGGGTACAGTAGGTCATGTGGTTAACAACGGCACGGTGGTGATTGCCGATGGGGTTACGGGTTCTGGCCTGATCAAGCA
GGGCGACAGCATCAATGTTGAAGGTATGAACGGTAACAACGGTAATAGCAGCGAAGTGCATTATGGCGACTATACGTTGC
CGGATGTGCCGAAGCCCAATACGGTTAGTGTAACGTCGGGAAGTGATGAGGCTGGTGGCAGCATGAACAACCTCAACGGC
TATGTCGTCGGTACCAACGTTAACGGCAGCGCCGGGAAGCTGAAGGTTAACAATGCCAGCATGAACGGCGTGGAGATTAA
CACGGGCTTTACCGCTGGTACGGCAGACACCATTGTGAGTTTTGATAACGTAGTGGAAGGTAGCAACCTGACCGACGCTG
ACGCCATCACCTCAACGTCCGTGGTATGGACTGCCAAAGGCAGCACCGATGCCAGCGGTAACGTTGACGTCACCATGAGC
AAAAACGCTTATACCGATGTGGCAACAGATGCCTCGGTGAATGACATCGCGAAAGCACTGGATGCGGGTTACACCAACAA
CGAACTGTTTACCAGCCTGAACGTCGGCACGACTGCTGAACTGAACAGTGCTCTGAAACAGGTCAGCGGTAGCCAGGCGA
CCACGGTATTCCGCGAAGCGCGCGTGTTAAGCAACCGCTTTAGTATGCTGGCAGATGCCGCGCCGAAAGTGGGTAACGGT
CTGGCGTTCAACGTTGTCGCGAAAGGCGATCCGCGTGCCGAGTTAGGTAATAATACCGAATACGACATGCTGGCATTGCG
TAAAACTATCGACCTGAGCGAAAGCCAGACGATGAGTCTGGAGTACGGTATCGCTCGTCTCGATGGTGATGGTGCGCAGA
AAGCGGGTGATAATGGCGTTACAGGCGGTTATAGCCAGTTCTTTGGCCTGAAACATCAGATGTCGTTCGATAACGGCATG
AACTGGAATAACGCCTTGCGTTACGACGTTCACAACCTTGACAGCAGCCGCTCGATTGCATTTGGCAACACGAACAAAAC
GGCTGATACCGACGTGAAACAGCAGTACCTGGAGTTCCGCAGCGAAGGGGCGAAGACTTTCGAACCGAGCGAAGGACTGA
AGGTTACGCCATATGCGGGTGTAAAACTGCGTCACACACTGGAAGGTGGCTATCAGGAGCGCAATGCCGGAGACTTTAAC
CTGAATATGAACAGTGGCAGCGAAACGGCGGTGGACAGCATCGTCGGGCTGAAACTGGACTACGCAGGTAAAGACGGCTG
GAGCGCTAGCGCTACGCTGGAAGGCGGGCCGAACCTGAGCTACGCGAAGAGCCAGCGTACGGCAAGCCTGGCAGGCGCAG
GTAGTCAGCACTTTAACGTCGATGACGGTCAGAAGGGCGGCGGCATCAATAGCCTGACAAGCGTCGGCGTGAAGTACAGC
AGCAAAGAAAGTTCGCTGAATCTGGATGCGTACAACTGGAAAGAGGATGGCATCAGCGATAAAGGCGTGATGCTGAACTT
CAAGAAAACGTTCTAA

Upstream 100 bases:

>100_bases
AATAATCTACTGGCAATATAGGATGTCTTCAATGTTTTAAATAACTAATTGGTCGGGTTAGTGCATCCGGCTTTCTTTAT
ATTCGCCAGAAGGATTTATT

Downstream 100 bases:

>100_bases
TTTTTAGCATGTGATCCCTAAACCGCAACGCTGATACAGGTTGCGGTTTTTTTATTGCCGGATGTGGTACGTGACGCGTT
TTGTTTTGTGTCTTTCAGGA

Product: autotransporter (AT) family porin

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 1951; Mature: 1951

Protein sequence:

>1951_residues
MQRKTLLSACIALALSGQGWAADITEVETTTGEKKNTNVTCPADPGKLSPEELKRLPSECSPLVEQNLMPWLATGAAALI
TALAVVELNDDDDHHHRNNSPLPPTPPDDESDDTPVPPSPGGDEIIPDDPDDTPTPPKPISFNNDVILDKAEKTLTIRDS
VFTYTENADGTISLQDSNGRKATINLWQIDEANNTVALEGVSADGATKWQYNHKGELVITGDNATVNNNGKTTVDGKDST
GTEINGNNGKVIQDGDLDVSGGGHGIDITGDSATVDNKGTMTVTDPESIGIQIDGDKAVVNNEGESTITNGGTGTQINGD
DATANNSGKTTVDGKDSTGTEINGNNGKVIQDGDLDVSGGGHGIDITGDSATVDNKGTMTVTDPESIGIQIDGDKAVVNN
EGESTITNGGTGTQINGDDATANNNGKTTVDGKDSTGTEIAGNNGKVIQDGDLDVSGGGHGIDITGDSATVDNKGTMTVT
DPESIGIQVDGDQAIVNNEGESTITNGGTGTQINGNDATANNSGKTTVDGKDSTGTKIAGNIGIVNLDGSLTVTGGAHGV
ENIGDNGTVNNKGDIVVSDTGSIGVLINGEGATVSNTGDVNVSNEATGFSITTNSGKVSLAGSMQVGDFSTGVDLNGNNN
SVTLAAKDLKVVGQKATGINVSGDANTVNITGNVLVDKDKTADNAAEYFFDPSVGINVYGSDNNVTLDGKLTVVSDSEVT
SRQSNLFDGSAEKTSGLVVIGDGNTVNMNGGLELIGEKNALADGSQVASLRTGYSYTSVIVVSGESSVYLNGDTTISGEF
PLGFAGVIRVQDKALLEIGSGATLTMQDIDSFEHHGTRTPELTYADSGAKIVNKGTVEIQNLGFAFVTGENTTGINSGTI
SLLQNGKDPAPSPIVLLATNGGSATNAGTITGKVTERHSVFNKYSTGTSNSFIFNNDVSSITGLVAQSNSTIINTDSGII
DLYGRGSVGMLAIADSTAENQGKITLDSMWVDANDTTAMRDIASNSAIDFGTGVGVGTDSYSGAGKNATAINQLGGVITI
YNAGAGMAAYGASNTVINQGTINLEKNGNYDDSLAANTLVGMAVYEHGTAINDQTGVININVGTSQAFYNDGTGTIVNYG
TICTFGVCQSGNEYNNTDDFTSLIYTGGDTITRSGETVTLNNAGEMTAQITMNAGADSSLVNNTGTINKIVQNAGVFNNS
GSVTGRMMSAGGVFNNQTDGAIMRGAALTGTAVANNEGTWNLGSSSEGNNTGMLEVNNNSAFNNRGEFILDNDKNAVHIN
QSGTLYNTGHMNISNSSHNGAVNMWGGNGRFINDGTIDVSAKSLVVSANNAGDQNAFFWNQDNGVINFDHDSASAVKATH
SNFIAQNDGIMNISGTGAVAMEGDKNAQLVNNGTINLGTAGTTDTDMIGMQLNANATADAVIENNGTINIFANNSFAFSV
LGTVGHVVNNGTVVIADGVTGSGLIKQGDSINVEGMNGNNGNSSEVHYGDYTLPDVPKPNTVSVTSGSDEAGGSMNNLNG
YVVGTNVNGSAGKLKVNNASMNGVEINTGFTAGTADTIVSFDNVVEGSNLTDADAITSTSVVWTAKGSTDASGNVDVTMS
KNAYTDVATDASVNDIAKALDAGYTNNELFTSLNVGTTAELNSALKQVSGSQATTVFREARVLSNRFSMLADAAPKVGNG
LAFNVVAKGDPRAELGNNTEYDMLALRKTIDLSESQTMSLEYGIARLDGDGAQKAGDNGVTGGYSQFFGLKHQMSFDNGM
NWNNALRYDVHNLDSSRSIAFGNTNKTADTDVKQQYLEFRSEGAKTFEPSEGLKVTPYAGVKLRHTLEGGYQERNAGDFN
LNMNSGSETAVDSIVGLKLDYAGKDGWSASATLEGGPNLSYAKSQRTASLAGAGSQHFNVDDGQKGGGINSLTSVGVKYS
SKESSLNLDAYNWKEDGISDKGVMLNFKKTF

Sequences:

>Translated_1951_residues
MQRKTLLSACIALALSGQGWAADITEVETTTGEKKNTNVTCPADPGKLSPEELKRLPSECSPLVEQNLMPWLATGAAALI
TALAVVELNDDDDHHHRNNSPLPPTPPDDESDDTPVPPSPGGDEIIPDDPDDTPTPPKPISFNNDVILDKAEKTLTIRDS
VFTYTENADGTISLQDSNGRKATINLWQIDEANNTVALEGVSADGATKWQYNHKGELVITGDNATVNNNGKTTVDGKDST
GTEINGNNGKVIQDGDLDVSGGGHGIDITGDSATVDNKGTMTVTDPESIGIQIDGDKAVVNNEGESTITNGGTGTQINGD
DATANNSGKTTVDGKDSTGTEINGNNGKVIQDGDLDVSGGGHGIDITGDSATVDNKGTMTVTDPESIGIQIDGDKAVVNN
EGESTITNGGTGTQINGDDATANNNGKTTVDGKDSTGTEIAGNNGKVIQDGDLDVSGGGHGIDITGDSATVDNKGTMTVT
DPESIGIQVDGDQAIVNNEGESTITNGGTGTQINGNDATANNSGKTTVDGKDSTGTKIAGNIGIVNLDGSLTVTGGAHGV
ENIGDNGTVNNKGDIVVSDTGSIGVLINGEGATVSNTGDVNVSNEATGFSITTNSGKVSLAGSMQVGDFSTGVDLNGNNN
SVTLAAKDLKVVGQKATGINVSGDANTVNITGNVLVDKDKTADNAAEYFFDPSVGINVYGSDNNVTLDGKLTVVSDSEVT
SRQSNLFDGSAEKTSGLVVIGDGNTVNMNGGLELIGEKNALADGSQVASLRTGYSYTSVIVVSGESSVYLNGDTTISGEF
PLGFAGVIRVQDKALLEIGSGATLTMQDIDSFEHHGTRTPELTYADSGAKIVNKGTVEIQNLGFAFVTGENTTGINSGTI
SLLQNGKDPAPSPIVLLATNGGSATNAGTITGKVTERHSVFNKYSTGTSNSFIFNNDVSSITGLVAQSNSTIINTDSGII
DLYGRGSVGMLAIADSTAENQGKITLDSMWVDANDTTAMRDIASNSAIDFGTGVGVGTDSYSGAGKNATAINQLGGVITI
YNAGAGMAAYGASNTVINQGTINLEKNGNYDDSLAANTLVGMAVYEHGTAINDQTGVININVGTSQAFYNDGTGTIVNYG
TICTFGVCQSGNEYNNTDDFTSLIYTGGDTITRSGETVTLNNAGEMTAQITMNAGADSSLVNNTGTINKIVQNAGVFNNS
GSVTGRMMSAGGVFNNQTDGAIMRGAALTGTAVANNEGTWNLGSSSEGNNTGMLEVNNNSAFNNRGEFILDNDKNAVHIN
QSGTLYNTGHMNISNSSHNGAVNMWGGNGRFINDGTIDVSAKSLVVSANNAGDQNAFFWNQDNGVINFDHDSASAVKATH
SNFIAQNDGIMNISGTGAVAMEGDKNAQLVNNGTINLGTAGTTDTDMIGMQLNANATADAVIENNGTINIFANNSFAFSV
LGTVGHVVNNGTVVIADGVTGSGLIKQGDSINVEGMNGNNGNSSEVHYGDYTLPDVPKPNTVSVTSGSDEAGGSMNNLNG
YVVGTNVNGSAGKLKVNNASMNGVEINTGFTAGTADTIVSFDNVVEGSNLTDADAITSTSVVWTAKGSTDASGNVDVTMS
KNAYTDVATDASVNDIAKALDAGYTNNELFTSLNVGTTAELNSALKQVSGSQATTVFREARVLSNRFSMLADAAPKVGNG
LAFNVVAKGDPRAELGNNTEYDMLALRKTIDLSESQTMSLEYGIARLDGDGAQKAGDNGVTGGYSQFFGLKHQMSFDNGM
NWNNALRYDVHNLDSSRSIAFGNTNKTADTDVKQQYLEFRSEGAKTFEPSEGLKVTPYAGVKLRHTLEGGYQERNAGDFN
LNMNSGSETAVDSIVGLKLDYAGKDGWSASATLEGGPNLSYAKSQRTASLAGAGSQHFNVDDGQKGGGINSLTSVGVKYS
SKESSLNLDAYNWKEDGISDKGVMLNFKKTF
>Mature_1951_residues
MQRKTLLSACIALALSGQGWAADITEVETTTGEKKNTNVTCPADPGKLSPEELKRLPSECSPLVEQNLMPWLATGAAALI
TALAVVELNDDDDHHHRNNSPLPPTPPDDESDDTPVPPSPGGDEIIPDDPDDTPTPPKPISFNNDVILDKAEKTLTIRDS
VFTYTENADGTISLQDSNGRKATINLWQIDEANNTVALEGVSADGATKWQYNHKGELVITGDNATVNNNGKTTVDGKDST
GTEINGNNGKVIQDGDLDVSGGGHGIDITGDSATVDNKGTMTVTDPESIGIQIDGDKAVVNNEGESTITNGGTGTQINGD
DATANNSGKTTVDGKDSTGTEINGNNGKVIQDGDLDVSGGGHGIDITGDSATVDNKGTMTVTDPESIGIQIDGDKAVVNN
EGESTITNGGTGTQINGDDATANNNGKTTVDGKDSTGTEIAGNNGKVIQDGDLDVSGGGHGIDITGDSATVDNKGTMTVT
DPESIGIQVDGDQAIVNNEGESTITNGGTGTQINGNDATANNSGKTTVDGKDSTGTKIAGNIGIVNLDGSLTVTGGAHGV
ENIGDNGTVNNKGDIVVSDTGSIGVLINGEGATVSNTGDVNVSNEATGFSITTNSGKVSLAGSMQVGDFSTGVDLNGNNN
SVTLAAKDLKVVGQKATGINVSGDANTVNITGNVLVDKDKTADNAAEYFFDPSVGINVYGSDNNVTLDGKLTVVSDSEVT
SRQSNLFDGSAEKTSGLVVIGDGNTVNMNGGLELIGEKNALADGSQVASLRTGYSYTSVIVVSGESSVYLNGDTTISGEF
PLGFAGVIRVQDKALLEIGSGATLTMQDIDSFEHHGTRTPELTYADSGAKIVNKGTVEIQNLGFAFVTGENTTGINSGTI
SLLQNGKDPAPSPIVLLATNGGSATNAGTITGKVTERHSVFNKYSTGTSNSFIFNNDVSSITGLVAQSNSTIINTDSGII
DLYGRGSVGMLAIADSTAENQGKITLDSMWVDANDTTAMRDIASNSAIDFGTGVGVGTDSYSGAGKNATAINQLGGVITI
YNAGAGMAAYGASNTVINQGTINLEKNGNYDDSLAANTLVGMAVYEHGTAINDQTGVININVGTSQAFYNDGTGTIVNYG
TICTFGVCQSGNEYNNTDDFTSLIYTGGDTITRSGETVTLNNAGEMTAQITMNAGADSSLVNNTGTINKIVQNAGVFNNS
GSVTGRMMSAGGVFNNQTDGAIMRGAALTGTAVANNEGTWNLGSSSEGNNTGMLEVNNNSAFNNRGEFILDNDKNAVHIN
QSGTLYNTGHMNISNSSHNGAVNMWGGNGRFINDGTIDVSAKSLVVSANNAGDQNAFFWNQDNGVINFDHDSASAVKATH
SNFIAQNDGIMNISGTGAVAMEGDKNAQLVNNGTINLGTAGTTDTDMIGMQLNANATADAVIENNGTINIFANNSFAFSV
LGTVGHVVNNGTVVIADGVTGSGLIKQGDSINVEGMNGNNGNSSEVHYGDYTLPDVPKPNTVSVTSGSDEAGGSMNNLNG
YVVGTNVNGSAGKLKVNNASMNGVEINTGFTAGTADTIVSFDNVVEGSNLTDADAITSTSVVWTAKGSTDASGNVDVTMS
KNAYTDVATDASVNDIAKALDAGYTNNELFTSLNVGTTAELNSALKQVSGSQATTVFREARVLSNRFSMLADAAPKVGNG
LAFNVVAKGDPRAELGNNTEYDMLALRKTIDLSESQTMSLEYGIARLDGDGAQKAGDNGVTGGYSQFFGLKHQMSFDNGM
NWNNALRYDVHNLDSSRSIAFGNTNKTADTDVKQQYLEFRSEGAKTFEPSEGLKVTPYAGVKLRHTLEGGYQERNAGDFN
LNMNSGSETAVDSIVGLKLDYAGKDGWSASATLEGGPNLSYAKSQRTASLAGAGSQHFNVDDGQKGGGINSLTSVGVKYS
SKESSLNLDAYNWKEDGISDKGVMLNFKKTF

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metaboloic importance: NA

Operon status: Not Known

Operon components: None

Similarity: Contains 1 autotransporter (TC 1.B.12) domain [H]

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR005546 [H]

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 200896; Mature: 200896

Theoretical pI: Translated: 4.08; Mature: 4.08

Prosite motif: PS00639 THIOL_PROTEASE_HIS

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
1.8 %Met     (Translated Protein)
2.1 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
1.8 %Met     (Mature Protein)
2.1 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MQRKTLLSACIALALSGQGWAADITEVETTTGEKKNTNVTCPADPGKLSPEELKRLPSEC
CCHHHHHHHHHHHEECCCCCEEECEEEECCCCCCCCCEEECCCCCCCCCHHHHHHCHHHH
SPLVEQNLMPWLATGAAALITALAVVELNDDDDHHHRNNSPLPPTPPDDESDDTPVPPSP
HHHHHHCCCCHHHHHHHHHEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
GGDEIIPDDPDDTPTPPKPISFNNDVILDKAEKTLTIRDSVFTYTENADGTISLQDSNGR
CCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCEEEEECEEEEEECCCCCEEEEECCCCC
KATINLWQIDEANNTVALEGVSADGATKWQYNHKGELVITGDNATVNNNGKTTVDGKDST
EEEEEEEEEECCCCEEEEEEECCCCCEEEEECCCCCEEEECCCEEECCCCCEEECCCCCC
GTEINGNNGKVIQDGDLDVSGGGHGIDITGDSATVDNKGTMTVTDPESIGIQIDGDKAVV
CCEEECCCCCEEECCCEEECCCCCEEEECCCCCEECCCCCEEEECHHHEEEEECCCEEEE
NNEGESTITNGGTGTQINGDDATANNSGKTTVDGKDSTGTEINGNNGKVIQDGDLDVSGG
CCCCCCEEECCCCCCEECCCCCCCCCCCCEEECCCCCCCCEEECCCCCEEECCCEEECCC
GHGIDITGDSATVDNKGTMTVTDPESIGIQIDGDKAVVNNEGESTITNGGTGTQINGDDA
CCEEEECCCCCEECCCCCEEEECHHHEEEEECCCEEEECCCCCCEEECCCCCCEECCCCC
TANNNGKTTVDGKDSTGTEIAGNNGKVIQDGDLDVSGGGHGIDITGDSATVDNKGTMTVT
CCCCCCCEEECCCCCCCCEEECCCCCEEECCCEEECCCCCEEEECCCCCEECCCCCEEEE
DPESIGIQVDGDQAIVNNEGESTITNGGTGTQINGNDATANNSGKTTVDGKDSTGTKIAG
CHHHEEEEECCCEEEECCCCCCEEECCCCCCEECCCCCCCCCCCCEEECCCCCCCCEEEC
NIGIVNLDGSLTVTGGAHGVENIGDNGTVNNKGDIVVSDTGSIGVLINGEGATVSNTGDV
CEEEEEECCCEEEECCCCCCCCCCCCCCCCCCCCEEEECCCCEEEEECCCCCEECCCCCE
NVSNEATGFSITTNSGKVSLAGSMQVGDFSTGVDLNGNNNSVTLAAKDLKVVGQKATGIN
EECCCCCCEEEEECCCEEEEEECEEECCCCCCEEECCCCCEEEEEEECEEEECCCCCCEE
VSGDANTVNITGNVLVDKDKTADNAAEYFFDPSVGINVYGSDNNVTLDGKLTVVSDSEVT
ECCCCCEEEEEEEEEEECCCCCCCCHHEEECCCCCEEEEECCCCEEECCEEEEEECCHHH
SRQSNLFDGSAEKTSGLVVIGDGNTVNMNGGLELIGEKNALADGSQVASLRTGYSYTSVI
HHHCCCCCCCCCCCCCEEEEECCCEEEECCCEEEEECCCCCCCCCCEEEEECCCCEEEEE
VVSGESSVYLNGDTTISGEFPLGFAGVIRVQDKALLEIGSGATLTMQDIDSFEHHGTRTP
EEECCCEEEECCCEEEECCCCCCEEEEEEECCCEEEEECCCCEEEHHHHHHHHHCCCCCC
ELTYADSGAKIVNKGTVEIQNLGFAFVTGENTTGINSGTISLLQNGKDPAPSPIVLLATN
EEEECCCCCEEEECCEEEEEECCEEEEECCCCCCCCCCEEEEECCCCCCCCCCEEEEEEC
GGSATNAGTITGKVTERHSVFNKYSTGTSNSFIFNNDVSSITGLVAQSNSTIINTDSGII
CCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCEEEECCHHHHEEEEEECCCEEEECCCCEE
DLYGRGSVGMLAIADSTAENQGKITLDSMWVDANDTTAMRDIASNSAIDFGTGVGVGTDS
EEECCCCEEEEEEECCCCCCCCCEEEEEEEECCCCCHHHHHHHCCCCEECCCCCCCCCCC
YSGAGKNATAINQLGGVITIYNAGAGMAAYGASNTVINQGTINLEKNGNYDDSLAANTLV
CCCCCCCCHHHHHCCCEEEEEECCCCEEEECCCCEEEECCEEEEECCCCCCCHHHHHHEE
GMAVYEHGTAINDQTGVININVGTSQAFYNDGTGTIVNYGTICTFGVCQSGNEYNNTDDF
EEEEEECCCEECCCCCEEEEEECCCCEEEECCCCEEEECCCEEEEEEECCCCCCCCCCCE
TSLIYTGGDTITRSGETVTLNNAGEMTAQITMNAGADSSLVNNTGTINKIVQNAGVFNNS
EEEEEECCCEEECCCCEEEECCCCCEEEEEEEECCCCCCEECCCCHHHHHHHHCCEECCC
GSVTGRMMSAGGVFNNQTDGAIMRGAALTGTAVANNEGTWNLGSSSEGNNTGMLEVNNNS
CCCEEEEEECCCCCCCCCCCEEEECCEEECEEEECCCCCEECCCCCCCCCEEEEEECCCC
AFNNRGEFILDNDKNAVHINQSGTLYNTGHMNISNSSHNGAVNMWGGNGRFINDGTIDVS
CCCCCCCEEEECCCCEEEECCCCCEEECCEEEECCCCCCCEEEEECCCCEEECCCEEEEE
AKSLVVSANNAGDQNAFFWNQDNGVINFDHDSASAVKATHSNFIAQNDGIMNISGTGAVA
EEEEEEEECCCCCCCCEEEECCCCEEEECCCCCCCEEECCCCEEECCCCEEEECCCCEEE
MEGDKNAQLVNNGTINLGTAGTTDTDMIGMQLNANATADAVIENNGTINIFANNSFAFSV
EECCCCCEEEECCEEEECCCCCCCCEEEEEEECCCCCEEEEEECCCEEEEEECCCEEEEE
LGTVGHVVNNGTVVIADGVTGSGLIKQGDSINVEGMNGNNGNSSEVHYGDYTLPDVPKPN
HHHHHHEECCCEEEEECCCCCCCEEECCCEEEEEECCCCCCCCCEEEECCCCCCCCCCCC
TVSVTSGSDEAGGSMNNLNGYVVGTNVNGSAGKLKVNNASMNGVEINTGFTAGTADTIVS
EEEEECCCCCCCCCCCCCCCEEEEECCCCCCCEEEECCCCCCCEEEECCCCCCCHHHEEE
FDNVVEGSNLTDADAITSTSVVWTAKGSTDASGNVDVTMSKNAYTDVATDASVNDIAKAL
HHHEECCCCCCCHHHCCCEEEEEEECCCCCCCCCEEEEECCCCCEEECCCCCHHHHHHHH
DAGYTNNELFTSLNVGTTAELNSALKQVSGSQATTVFREARVLSNRFSMLADAAPKVGNG
HCCCCCCEEEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
LAFNVVAKGDPRAELGNNTEYDMLALRKTIDLSESQTMSLEYGIARLDGDGAQKAGDNGV
EEEEEEECCCCCHHCCCCCCHHEEEEEHHCCCCCCCEEEEEECEEEECCCCCCCCCCCCC
TGGYSQFFGLKHQMSFDNGMNWNNALRYDVHNLDSSRSIAFGNTNKTADTDVKQQYLEFR
CCCHHHHHCEEEEECCCCCCCCCCEEEEEEECCCCCCEEEECCCCCCCCHHHHHHHHHHH
SEGAKTFEPSEGLKVTPYAGVKLRHTLEGGYQERNAGDFNLNMNSGSETAVDSIVGLKLD
HCCCCCCCCCCCEEECCCCCEEEEEECCCCCCCCCCCEEEEEECCCCCHHHHHEEEEEEE
YAGKDGWSASATLEGGPNLSYAKSQRTASLAGAGSQHFNVDDGQKGGGINSLTSVGVKYS
ECCCCCCCEEEEECCCCCCCCHHCCCCEEECCCCCCCCCCCCCCCCCCCCHHHHCCEEEC
SKESSLNLDAYNWKEDGISDKGVMLNFKKTF
CCCCCEEEEEECCCCCCCCCCCEEEEEEECC
>Mature Secondary Structure
MQRKTLLSACIALALSGQGWAADITEVETTTGEKKNTNVTCPADPGKLSPEELKRLPSEC
CCHHHHHHHHHHHEECCCCCEEECEEEECCCCCCCCCEEECCCCCCCCCHHHHHHCHHHH
SPLVEQNLMPWLATGAAALITALAVVELNDDDDHHHRNNSPLPPTPPDDESDDTPVPPSP
HHHHHHCCCCHHHHHHHHHEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
GGDEIIPDDPDDTPTPPKPISFNNDVILDKAEKTLTIRDSVFTYTENADGTISLQDSNGR
CCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCEEEEECEEEEEECCCCCEEEEECCCCC
KATINLWQIDEANNTVALEGVSADGATKWQYNHKGELVITGDNATVNNNGKTTVDGKDST
EEEEEEEEEECCCCEEEEEEECCCCCEEEEECCCCCEEEECCCEEECCCCCEEECCCCCC
GTEINGNNGKVIQDGDLDVSGGGHGIDITGDSATVDNKGTMTVTDPESIGIQIDGDKAVV
CCEEECCCCCEEECCCEEECCCCCEEEECCCCCEECCCCCEEEECHHHEEEEECCCEEEE
NNEGESTITNGGTGTQINGDDATANNSGKTTVDGKDSTGTEINGNNGKVIQDGDLDVSGG
CCCCCCEEECCCCCCEECCCCCCCCCCCCEEECCCCCCCCEEECCCCCEEECCCEEECCC
GHGIDITGDSATVDNKGTMTVTDPESIGIQIDGDKAVVNNEGESTITNGGTGTQINGDDA
CCEEEECCCCCEECCCCCEEEECHHHEEEEECCCEEEECCCCCCEEECCCCCCEECCCCC
TANNNGKTTVDGKDSTGTEIAGNNGKVIQDGDLDVSGGGHGIDITGDSATVDNKGTMTVT
CCCCCCCEEECCCCCCCCEEECCCCCEEECCCEEECCCCCEEEECCCCCEECCCCCEEEE
DPESIGIQVDGDQAIVNNEGESTITNGGTGTQINGNDATANNSGKTTVDGKDSTGTKIAG
CHHHEEEEECCCEEEECCCCCCEEECCCCCCEECCCCCCCCCCCCEEECCCCCCCCEEEC
NIGIVNLDGSLTVTGGAHGVENIGDNGTVNNKGDIVVSDTGSIGVLINGEGATVSNTGDV
CEEEEEECCCEEEECCCCCCCCCCCCCCCCCCCCEEEECCCCEEEEECCCCCEECCCCCE
NVSNEATGFSITTNSGKVSLAGSMQVGDFSTGVDLNGNNNSVTLAAKDLKVVGQKATGIN
EECCCCCCEEEEECCCEEEEEECEEECCCCCCEEECCCCCEEEEEEECEEEECCCCCCEE
VSGDANTVNITGNVLVDKDKTADNAAEYFFDPSVGINVYGSDNNVTLDGKLTVVSDSEVT
ECCCCCEEEEEEEEEEECCCCCCCCHHEEECCCCCEEEEECCCCEEECCEEEEEECCHHH
SRQSNLFDGSAEKTSGLVVIGDGNTVNMNGGLELIGEKNALADGSQVASLRTGYSYTSVI
HHHCCCCCCCCCCCCCEEEEECCCEEEECCCEEEEECCCCCCCCCCEEEEECCCCEEEEE
VVSGESSVYLNGDTTISGEFPLGFAGVIRVQDKALLEIGSGATLTMQDIDSFEHHGTRTP
EEECCCEEEECCCEEEECCCCCCEEEEEEECCCEEEEECCCCEEEHHHHHHHHHCCCCCC
ELTYADSGAKIVNKGTVEIQNLGFAFVTGENTTGINSGTISLLQNGKDPAPSPIVLLATN
EEEECCCCCEEEECCEEEEEECCEEEEECCCCCCCCCCEEEEECCCCCCCCCCEEEEEEC
GGSATNAGTITGKVTERHSVFNKYSTGTSNSFIFNNDVSSITGLVAQSNSTIINTDSGII
CCCCCCCCEEEEEEEHHHHHHHHCCCCCCCCEEEECCHHHHEEEEEECCCEEEECCCCEE
DLYGRGSVGMLAIADSTAENQGKITLDSMWVDANDTTAMRDIASNSAIDFGTGVGVGTDS
EEECCCCEEEEEEECCCCCCCCCEEEEEEEECCCCCHHHHHHHCCCCEECCCCCCCCCCC
YSGAGKNATAINQLGGVITIYNAGAGMAAYGASNTVINQGTINLEKNGNYDDSLAANTLV
CCCCCCCCHHHHHCCCEEEEEECCCCEEEECCCCEEEECCEEEEECCCCCCCHHHHHHEE
GMAVYEHGTAINDQTGVININVGTSQAFYNDGTGTIVNYGTICTFGVCQSGNEYNNTDDF
EEEEEECCCEECCCCCEEEEEECCCCEEEECCCCEEEECCCEEEEEEECCCCCCCCCCCE
TSLIYTGGDTITRSGETVTLNNAGEMTAQITMNAGADSSLVNNTGTINKIVQNAGVFNNS
EEEEEECCCEEECCCCEEEECCCCCEEEEEEEECCCCCCEECCCCHHHHHHHHCCEECCC
GSVTGRMMSAGGVFNNQTDGAIMRGAALTGTAVANNEGTWNLGSSSEGNNTGMLEVNNNS
CCCEEEEEECCCCCCCCCCCEEEECCEEECEEEECCCCCEECCCCCCCCCEEEEEECCCC
AFNNRGEFILDNDKNAVHINQSGTLYNTGHMNISNSSHNGAVNMWGGNGRFINDGTIDVS
CCCCCCCEEEECCCCEEEECCCCCEEECCEEEECCCCCCCEEEEECCCCEEECCCEEEEE
AKSLVVSANNAGDQNAFFWNQDNGVINFDHDSASAVKATHSNFIAQNDGIMNISGTGAVA
EEEEEEEECCCCCCCCEEEECCCCEEEECCCCCCCEEECCCCEEECCCCEEEECCCCEEE
MEGDKNAQLVNNGTINLGTAGTTDTDMIGMQLNANATADAVIENNGTINIFANNSFAFSV
EECCCCCEEEECCEEEECCCCCCCCEEEEEEECCCCCEEEEEECCCEEEEEECCCEEEEE
LGTVGHVVNNGTVVIADGVTGSGLIKQGDSINVEGMNGNNGNSSEVHYGDYTLPDVPKPN
HHHHHHEECCCEEEEECCCCCCCEEECCCEEEEEECCCCCCCCCEEEECCCCCCCCCCCC
TVSVTSGSDEAGGSMNNLNGYVVGTNVNGSAGKLKVNNASMNGVEINTGFTAGTADTIVS
EEEEECCCCCCCCCCCCCCCEEEEECCCCCCCEEEECCCCCCCEEEECCCCCCCHHHEEE
FDNVVEGSNLTDADAITSTSVVWTAKGSTDASGNVDVTMSKNAYTDVATDASVNDIAKAL
HHHEECCCCCCCHHHCCCEEEEEEECCCCCCCCCEEEEECCCCCEEECCCCCHHHHHHHH
DAGYTNNELFTSLNVGTTAELNSALKQVSGSQATTVFREARVLSNRFSMLADAAPKVGNG
HCCCCCCEEEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCC
LAFNVVAKGDPRAELGNNTEYDMLALRKTIDLSESQTMSLEYGIARLDGDGAQKAGDNGV
EEEEEEECCCCCHHCCCCCCHHEEEEEHHCCCCCCCEEEEEECEEEECCCCCCCCCCCCC
TGGYSQFFGLKHQMSFDNGMNWNNALRYDVHNLDSSRSIAFGNTNKTADTDVKQQYLEFR
CCCHHHHHCEEEEECCCCCCCCCCEEEEEEECCCCCCEEEECCCCCCCCHHHHHHHHHHH
SEGAKTFEPSEGLKVTPYAGVKLRHTLEGGYQERNAGDFNLNMNSGSETAVDSIVGLKLD
HCCCCCCCCCCCEEECCCCCEEEEEECCCCCCCCCCCEEEEEECCCCCHHHHHEEEEEEE
YAGKDGWSASATLEGGPNLSYAKSQRTASLAGAGSQHFNVDDGQKGGGINSLTSVGVKYS
ECCCCCCCEEEEECCCCCCCCHHCCCCEEECCCCCCCCCCCCCCCCCCCCHHHHCCEEEC
SKESSLNLDAYNWKEDGISDKGVMLNFKKTF
CCCCCEEEEEECCCCCCCCCCCEEEEEEECC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9097039; 9278503; 1665988 [H]