Definition Ralstonia eutropha JMP134 chromosome 1, complete sequence.
Accession NC_007347
Length 3,806,533

Click here to switch to the map view.

The map label for this gene is fhaB [H]

Identifier: 73539799

GI number: 73539799

Start: 111843

End: 121088

Strand: Direct

Name: fhaB [H]

Synonym: Reut_A0093

Alternate gene names: 73539799

Gene position: 111843-121088 (Clockwise)

Preceding gene: 73539798

Following gene: 73539800

Centisome position: 2.94

GC content: 64.62

Gene sequence:

>9246_bases
ATGAACACAGAGCTTTATCGTCTGGTCTTCAATGTTGCGCGCGGCATGCTGGTCGCCGTGCAGGAATGCGCGAAAGGGCG
TGGCAAGGGGAGCCACACCGGCACGCGCGCGAGCTCCGGTTCAGGTGTTTTCGTGCCGGTGTTCTGGTTCACTGCGCTGA
GCGCAGCGACGATGGGATTGCCGCTCGCGCACCACGCTCAGGCACAGACACTGCCGATCCAGGTCGACAGGGGTGCCACC
GGTGCGCAGCCGTACGTCAGCACAGCAGCAAATGGCACCCCGGTCGTGAATATCGCACCGCCGAACCGCCCCGGTGGCAC
CAGCGTGAACAACTTCATCCAGTACAACGTCGGCCCGTCCGGCGTAGTCGTGAACAACTCCGGCCAGAACAGCCAGACGC
AGATAGCGGGATGGGTGCATGGGAACATGCAACTGGGCAACAACCATGCGGGCACCATCGTGCAGCAGGTGACTGCGCCC
AATCCCTCCCAACTGCTGGGCATGCAGGAGATCGCCGGCAATTCCGCTGCACTCGTCCTGGTCAACCCTGCGGGGATTTA
CTGTTCGGCTTGCGGAACCATCGGGGCTGATCGCTTCACGCTGTCCACGGGCCGTGCGTTGTATGGCCCCGATGGGAGCC
TAGCCGGCTTCGACGTGAGCCAAGGCAATCTCGCCATTGGCGCACAAGGCCTGTCCAGCCCGCAAGCGCAAGTGGACCTG
CTCGCGCGCTCGATCCAGGTCAATGGCGAAGTGTGGTCCAAGTACCTGAACGCCATCGCCGGCGCCAACCAGATTGATGC
CGAGACGCTTGCCGCCACGCCGCAGGCCGGCGCGGGCCGGGCCCCGCAGTTCGCCATTGACGCCTCGGCGCTCGGGTCGA
TGTACGCGGGGGCGGTGCGGCTCGTCGGCACCGAGAAGGGGATCGGCTTCAATATCGGCAATAACATCGTCGCCAGCACG
GGCGATATCGTCCTCGACGTCAACGGCGACGTAAGGATCCTGCCGAGTGCCCGCCTGCAGGCCCAGGGCGCGGCAACCGT
CTCCGGCACCAACCTCGACAATGCCGGCACGGTGACCACGCGGGGCAGGATCACTGCCACAACACCTGGCACGCTGTCCA
ACAGCGGCGTGCTGTCCGCTGGCGGCGACGTGCTGGCCCAAACTAGCCAGCTTGCCAACAGCGGCACCATTGGCGCCGGC
ACCGACGCCAACGGCAACGTGACGCAAGCCGGCACGGCCAACATTGCCGCCAGCGCCGCCATCCAGTCCAGCGGCAGTAT
CCTCGCCGCGGGCGATGCCAACCTGTCGGCGCCGCGCCTGAACCTGAACGGCGGTACGCTGATTGCGCACAACACCGCCA
ACGTGTCGGCCACGGGCGACATCAGCCACCAGGGCGCTCGCCTGGAAGGCAATGCGGTGCAGATCGCCGCCGGCGGCACC
TTCGATAACACCGCCGGCTCGGTGGTCGCGGGTGCCAACGGGGCCACGGTCCAGGCCGCCAGCATCCTGAACCGGAGCGG
CAGCCTCAGCAGCGGCGGCACTCTGGGCGTGAACGCACAGCAGACGCTGGACAATACTGCTGGGACTGTGGCGGGCACAG
GTGCCGCAACCCTCCAGGCGGCCAATACCATCAACCGCGGCGGCAGCCTTGGCTCGACCCAAGGCGCGGTCCAGGTGACC
GGGGCCCTGGATAACACCGGCGGCAAGACCCTGGCGGCCGGCGATCTCTCGGTTTCGGGCGGAGCGATCACGAACGACCA
GGGCCAGATGTCGGGCCAGAACGTACGCCTGGATGCCGGCGCGCAGGCGTTCAGCAACGTCCACGGTACCATCAACGGGG
CGGGCACGACCACGGTCACGGCCGGCAGCGTGCAGAACCAGGGTGGCGCGGTCACCAGTACCGGCACGCTCGACGTACGC
ACCCCGGGCAGTATCGAGAATGCTGGAGGTACCCTGGCGGCCAACGGGGCCACGACGCTTACCGCCGCCAAGGTGAACAA
CCAGGCCGGCACCATCGGCTCGGTCAGCAGCAGCCTGACGGGGAACGGACCGCTGGATAACACGGGCGGCAAGGCGCTAG
CCGGTACCGACGTGGCCATGTCGGGCGGTGCGATCACCAACGACCAGGGCCAGATTGCCGGGCAGTCGGTCACGCTTGAC
GCCGCAGAACAGGCCGTGAGCAATGCCCAGGGTTCCATCGCTGGTGGCATCGGTGCCACCACGATTGTGGCGGCCAGCCT
ACAGAACCAGCAGGGCAGCATCACCAGCGGCGGGACGCTCGCCGTGCAGACGCCGGGCGTGATCGAGAACGCGGGCGGCA
CGCTGGGCGCCACCGGCGCCGCGACGCTCAGCGCCAGCCAGATCAACAACCAGGCCGGCACGCTCGGCTCGGTCAAGGAC
GCATTGACGGTCAACGCGCCGCTGGACAACACCAATGGCAGCGCCACCGCCGGCACGAACCTGACGATTCAGGGCGGCCC
CATTGTGAACGACCACGGCCAGCTCTCGGCCGGGCAGACGCTGAAGCTGGACACCGAGGGCCAGGCGCTGGTGAACACGG
CCGGCACGATGTCAGCCCAGGATATCCAGGCCGATACCGGCACGGTCAACAATCAGGGCGGACTGGTGCTGGCCAAGGGC
ACCTTGCAGGCCAACACCCACGGCCAGGCCTACGACAACAGCCAGGGCGGCCAGACCATCGCGGGCGGTGCCATGACGCT
GACCACGGGCGCGCTCAACAACGCGAATGGCGTCGTCTCCGGCCAGCAGGCGGTGACCGTCAACGGCGCGGCCATGGCCA
ACCAGCAGGGCCAAATTATCGCGGGCGGGCCGCTGGCGGTTACCGGCGACAGCCTGGCCAATGCCGGGGGGCAGGTCGCC
GCGAATGGCGACGTCACGCTGCGCATGGCCAATACGCTGGATAACACCGCAGGCTTTACCCACGCGGGTGGCGCGCTGGA
CGTACAGGCTGCCACGATCCTGAATGCGAACACGCTGGGCGGCACCGATGCCAATCCGCTCGGCATGGAAGGCAGCACCG
TCCAGCTCGTCGCCAGCGCTATCGACAATATCCTCGGCGCGCTGCTCGCCGACACCGCACTGACGGTGACTGCAGCGACG
CTGAACAATACCCAGGGGGAAGTGACCTCGGGCGGCACCGCGCAGCTCAACGTGGACGCCACCACTAATACCCAGGGTCT
GCTGGCCGCCAACCAGAAGCTGGGCGTGACCGGCGCAAGCCTCACCGGCGACGGCACGGTCCAGTCGGCACAGGGTGATG
TGGCCCTGAGCCTCAAGAGCGATTTCAATAACTCCGGCGAGGTCAAGGCGGCCATCAACCTGGCACTGGACACGACCGGC
GATGTGAACAACAGCGGCACCATGCGCGCTGGCAATGGGCTGGACGTGCATGGGCGCAACCTCAACAACACGGGCGAGCT
GTACGGCGCGGTGTCGAACCACTTGCGCGCCGACCAGTCGGTCAACAACTCGGGGCTGATCGACGGCGGCGCGGTCCGCA
TCGACGCTGGCACCACGGTCACTAACGTGGACCGCATCTTTGGCGACACGGTGTCCATCGGCGCCGGCCAGCAAATCCTG
AACGACGTGAATCCGGCGACCGGCCAGGGCGGTGTGATCGCCTCGCGCGTGGGCGACATCAACCTGGGGGCCCCGGACAT
CATCAACCGTGAACACGCGCTGATCTATGCCTCGCAGGACTTGAACGTCGGGGGCGCCCTGGACGCGAATGGCAAGGCCA
CCGGCCAGGCCAACTCACTCACCAATGCCTCGGCGACGATTGATGTGGCACGCGACGCGAACGTCAATGCGACCAGCATC
AACAACCTGAACAACCACTTCGAGACCCAAGTTACTGATACCGGCGTGGTGAATACCGTCACCTACCGCCTACGCGGGTC
CGATCAGGACATTGATCCGACCACGGCAATCTTCTGGGACTGGAAGCGCGGCTCTCCCGATGCGGCTCACCCCGCCACCG
ACCTGGGGTGGCTGTATCAGGACGCCAACGAGCGCGGGGCTTATCGGTGGCTGATCCTGCCCTCGACGCAGTACCCGTTC
TCGCAGTTCGGCCCGCCCTTTGATTGGTCGCGCCTGCCGGATGGCAGCACCGGTCCCAACCGGGGCTACTATGACGCCGT
GGAAGGCGACAACGCGTTCCTCCCCGCCGAGCAATGGACTCCGGTGGGCCTGGCGCTGGCCCAGTTCTCGCAGACGGACA
ATGTGGGCAACGTCGTGGCGGTGACCGATGAGCACTTCTACTACCAGCCCGGGGATGCCATCTGGGACAAACTGGGTGTC
CAGCGTCCCAGTTCCGCGCCGCCGCCATTCCAGGCGGCTTGCGCCAGTGATGCGCCCGCATCGTGTCAGGACGCGTATCA
GGCGTACCAGACCTGGCGCCAGGCCAACTTCGCCCAGTACCAGGCGCTCAACGACAAGATCAAGGCGTTCAACCTGGACT
TTCATTCCCGCGTGGTGAGGGACTTCTACTCGGTCAACGAGCAGACGCAGACGCGAGACGAGACGGTCAAGACAACCGAC
CCGGCGCGCCTGCTGGTCGGTGGCAATGCCACGCTCAACGGGGCCGTGGTCAATGACAAGAGCCAGATTCTCGTCGGTGG
CGACCTGATCGTGCCCGCGCCCGTGGACAACCGGGGTTACACCGGTACGCGCATCGAAACGGTGACCGGTTCCCAGGACT
GGAACTACATCAACTACGGCGTCAACGACCCGGACCGGCGTACCACGCCCGGACCGTTGCCACCGATTAATGTCAACCTG
CCGCTGGTGCTGGCCACGGGCACGTCGCTTGCCAACCAGGGCACCATCGGCCACGACGGCAGTGCGCCAGGCCAAGGTGC
TGGCTTGGGCACGATGGCTGGGGCACAGGGCGCGGGCACGGGTCTGGCACCGCTTGCCACCCAGCCGGTGCTGAAGGAAC
TGGTGCTGCCAACCTCGGGCAACGGGCTCGCGTTCAATGGCCCCGGTGCCCGCCTGGGCGGCGCCACGATCCGCCAGGTC
ACACCGGCCCTGGCCATGCCGCAAAACGCGCTCTTTCACGTCAATACGGCACCAGGCGCGCACTACCTGATCGAGACAGA
TCCGCGCTTCACCGACCAGCGTCAGTGGCTGTCCAGCGACTTCATGCTGTCCCAACTCGGTCAGGACCCGAACAATGTCG
TGAAGCGCCTGGGCGACGGCTTCTACGAAGCACGCCTCGTCGCGGATGCCGTGATGCTGGGCACGGGCCAGCGCTTCGTC
GGCGACTATTCCGACAACGAAGCCCAGTACATCGGCCTGATGAAGGCCGGCGTCACCTTCGCGCAGCAGTTCCACCTGAC
CGTGGGCACCGAGCTGACGCCTGACCAGATGGCCGCACTGACCTCGGACATGGTGTGGCTGGTCGAGAAGACCGTGACCC
TGCCGGATGGAAGCACCCAGAAGGTGCTGGTGCCGCAGGTCTACCTGATGTCGCACGTGGGCGAACTGAAGGCGGACGGC
ACACTGATTTCGGCGAACAACGTGGGCATTCAGACCACGGGCGATGTCAACAACACCGGTACCATCTCCGGGCGCAAGCT
CGCCGTCATCGACGCGCAGAACATCAATAACATCGGTGGCACGCTCAATGGCGGCACGCTGGTCCTGAACGCCCAGCAGG
ACATCAACAACCTGGCGGGCAAGATCACCGGAGGCAACGTAGCGGCCCAGGCCGGCCGCGACATCAATTTCACGACCACC
ACGACCACGGCAACGGGCGTGGCGGGTGAAGCGGTCCACAGTCGCACCGTCATCTCGGGTGTCTCGGAACTCAACGCGGA
CAACGCGACGCTGCTTGCCGGGCGCGACCTCACGGCCACGGCCGCCAGCATTGCGACCACCGGTGACCTCGGACTGGGCG
CCGGGCGCAATGTCAATCTTGGCACGGTTGAGATCGCGGAGCGTCGTGACTCGGTGGCGGATGACAAGAACCGCACGAGC
GTCGCACGTAGCACCGAGATCGGGACGCAGATACAGGCTGGCGGCGATGCCACGCTCCTCGCAGGCCAGGATGTGAACGC
GAAAGCGGCGTATGTGAGCGCTGGAGGGGCCATCGGGGTCGGCGCGGGGCACGACATCAACATCAGAGCCGGACAGGCGT
CGGTGTCGATCCGCGACGAGCAAAGCAGAACCAGCGGCGGGTTCCTGTCATCCCAGTCCACCCATACCATTGACCAGAAG
GCCAGCACCGATGCGGTCGGCAGCACCTTCTCGGGCAACACGGTGGACATGCAGGCCAAGCACGACCTGACCCTCGCCGG
CAGTACGGTGGCGGGCACGAAGGATGTCAACCTGTCGGCTGGCCACAATCTCGAAATCGGGACCACCGAAACCCAGTCGT
CGGCCTACAGCTTCAAGGAAGAGAAGAAGTCGGGATTCGGCGCCACGGGCAATGGCATTTCCTACGGCAGTCGTGACCAG
AAGGACACCACGCACGACGCCGGGACGCAGCAGGTCGGAAGCATGGTGGGCTCGACCGATGGCAGCGTGCACCTCAATGC
TGGCAACACCCTCAGCGTCAAGGGCAGCAGCCTGATCGCCGCGCAGGACATCACCGGCAAGGGTGCAGACATCAACATCG
AGGCTGCCCAGAACGCGCAGCACCACGACGAGACTCACGAAGTGAAGCAAAGCGGCTTCACGCTGGGCGTTGGTGGGACT
GTGGGCCAGGTCATGAGTGCCGCGCAGAAGATCAACAACGCGTCCAAGAGCCAGGATGGGCGCGCCTCAGCGCTGTGGGG
CATCGCGGCGGCGCGGGACGCCTATGATGGCGCCAGCGCCATCGGCTCCATGGTCGGCGGGTCGGGCGGAGGTAGCCCGG
CCAGCGGGCAGCAGCCCAGCGCCACCGTTCAGTTGAGCTGGGGAAGTAGCCAAAGCAAACAGACGCTCACGCAGGACTCG
ACCAGCCACAACGGCTCACGTGTCTCGGCGGGCGGCACGGCATCGTTCCAGGCCACCGGCGTGGATGCGGACGGCAACAA
GACCGCCGGTAACCTGAACGTCATCGGCTCGGATATCAACGCCAGCAAGGTGGCGCTGCAGGCCAAGCACGATGTCAATA
TCGTCTCGGCGACCGACACCGACGAAAGCCACAGCACCAACAAGTCCAGCAGCGTCAGCGTCGGCGTGTCGGTCGGTACG
ACTGGTTACGGCGTATCGGCTTCGGGCTCCATGGCCAAGGGCAACTCCGACAGCACGGGCACGTCGCAGGCGAATAGCCA
CGTGCGCGGCAGCGAGAGCGTCACCATCGTGTCGGAGAACGACACGAATATTCTTGGCGGCACGGTCAGCGGCGGCCATG
TTGCCATGGACGTGGGCGGTAACCTGAACCTCGCCAGCCGGCAGGACACCCAGCAGATGCATGCCGACCAGCAGAGCGCG
AGTGGTGGTGTCAGTTTCAGCACGATGGGAGGGTTTTCAGGAAATGCCTCCTACATGCAGGGCAAGGCGAATGGCAGCTA
TGCCAACGTGGGCGAACAGACCGGCGTCTACGCTGGCCAGGGTGGCTTCGACATCAACGTCAAGGGCAACACCGACCTCA
AGGGCGCGGTGATCGCCAGCGACGCCACCAAGGACAGGAACAACCTGTCCACGGGCACGCTGACCTACAGCGACTTGCAG
AACCACTCGGGCTACAGCGCGACCTCGGTCGGCACGTCGGTCGGCACCAGTCCGAGCGCCATGAGCCCGATGATTCCGCA
GCACAAGAGCCAAAGCGAAAACGGCGTGGCGCAGTCGGCGATAGCGGATGGCGCGATTACGATCAAGGATCAGGCCAAAC
AGTCGCAGGACGTGGCGGGACTGAAGCGCGACACGACCAGCACCAACAGCCAGGTCGGCAACAACCCGAACCTGAAGAAC
GTGCTGGACTCGCAGGCCGACACCATGGCGGCGGCGCAGGCAGCAGGCGCGGCCGTGGCGAGGACCGTTGGTGATATCGC
TCAGTCGAAGCAGGATGACGCCCAGAAGCGTATGGAGACTGCTGGTCAAGCACTGAAGCAGGACCCGAGCCCTGAGAACC
AAGCCGCATTTGATGCTGCCAAGGCAGATTACGAAGGCTGGAACGAAGGCGGCCAGTATCGCGCCGGATTGCAGGCGGCT
GGTGGGGCACTCATCGGTGGCCTTGGGGGTGGCAGTGCGTTGACTGCAGCAGGCGGTGCAGCAGGTGCGGGCGCGGCGTC
GCTGGCGGCAACAAAACTAGAAGACGTGGCAAACAACGTGTCAAAGGCGGTTGGCAGCGACAACCCCGTGTTGAACCAAG
CGATTGGCAACCTCGTCTCCAGCGTTGCCGCTGGGGGGCTTGGGGCACTGGTTGGCGGTGTCGCCGGAGCCGCCACAGGC
GCAAACGTGGACATCTACAATCGCCAGTTGCACCCCGACGAACAAACCCTTGCTAATCGCTTGTCAGCGGCCAGTGGTGG
GAAATACACGCCCCAGCAGGTCGGCGACGCCATGCGTGCTGCGAGCAATGGCAAGTACAAAGAAGACGTCACTGCAGGAA
TGGTGGTGGATGGTGTAAGCAATCCTAATGGCGTCTATGACAATGGAGCAATGTTCACCGTGCCTTCGCAAGACGGACGG
ACATTGGTGCAGACTATTCCGAACTCGGTCGACCCCAACCTTGCGGGATATATCCAGTCGATGACGGGAGGCAAGTCGTC
GCCCTACAGCTGGAATGACGAAACTTTGGGTAAGACGGCGCCTACAATGGCTTCGACGCCTACGAATCCGTTTACACCAG
CGCCCAACGGCTGTATTACGGCTGAGTGCGCGGCGGGCTTGGGGCAGCAAGGGCGTGGACTTATTCCCGACTACGCTACC
GGCGGAGTTTCCGTTCTTTCTGGCTCAGCCTCTGCCACAGTAAATCTTTACGATGGCACAAGTTATGTCGCGGGTGGTGT
CACGCAGAATTTCCCAAGCGTTTCTTGGAAGCCTGGTGTCACTGGGACTGTGGGATGGATTTTCGGCGCGAACGACGCCA
AGGCAGCTAACAGCTTTCTTAACGGCGACGGAAATCAGGCATTTGTATCCATCCCAACACCCTTTAAATTCAACGTGGTT
GGCGCAGTAACACATGCATATGGAGGTTCAACAGCAATTGAATTCGGCCTGGGAAGCCCAGGGACCATCGGATACGGCGT
GACACCGTGGAGCCATGGCGTTCCAGTAACAAACGGAGGGAAATGA

Upstream 100 bases:

>100_bases
TGGCAGACTGGGTCGACATCGCATTCCTCATCGCGATGAACATCGCCAGTCAGTGAAGTCGCCCGACTTCTTTGCGCCTG
AAATACCAATAACGAGCGTC

Downstream 100 bases:

>100_bases
GAATTTATGGTGCGAGGACCAACTGATGAAGAGTATGCACGTTTCACTAAAGGTCAAAGACGTGCTTACTGGGGAAGCGT
GATTATCGTCTCGTCGATAA

Product: filamentous haemagglutinin adhesin HecA 20-repeat-containing protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 3081; Mature: 3081

Protein sequence:

>3081_residues
MNTELYRLVFNVARGMLVAVQECAKGRGKGSHTGTRASSGSGVFVPVFWFTALSAATMGLPLAHHAQAQTLPIQVDRGAT
GAQPYVSTAANGTPVVNIAPPNRPGGTSVNNFIQYNVGPSGVVVNNSGQNSQTQIAGWVHGNMQLGNNHAGTIVQQVTAP
NPSQLLGMQEIAGNSAALVLVNPAGIYCSACGTIGADRFTLSTGRALYGPDGSLAGFDVSQGNLAIGAQGLSSPQAQVDL
LARSIQVNGEVWSKYLNAIAGANQIDAETLAATPQAGAGRAPQFAIDASALGSMYAGAVRLVGTEKGIGFNIGNNIVAST
GDIVLDVNGDVRILPSARLQAQGAATVSGTNLDNAGTVTTRGRITATTPGTLSNSGVLSAGGDVLAQTSQLANSGTIGAG
TDANGNVTQAGTANIAASAAIQSSGSILAAGDANLSAPRLNLNGGTLIAHNTANVSATGDISHQGARLEGNAVQIAAGGT
FDNTAGSVVAGANGATVQAASILNRSGSLSSGGTLGVNAQQTLDNTAGTVAGTGAATLQAANTINRGGSLGSTQGAVQVT
GALDNTGGKTLAAGDLSVSGGAITNDQGQMSGQNVRLDAGAQAFSNVHGTINGAGTTTVTAGSVQNQGGAVTSTGTLDVR
TPGSIENAGGTLAANGATTLTAAKVNNQAGTIGSVSSSLTGNGPLDNTGGKALAGTDVAMSGGAITNDQGQIAGQSVTLD
AAEQAVSNAQGSIAGGIGATTIVAASLQNQQGSITSGGTLAVQTPGVIENAGGTLGATGAATLSASQINNQAGTLGSVKD
ALTVNAPLDNTNGSATAGTNLTIQGGPIVNDHGQLSAGQTLKLDTEGQALVNTAGTMSAQDIQADTGTVNNQGGLVLAKG
TLQANTHGQAYDNSQGGQTIAGGAMTLTTGALNNANGVVSGQQAVTVNGAAMANQQGQIIAGGPLAVTGDSLANAGGQVA
ANGDVTLRMANTLDNTAGFTHAGGALDVQAATILNANTLGGTDANPLGMEGSTVQLVASAIDNILGALLADTALTVTAAT
LNNTQGEVTSGGTAQLNVDATTNTQGLLAANQKLGVTGASLTGDGTVQSAQGDVALSLKSDFNNSGEVKAAINLALDTTG
DVNNSGTMRAGNGLDVHGRNLNNTGELYGAVSNHLRADQSVNNSGLIDGGAVRIDAGTTVTNVDRIFGDTVSIGAGQQIL
NDVNPATGQGGVIASRVGDINLGAPDIINREHALIYASQDLNVGGALDANGKATGQANSLTNASATIDVARDANVNATSI
NNLNNHFETQVTDTGVVNTVTYRLRGSDQDIDPTTAIFWDWKRGSPDAAHPATDLGWLYQDANERGAYRWLILPSTQYPF
SQFGPPFDWSRLPDGSTGPNRGYYDAVEGDNAFLPAEQWTPVGLALAQFSQTDNVGNVVAVTDEHFYYQPGDAIWDKLGV
QRPSSAPPPFQAACASDAPASCQDAYQAYQTWRQANFAQYQALNDKIKAFNLDFHSRVVRDFYSVNEQTQTRDETVKTTD
PARLLVGGNATLNGAVVNDKSQILVGGDLIVPAPVDNRGYTGTRIETVTGSQDWNYINYGVNDPDRRTTPGPLPPINVNL
PLVLATGTSLANQGTIGHDGSAPGQGAGLGTMAGAQGAGTGLAPLATQPVLKELVLPTSGNGLAFNGPGARLGGATIRQV
TPALAMPQNALFHVNTAPGAHYLIETDPRFTDQRQWLSSDFMLSQLGQDPNNVVKRLGDGFYEARLVADAVMLGTGQRFV
GDYSDNEAQYIGLMKAGVTFAQQFHLTVGTELTPDQMAALTSDMVWLVEKTVTLPDGSTQKVLVPQVYLMSHVGELKADG
TLISANNVGIQTTGDVNNTGTISGRKLAVIDAQNINNIGGTLNGGTLVLNAQQDINNLAGKITGGNVAAQAGRDINFTTT
TTTATGVAGEAVHSRTVISGVSELNADNATLLAGRDLTATAASIATTGDLGLGAGRNVNLGTVEIAERRDSVADDKNRTS
VARSTEIGTQIQAGGDATLLAGQDVNAKAAYVSAGGAIGVGAGHDINIRAGQASVSIRDEQSRTSGGFLSSQSTHTIDQK
ASTDAVGSTFSGNTVDMQAKHDLTLAGSTVAGTKDVNLSAGHNLEIGTTETQSSAYSFKEEKKSGFGATGNGISYGSRDQ
KDTTHDAGTQQVGSMVGSTDGSVHLNAGNTLSVKGSSLIAAQDITGKGADINIEAAQNAQHHDETHEVKQSGFTLGVGGT
VGQVMSAAQKINNASKSQDGRASALWGIAAARDAYDGASAIGSMVGGSGGGSPASGQQPSATVQLSWGSSQSKQTLTQDS
TSHNGSRVSAGGTASFQATGVDADGNKTAGNLNVIGSDINASKVALQAKHDVNIVSATDTDESHSTNKSSSVSVGVSVGT
TGYGVSASGSMAKGNSDSTGTSQANSHVRGSESVTIVSENDTNILGGTVSGGHVAMDVGGNLNLASRQDTQQMHADQQSA
SGGVSFSTMGGFSGNASYMQGKANGSYANVGEQTGVYAGQGGFDINVKGNTDLKGAVIASDATKDRNNLSTGTLTYSDLQ
NHSGYSATSVGTSVGTSPSAMSPMIPQHKSQSENGVAQSAIADGAITIKDQAKQSQDVAGLKRDTTSTNSQVGNNPNLKN
VLDSQADTMAAAQAAGAAVARTVGDIAQSKQDDAQKRMETAGQALKQDPSPENQAAFDAAKADYEGWNEGGQYRAGLQAA
GGALIGGLGGGSALTAAGGAAGAGAASLAATKLEDVANNVSKAVGSDNPVLNQAIGNLVSSVAAGGLGALVGGVAGAATG
ANVDIYNRQLHPDEQTLANRLSAASGGKYTPQQVGDAMRAASNGKYKEDVTAGMVVDGVSNPNGVYDNGAMFTVPSQDGR
TLVQTIPNSVDPNLAGYIQSMTGGKSSPYSWNDETLGKTAPTMASTPTNPFTPAPNGCITAECAAGLGQQGRGLIPDYAT
GGVSVLSGSASATVNLYDGTSYVAGGVTQNFPSVSWKPGVTGTVGWIFGANDAKAANSFLNGDGNQAFVSIPTPFKFNVV
GAVTHAYGGSTAIEFGLGSPGTIGYGVTPWSHGVPVTNGGK

Sequences:

>Translated_3081_residues
MNTELYRLVFNVARGMLVAVQECAKGRGKGSHTGTRASSGSGVFVPVFWFTALSAATMGLPLAHHAQAQTLPIQVDRGAT
GAQPYVSTAANGTPVVNIAPPNRPGGTSVNNFIQYNVGPSGVVVNNSGQNSQTQIAGWVHGNMQLGNNHAGTIVQQVTAP
NPSQLLGMQEIAGNSAALVLVNPAGIYCSACGTIGADRFTLSTGRALYGPDGSLAGFDVSQGNLAIGAQGLSSPQAQVDL
LARSIQVNGEVWSKYLNAIAGANQIDAETLAATPQAGAGRAPQFAIDASALGSMYAGAVRLVGTEKGIGFNIGNNIVAST
GDIVLDVNGDVRILPSARLQAQGAATVSGTNLDNAGTVTTRGRITATTPGTLSNSGVLSAGGDVLAQTSQLANSGTIGAG
TDANGNVTQAGTANIAASAAIQSSGSILAAGDANLSAPRLNLNGGTLIAHNTANVSATGDISHQGARLEGNAVQIAAGGT
FDNTAGSVVAGANGATVQAASILNRSGSLSSGGTLGVNAQQTLDNTAGTVAGTGAATLQAANTINRGGSLGSTQGAVQVT
GALDNTGGKTLAAGDLSVSGGAITNDQGQMSGQNVRLDAGAQAFSNVHGTINGAGTTTVTAGSVQNQGGAVTSTGTLDVR
TPGSIENAGGTLAANGATTLTAAKVNNQAGTIGSVSSSLTGNGPLDNTGGKALAGTDVAMSGGAITNDQGQIAGQSVTLD
AAEQAVSNAQGSIAGGIGATTIVAASLQNQQGSITSGGTLAVQTPGVIENAGGTLGATGAATLSASQINNQAGTLGSVKD
ALTVNAPLDNTNGSATAGTNLTIQGGPIVNDHGQLSAGQTLKLDTEGQALVNTAGTMSAQDIQADTGTVNNQGGLVLAKG
TLQANTHGQAYDNSQGGQTIAGGAMTLTTGALNNANGVVSGQQAVTVNGAAMANQQGQIIAGGPLAVTGDSLANAGGQVA
ANGDVTLRMANTLDNTAGFTHAGGALDVQAATILNANTLGGTDANPLGMEGSTVQLVASAIDNILGALLADTALTVTAAT
LNNTQGEVTSGGTAQLNVDATTNTQGLLAANQKLGVTGASLTGDGTVQSAQGDVALSLKSDFNNSGEVKAAINLALDTTG
DVNNSGTMRAGNGLDVHGRNLNNTGELYGAVSNHLRADQSVNNSGLIDGGAVRIDAGTTVTNVDRIFGDTVSIGAGQQIL
NDVNPATGQGGVIASRVGDINLGAPDIINREHALIYASQDLNVGGALDANGKATGQANSLTNASATIDVARDANVNATSI
NNLNNHFETQVTDTGVVNTVTYRLRGSDQDIDPTTAIFWDWKRGSPDAAHPATDLGWLYQDANERGAYRWLILPSTQYPF
SQFGPPFDWSRLPDGSTGPNRGYYDAVEGDNAFLPAEQWTPVGLALAQFSQTDNVGNVVAVTDEHFYYQPGDAIWDKLGV
QRPSSAPPPFQAACASDAPASCQDAYQAYQTWRQANFAQYQALNDKIKAFNLDFHSRVVRDFYSVNEQTQTRDETVKTTD
PARLLVGGNATLNGAVVNDKSQILVGGDLIVPAPVDNRGYTGTRIETVTGSQDWNYINYGVNDPDRRTTPGPLPPINVNL
PLVLATGTSLANQGTIGHDGSAPGQGAGLGTMAGAQGAGTGLAPLATQPVLKELVLPTSGNGLAFNGPGARLGGATIRQV
TPALAMPQNALFHVNTAPGAHYLIETDPRFTDQRQWLSSDFMLSQLGQDPNNVVKRLGDGFYEARLVADAVMLGTGQRFV
GDYSDNEAQYIGLMKAGVTFAQQFHLTVGTELTPDQMAALTSDMVWLVEKTVTLPDGSTQKVLVPQVYLMSHVGELKADG
TLISANNVGIQTTGDVNNTGTISGRKLAVIDAQNINNIGGTLNGGTLVLNAQQDINNLAGKITGGNVAAQAGRDINFTTT
TTTATGVAGEAVHSRTVISGVSELNADNATLLAGRDLTATAASIATTGDLGLGAGRNVNLGTVEIAERRDSVADDKNRTS
VARSTEIGTQIQAGGDATLLAGQDVNAKAAYVSAGGAIGVGAGHDINIRAGQASVSIRDEQSRTSGGFLSSQSTHTIDQK
ASTDAVGSTFSGNTVDMQAKHDLTLAGSTVAGTKDVNLSAGHNLEIGTTETQSSAYSFKEEKKSGFGATGNGISYGSRDQ
KDTTHDAGTQQVGSMVGSTDGSVHLNAGNTLSVKGSSLIAAQDITGKGADINIEAAQNAQHHDETHEVKQSGFTLGVGGT
VGQVMSAAQKINNASKSQDGRASALWGIAAARDAYDGASAIGSMVGGSGGGSPASGQQPSATVQLSWGSSQSKQTLTQDS
TSHNGSRVSAGGTASFQATGVDADGNKTAGNLNVIGSDINASKVALQAKHDVNIVSATDTDESHSTNKSSSVSVGVSVGT
TGYGVSASGSMAKGNSDSTGTSQANSHVRGSESVTIVSENDTNILGGTVSGGHVAMDVGGNLNLASRQDTQQMHADQQSA
SGGVSFSTMGGFSGNASYMQGKANGSYANVGEQTGVYAGQGGFDINVKGNTDLKGAVIASDATKDRNNLSTGTLTYSDLQ
NHSGYSATSVGTSVGTSPSAMSPMIPQHKSQSENGVAQSAIADGAITIKDQAKQSQDVAGLKRDTTSTNSQVGNNPNLKN
VLDSQADTMAAAQAAGAAVARTVGDIAQSKQDDAQKRMETAGQALKQDPSPENQAAFDAAKADYEGWNEGGQYRAGLQAA
GGALIGGLGGGSALTAAGGAAGAGAASLAATKLEDVANNVSKAVGSDNPVLNQAIGNLVSSVAAGGLGALVGGVAGAATG
ANVDIYNRQLHPDEQTLANRLSAASGGKYTPQQVGDAMRAASNGKYKEDVTAGMVVDGVSNPNGVYDNGAMFTVPSQDGR
TLVQTIPNSVDPNLAGYIQSMTGGKSSPYSWNDETLGKTAPTMASTPTNPFTPAPNGCITAECAAGLGQQGRGLIPDYAT
GGVSVLSGSASATVNLYDGTSYVAGGVTQNFPSVSWKPGVTGTVGWIFGANDAKAANSFLNGDGNQAFVSIPTPFKFNVV
GAVTHAYGGSTAIEFGLGSPGTIGYGVTPWSHGVPVTNGGK
>Mature_3081_residues
MNTELYRLVFNVARGMLVAVQECAKGRGKGSHTGTRASSGSGVFVPVFWFTALSAATMGLPLAHHAQAQTLPIQVDRGAT
GAQPYVSTAANGTPVVNIAPPNRPGGTSVNNFIQYNVGPSGVVVNNSGQNSQTQIAGWVHGNMQLGNNHAGTIVQQVTAP
NPSQLLGMQEIAGNSAALVLVNPAGIYCSACGTIGADRFTLSTGRALYGPDGSLAGFDVSQGNLAIGAQGLSSPQAQVDL
LARSIQVNGEVWSKYLNAIAGANQIDAETLAATPQAGAGRAPQFAIDASALGSMYAGAVRLVGTEKGIGFNIGNNIVAST
GDIVLDVNGDVRILPSARLQAQGAATVSGTNLDNAGTVTTRGRITATTPGTLSNSGVLSAGGDVLAQTSQLANSGTIGAG
TDANGNVTQAGTANIAASAAIQSSGSILAAGDANLSAPRLNLNGGTLIAHNTANVSATGDISHQGARLEGNAVQIAAGGT
FDNTAGSVVAGANGATVQAASILNRSGSLSSGGTLGVNAQQTLDNTAGTVAGTGAATLQAANTINRGGSLGSTQGAVQVT
GALDNTGGKTLAAGDLSVSGGAITNDQGQMSGQNVRLDAGAQAFSNVHGTINGAGTTTVTAGSVQNQGGAVTSTGTLDVR
TPGSIENAGGTLAANGATTLTAAKVNNQAGTIGSVSSSLTGNGPLDNTGGKALAGTDVAMSGGAITNDQGQIAGQSVTLD
AAEQAVSNAQGSIAGGIGATTIVAASLQNQQGSITSGGTLAVQTPGVIENAGGTLGATGAATLSASQINNQAGTLGSVKD
ALTVNAPLDNTNGSATAGTNLTIQGGPIVNDHGQLSAGQTLKLDTEGQALVNTAGTMSAQDIQADTGTVNNQGGLVLAKG
TLQANTHGQAYDNSQGGQTIAGGAMTLTTGALNNANGVVSGQQAVTVNGAAMANQQGQIIAGGPLAVTGDSLANAGGQVA
ANGDVTLRMANTLDNTAGFTHAGGALDVQAATILNANTLGGTDANPLGMEGSTVQLVASAIDNILGALLADTALTVTAAT
LNNTQGEVTSGGTAQLNVDATTNTQGLLAANQKLGVTGASLTGDGTVQSAQGDVALSLKSDFNNSGEVKAAINLALDTTG
DVNNSGTMRAGNGLDVHGRNLNNTGELYGAVSNHLRADQSVNNSGLIDGGAVRIDAGTTVTNVDRIFGDTVSIGAGQQIL
NDVNPATGQGGVIASRVGDINLGAPDIINREHALIYASQDLNVGGALDANGKATGQANSLTNASATIDVARDANVNATSI
NNLNNHFETQVTDTGVVNTVTYRLRGSDQDIDPTTAIFWDWKRGSPDAAHPATDLGWLYQDANERGAYRWLILPSTQYPF
SQFGPPFDWSRLPDGSTGPNRGYYDAVEGDNAFLPAEQWTPVGLALAQFSQTDNVGNVVAVTDEHFYYQPGDAIWDKLGV
QRPSSAPPPFQAACASDAPASCQDAYQAYQTWRQANFAQYQALNDKIKAFNLDFHSRVVRDFYSVNEQTQTRDETVKTTD
PARLLVGGNATLNGAVVNDKSQILVGGDLIVPAPVDNRGYTGTRIETVTGSQDWNYINYGVNDPDRRTTPGPLPPINVNL
PLVLATGTSLANQGTIGHDGSAPGQGAGLGTMAGAQGAGTGLAPLATQPVLKELVLPTSGNGLAFNGPGARLGGATIRQV
TPALAMPQNALFHVNTAPGAHYLIETDPRFTDQRQWLSSDFMLSQLGQDPNNVVKRLGDGFYEARLVADAVMLGTGQRFV
GDYSDNEAQYIGLMKAGVTFAQQFHLTVGTELTPDQMAALTSDMVWLVEKTVTLPDGSTQKVLVPQVYLMSHVGELKADG
TLISANNVGIQTTGDVNNTGTISGRKLAVIDAQNINNIGGTLNGGTLVLNAQQDINNLAGKITGGNVAAQAGRDINFTTT
TTTATGVAGEAVHSRTVISGVSELNADNATLLAGRDLTATAASIATTGDLGLGAGRNVNLGTVEIAERRDSVADDKNRTS
VARSTEIGTQIQAGGDATLLAGQDVNAKAAYVSAGGAIGVGAGHDINIRAGQASVSIRDEQSRTSGGFLSSQSTHTIDQK
ASTDAVGSTFSGNTVDMQAKHDLTLAGSTVAGTKDVNLSAGHNLEIGTTETQSSAYSFKEEKKSGFGATGNGISYGSRDQ
KDTTHDAGTQQVGSMVGSTDGSVHLNAGNTLSVKGSSLIAAQDITGKGADINIEAAQNAQHHDETHEVKQSGFTLGVGGT
VGQVMSAAQKINNASKSQDGRASALWGIAAARDAYDGASAIGSMVGGSGGGSPASGQQPSATVQLSWGSSQSKQTLTQDS
TSHNGSRVSAGGTASFQATGVDADGNKTAGNLNVIGSDINASKVALQAKHDVNIVSATDTDESHSTNKSSSVSVGVSVGT
TGYGVSASGSMAKGNSDSTGTSQANSHVRGSESVTIVSENDTNILGGTVSGGHVAMDVGGNLNLASRQDTQQMHADQQSA
SGGVSFSTMGGFSGNASYMQGKANGSYANVGEQTGVYAGQGGFDINVKGNTDLKGAVIASDATKDRNNLSTGTLTYSDLQ
NHSGYSATSVGTSVGTSPSAMSPMIPQHKSQSENGVAQSAIADGAITIKDQAKQSQDVAGLKRDTTSTNSQVGNNPNLKN
VLDSQADTMAAAQAAGAAVARTVGDIAQSKQDDAQKRMETAGQALKQDPSPENQAAFDAAKADYEGWNEGGQYRAGLQAA
GGALIGGLGGGSALTAAGGAAGAGAASLAATKLEDVANNVSKAVGSDNPVLNQAIGNLVSSVAAGGLGALVGGVAGAATG
ANVDIYNRQLHPDEQTLANRLSAASGGKYTPQQVGDAMRAASNGKYKEDVTAGMVVDGVSNPNGVYDNGAMFTVPSQDGR
TLVQTIPNSVDPNLAGYIQSMTGGKSSPYSWNDETLGKTAPTMASTPTNPFTPAPNGCITAECAAGLGQQGRGLIPDYAT
GGVSVLSGSASATVNLYDGTSYVAGGVTQNFPSVSWKPGVTGTVGWIFGANDAKAANSFLNGDGNQAFVSIPTPFKFNVV
GAVTHAYGGSTAIEFGLGSPGTIGYGVTPWSHGVPVTNGGK

Specific function: Evidence for a role in host-cell binding and infection [H]

COG id: COG3210

COG function: function code U; Large exoproteins involved in heme utilization or adhesion

Gene ontology:

Cell location: Cell surface [H]

Metaboloic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR010069
- InterPro:   IPR008619
- InterPro:   IPR008638
- InterPro:   IPR012334
- InterPro:   IPR011050
- InterPro:   IPR011102 [H]

Pfam domain/function: PF05594 Fil_haemagg; PF05860 Haemagg_act [H]

EC number: NA

Molecular weight: Translated: 309325; Mature: 309325

Theoretical pI: Translated: 4.70; Mature: 4.70

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.2 %Cys     (Translated Protein)
1.3 %Met     (Translated Protein)
1.5 %Cys+Met (Translated Protein)
0.2 %Cys     (Mature Protein)
1.3 %Met     (Mature Protein)
1.5 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MNTELYRLVFNVARGMLVAVQECAKGRGKGSHTGTRASSGSGVFVPVFWFTALSAATMGL
CCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEHHHHHHHHHHHHHCC
PLAHHAQAQTLPIQVDRGATGAQPYVSTAANGTPVVNIAPPNRPGGTSVNNFIQYNVGPS
CCHHCCCCCEEEEEECCCCCCCCCCCEECCCCCEEEEECCCCCCCCCCCCCEEEECCCCC
GVVVNNSGQNSQTQIAGWVHGNMQLGNNHAGTIVQQVTAPNPSQLLGMQEIAGNSAALVL
CEEEECCCCCCCEEEEEEEECCEEECCCCCCEEEEEECCCCHHHHHHHHHHCCCCEEEEE
VNPAGIYCSACGTIGADRFTLSTGRALYGPDGSLAGFDVSQGNLAIGAQGLSSPQAQVDL
ECCCCEEEECCCCCCCCEEEEECCCEEECCCCCEEEEEECCCCEEEECCCCCCCHHHHHH
LARSIQVNGEVWSKYLNAIAGANQIDAETLAATPQAGAGRAPQFAIDASALGSMYAGAVR
HHHHHCCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCEEECHHHHHHHHHCEEE
LVGTEKGIGFNIGNNIVASTGDIVLDVNGDVRILPSARLQAQGAATVSGTNLDNAGTVTT
EEECCCCCCEECCCCEEECCCCEEEECCCCEEECCCCCEECCCCEEEECCCCCCCCEEEE
RGRITATTPGTLSNSGVLSAGGDVLAQTSQLANSGTIGAGTDANGNVTQAGTANIAASAA
CCEEEEECCCCCCCCCEEECCCHHHHHHHHHCCCCCCCCCCCCCCCEEECCCCCCEEHHE
IQSSGSILAAGDANLSAPRLNLNGGTLIAHNTANVSATGDISHQGARLEGNAVQIAAGGT
ECCCCCEEEECCCCCCCCEEECCCCEEEEECCCCEEECCCCCCCCCEECCCEEEEEECCC
FDNTAGSVVAGANGATVQAASILNRSGSLSSGGTLGVNAQQTLDNTAGTVAGTGAATLQA
CCCCCCCEEECCCCCEEEHHHHHCCCCCCCCCCEECCCHHHHHCCCCCCEECCCCHHHHH
ANTINRGGSLGSTQGAVQVTGALDNTGGKTLAAGDLSVSGGAITNDQGQMSGQNVRLDAG
HHHHCCCCCCCCCCCEEEEEEEEECCCCCEEEECCEEECCCEEECCCCCCCCCEEEECCC
AQAFSNVHGTINGAGTTTVTAGSVQNQGGAVTSTGTLDVRTPGSIENAGGTLAANGATTL
HHHHHCCCCEEECCCCEEEEECCCCCCCCEEEECCEEEEECCCCCCCCCCEEEECCCCEE
TAAKVNNQAGTIGSVSSSLTGNGPLDNTGGKALAGTDVAMSGGAITNDQGQIAGQSVTLD
EEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCCEEECCCCCCCCCEEEHH
AAEQAVSNAQGSIAGGIGATTIVAASLQNQQGSITSGGTLAVQTPGVIENAGGTLGATGA
HHHHHHHCCCCCCCCCCCHHHHEEEHHCCCCCCCCCCCEEEEECCCCEECCCCCCCCCCC
ATLSASQINNQAGTLGSVKDALTVNAPLDNTNGSATAGTNLTIQGGPIVNDHGQLSAGQT
CEEEHHHHCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCEEEEECCCEECCCCCCCCCCE
LKLDTEGQALVNTAGTMSAQDIQADTGTVNNQGGLVLAKGTLQANTHGQAYDNSQGGQTI
EEECCCCCEEEECCCCCCHHHCCCCCCCCCCCCCEEEEECEEEECCCCCCCCCCCCCCEE
AGGAMTLTTGALNNANGVVSGQQAVTVNGAAMANQQGQIIAGGPLAVTGDSLANAGGQVA
ECCEEEEEECCCCCCCCEECCCEEEEECCCEECCCCCCEEECCCEEEECCHHHCCCCEEE
ANGDVTLRMANTLDNTAGFTHAGGALDVQAATILNANTLGGTDANPLGMEGSTVQLVASA
ECCCEEEEEECCCCCCCCCCCCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHH
IDNILGALLADTALTVTAATLNNTQGEVTSGGTAQLNVDATTNTQGLLAANQKLGVTGAS
HHHHHHHHHHHHHEEEEEEEECCCCCCEECCCEEEEEEECCCCCCEEEEECCCCCCCCCC
LTGDGTVQSAQGDVALSLKSDFNNSGEVKAAINLALDTTGDVNNSGTMRAGNGLDVHGRN
CCCCCCEECCCCCEEEEEECCCCCCCCEEEEEEEEEECCCCCCCCCCEECCCCCEECCCC
LNNTGELYGAVSNHLRADQSVNNSGLIDGGAVRIDAGTTVTNVDRIFGDTVSIGAGQQIL
CCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCEEEHHHHHHCCEEECCCCHHHH
NDVNPATGQGGVIASRVGDINLGAPDIINREHALIYASQDLNVGGALDANGKATGQANSL
HCCCCCCCCCCEEEEECCCCCCCCCCCCCCCEEEEEEECCCCCCCEECCCCCCCCCCCCC
TNASATIDVARDANVNATSINNLNNHFETQVTDTGVVNTVTYRLRGSDQDIDPTTAIFWD
CCCCEEEEEECCCCCCCEEECCCCCCEEEEECCCCCEEEEEEEEECCCCCCCCCEEEEEE
WKRGSPDAAHPATDLGWLYQDANERGAYRWLILPSTQYPFSQFGPPFDWSRLPDGSTGPN
CCCCCCCCCCCCCHHHHHHHCCCCCCCEEEEEECCCCCCHHHCCCCCCCCCCCCCCCCCC
RGYYDAVEGDNAFLPAEQWTPVGLALAQFSQTDNVGNVVAVTDEHFYYQPGDAIWDKLGV
CCEEECCCCCCCEECCCCCCCHHHHHHHHCCCCCCCCEEEEECCCEEECCCHHHHHHHCC
QRPSSAPPPFQAACASDAPASCQDAYQAYQTWRQANFAQYQALNDKIKAFNLDFHSRVVR
CCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHCCCEEEEECCHHHHHHH
DFYSVNEQTQTRDETVKTTDPARLLVGGNATLNGAVVNDKSQILVGGDLIVPAPVDNRGY
HHHHCCCHHHCCCCCEECCCCCEEEECCCCEECCEEECCCCEEEECCCEEEECCCCCCCC
TGTRIETVTGSQDWNYINYGVNDPDRRTTPGPLPPINVNLPLVLATGTSLANQGTIGHDG
CCCEEEEEECCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCC
SAPGQGAGLGTMAGAQGAGTGLAPLATQPVLKELVLPTSGNGLAFNGPGARLGGATIRQV
CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCHHHHH
TPALAMPQNALFHVNTAPGAHYLIETDPRFTDQRQWLSSDFMLSQLGQDPNNVVKRLGDG
CHHHCCCCCCEEEEECCCCCEEEEECCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHCCC
FYEARLVADAVMLGTGQRFVGDYSDNEAQYIGLMKAGVTFAQQFHLTVGTELTPDQMAAL
HHHHHHHHHHHHCCCCCCEECCCCCCCEEEEEEHHHCHHEEEEEEEEECCCCCHHHHHHH
TSDMVWLVEKTVTLPDGSTQKVLVPQVYLMSHVGELKADGTLISANNVGIQTTGDVNNTG
HHHHEEEEEEEEECCCCCCCEEECHHHHHHHHHHCCCCCCEEEECCCCCEEECCCCCCCC
TISGRKLAVIDAQNINNIGGTLNGGTLVLNAQQDINNLAGKITGGNVAAQAGRDINFTTT
CCCCCEEEEEECCCCCCCCCEECCCEEEEECHHHHHHHCCEECCCCEECCCCCCEEEEEE
TTTATGVAGEAVHSRTVISGVSELNADNATLLAGRDLTATAASIATTGDLGLGAGRNVNL
CCCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHEECCCCCCCCCCCCCC
GTVEIAERRDSVADDKNRTSVARSTEIGTQIQAGGDATLLAGQDVNAKAAYVSAGGAIGV
CEEEEHHHHCCCCCCCCCHHHHHHHCCCCEEECCCCEEEEECCCCCCEEEEEECCCEEEC
GAGHDINIRAGQASVSIRDEQSRTSGGFLSSQSTHTIDQKASTDAVGSTFSGNTVDMQAK
CCCCEEEEECCCEEEEEECCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEC
HDLTLAGSTVAGTKDVNLSAGHNLEIGTTETQSSAYSFKEEKKSGFGATGNGISYGSRDQ
CCEEEECCEECCCCCCEECCCCCEEECCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCC
KDTTHDAGTQQVGSMVGSTDGSVHLNAGNTLSVKGSSLIAAQDITGKGADINIEAAQNAQ
CCCCCCCCHHHHHHHHCCCCCEEEECCCCEEEECCCEEEEEECCCCCCCEEEEEECCCCC
HHDETHEVKQSGFTLGVGGTVGQVMSAAQKINNASKSQDGRASALWGIAAARDAYDGASA
CCHHHHHHHHCCEEEEECCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCHHH
IGSMVGGSGGGSPASGQQPSATVQLSWGSSQSKQTLTQDSTSHNGSRVSAGGTASFQATG
HHHHHCCCCCCCCCCCCCCCEEEEEECCCCCCHHHHHCCCCCCCCCEEECCCCCEEEEEC
VDADGNKTAGNLNVIGSDINASKVALQAKHDVNIVSATDTDESHSTNKSSSVSVGVSVGT
CCCCCCCCCCEEEEEECCCCCCEEEEEECCCEEEEEECCCCCCCCCCCCCCEEEEEEECC
TGYGVSASGSMAKGNSDSTGTSQANSHVRGSESVTIVSENDTNILGGTVSGGHVAMDVGG
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCEEEEECCCCCEEEEEECC
NLNLASRQDTQQMHADQQSASGGVSFSTMGGFSGNASYMQGKANGSYANVGEQTGVYAGQ
CCEECCCHHHHHHHHCHHHCCCCEEEEECCCCCCCCCEEECCCCCCCCCCCCCCCEEECC
GGFDINVKGNTDLKGAVIASDATKDRNNLSTGTLTYSDLQNHSGYSATSVGTSVGTSPSA
CCEEEEECCCCCCCEEEEECCCCCCCCCCCCCEEEHHHHHCCCCCCHHHCCCCCCCCCCC
MSPMIPQHKSQSENGVAQSAIADGAITIKDQAKQSQDVAGLKRDTTSTNSQVGNNPNLKN
CCCCCCCCCCCCCCCHHHHHHCCCEEEEHHHHCCHHHHCCCCCCCCCCCCCCCCCCCHHH
VLDSQADTMAAAQAAGAAVARTVGDIAQSKQDDAQKRMETAGQALKQDPSPENQAAFDAA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHH
KADYEGWNEGGQYRAGLQAAGGALIGGLGGGSALTAAGGAAGAGAASLAATKLEDVANNV
HCCCCCCCCCCCEECCHHHCCCEEEECCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHH
SKAVGSDNPVLNQAIGNLVSSVAAGGLGALVGGVAGAATGANVDIYNRQLHPDEQTLANR
HHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCEEEEECCCCCCHHHHHHH
LSAASGGKYTPQQVGDAMRAASNGKYKEDVTAGMVVDGVSNPNGVYDNGAMFTVPSQDGR
HHHCCCCCCCHHHHHHHHHHHCCCCCHHCCCCCEEEECCCCCCCCCCCCEEEEECCCCCC
TLVQTIPNSVDPNLAGYIQSMTGGKSSPYSWNDETLGKTAPTMASTPTNPFTPAPNGCIT
HHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE
AECAAGLGQQGRGLIPDYATGGVSVLSGSASATVNLYDGTSYVAGGVTQNFPSVSWKPGV
HHHHHCCCCCCCCCCCCCCCCCEEEEECCCEEEEEEECCCCEEECCCCCCCCCCCCCCCC
TGTVGWIFGANDAKAANSFLNGDGNQAFVSIPTPFKFNVVGAVTHAYGGSTAIEFGLGSP
CEEEEEEEECCCHHHHHHHHCCCCCEEEEECCCCEEEEEEEEEEECCCCCEEEEECCCCC
GTIGYGVTPWSHGVPVTNGGK
CCCCCCCCCCCCCCCCCCCCC
>Mature Secondary Structure
MNTELYRLVFNVARGMLVAVQECAKGRGKGSHTGTRASSGSGVFVPVFWFTALSAATMGL
CCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEHHHHHHHHHHHHHCC
PLAHHAQAQTLPIQVDRGATGAQPYVSTAANGTPVVNIAPPNRPGGTSVNNFIQYNVGPS
CCHHCCCCCEEEEEECCCCCCCCCCCEECCCCCEEEEECCCCCCCCCCCCCEEEECCCCC
GVVVNNSGQNSQTQIAGWVHGNMQLGNNHAGTIVQQVTAPNPSQLLGMQEIAGNSAALVL
CEEEECCCCCCCEEEEEEEECCEEECCCCCCEEEEEECCCCHHHHHHHHHHCCCCEEEEE
VNPAGIYCSACGTIGADRFTLSTGRALYGPDGSLAGFDVSQGNLAIGAQGLSSPQAQVDL
ECCCCEEEECCCCCCCCEEEEECCCEEECCCCCEEEEEECCCCEEEECCCCCCCHHHHHH
LARSIQVNGEVWSKYLNAIAGANQIDAETLAATPQAGAGRAPQFAIDASALGSMYAGAVR
HHHHHCCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCEEECHHHHHHHHHCEEE
LVGTEKGIGFNIGNNIVASTGDIVLDVNGDVRILPSARLQAQGAATVSGTNLDNAGTVTT
EEECCCCCCEECCCCEEECCCCEEEECCCCEEECCCCCEECCCCEEEECCCCCCCCEEEE
RGRITATTPGTLSNSGVLSAGGDVLAQTSQLANSGTIGAGTDANGNVTQAGTANIAASAA
CCEEEEECCCCCCCCCEEECCCHHHHHHHHHCCCCCCCCCCCCCCCEEECCCCCCEEHHE
IQSSGSILAAGDANLSAPRLNLNGGTLIAHNTANVSATGDISHQGARLEGNAVQIAAGGT
ECCCCCEEEECCCCCCCCEEECCCCEEEEECCCCEEECCCCCCCCCEECCCEEEEEECCC
FDNTAGSVVAGANGATVQAASILNRSGSLSSGGTLGVNAQQTLDNTAGTVAGTGAATLQA
CCCCCCCEEECCCCCEEEHHHHHCCCCCCCCCCEECCCHHHHHCCCCCCEECCCCHHHHH
ANTINRGGSLGSTQGAVQVTGALDNTGGKTLAAGDLSVSGGAITNDQGQMSGQNVRLDAG
HHHHCCCCCCCCCCCEEEEEEEEECCCCCEEEECCEEECCCEEECCCCCCCCCEEEECCC
AQAFSNVHGTINGAGTTTVTAGSVQNQGGAVTSTGTLDVRTPGSIENAGGTLAANGATTL
HHHHHCCCCEEECCCCEEEEECCCCCCCCEEEECCEEEEECCCCCCCCCCEEEECCCCEE
TAAKVNNQAGTIGSVSSSLTGNGPLDNTGGKALAGTDVAMSGGAITNDQGQIAGQSVTLD
EEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCCEEECCCCCCCCCEEEHH
AAEQAVSNAQGSIAGGIGATTIVAASLQNQQGSITSGGTLAVQTPGVIENAGGTLGATGA
HHHHHHHCCCCCCCCCCCHHHHEEEHHCCCCCCCCCCCEEEEECCCCEECCCCCCCCCCC
ATLSASQINNQAGTLGSVKDALTVNAPLDNTNGSATAGTNLTIQGGPIVNDHGQLSAGQT
CEEEHHHHCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCEEEEECCCEECCCCCCCCCCE
LKLDTEGQALVNTAGTMSAQDIQADTGTVNNQGGLVLAKGTLQANTHGQAYDNSQGGQTI
EEECCCCCEEEECCCCCCHHHCCCCCCCCCCCCCEEEEECEEEECCCCCCCCCCCCCCEE
AGGAMTLTTGALNNANGVVSGQQAVTVNGAAMANQQGQIIAGGPLAVTGDSLANAGGQVA
ECCEEEEEECCCCCCCCEECCCEEEEECCCEECCCCCCEEECCCEEEECCHHHCCCCEEE
ANGDVTLRMANTLDNTAGFTHAGGALDVQAATILNANTLGGTDANPLGMEGSTVQLVASA
ECCCEEEEEECCCCCCCCCCCCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHH
IDNILGALLADTALTVTAATLNNTQGEVTSGGTAQLNVDATTNTQGLLAANQKLGVTGAS
HHHHHHHHHHHHHEEEEEEEECCCCCCEECCCEEEEEEECCCCCCEEEEECCCCCCCCCC
LTGDGTVQSAQGDVALSLKSDFNNSGEVKAAINLALDTTGDVNNSGTMRAGNGLDVHGRN
CCCCCCEECCCCCEEEEEECCCCCCCCEEEEEEEEEECCCCCCCCCCEECCCCCEECCCC
LNNTGELYGAVSNHLRADQSVNNSGLIDGGAVRIDAGTTVTNVDRIFGDTVSIGAGQQIL
CCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCEEEHHHHHHCCEEECCCCHHHH
NDVNPATGQGGVIASRVGDINLGAPDIINREHALIYASQDLNVGGALDANGKATGQANSL
HCCCCCCCCCCEEEEECCCCCCCCCCCCCCCEEEEEEECCCCCCCEECCCCCCCCCCCCC
TNASATIDVARDANVNATSINNLNNHFETQVTDTGVVNTVTYRLRGSDQDIDPTTAIFWD
CCCCEEEEEECCCCCCCEEECCCCCCEEEEECCCCCEEEEEEEEECCCCCCCCCEEEEEE
WKRGSPDAAHPATDLGWLYQDANERGAYRWLILPSTQYPFSQFGPPFDWSRLPDGSTGPN
CCCCCCCCCCCCCHHHHHHHCCCCCCCEEEEEECCCCCCHHHCCCCCCCCCCCCCCCCCC
RGYYDAVEGDNAFLPAEQWTPVGLALAQFSQTDNVGNVVAVTDEHFYYQPGDAIWDKLGV
CCEEECCCCCCCEECCCCCCCHHHHHHHHCCCCCCCCEEEEECCCEEECCCHHHHHHHCC
QRPSSAPPPFQAACASDAPASCQDAYQAYQTWRQANFAQYQALNDKIKAFNLDFHSRVVR
CCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHCCCEEEEECCHHHHHHH
DFYSVNEQTQTRDETVKTTDPARLLVGGNATLNGAVVNDKSQILVGGDLIVPAPVDNRGY
HHHHCCCHHHCCCCCEECCCCCEEEECCCCEECCEEECCCCEEEECCCEEEECCCCCCCC
TGTRIETVTGSQDWNYINYGVNDPDRRTTPGPLPPINVNLPLVLATGTSLANQGTIGHDG
CCCEEEEEECCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCC
SAPGQGAGLGTMAGAQGAGTGLAPLATQPVLKELVLPTSGNGLAFNGPGARLGGATIRQV
CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCHHHHH
TPALAMPQNALFHVNTAPGAHYLIETDPRFTDQRQWLSSDFMLSQLGQDPNNVVKRLGDG
CHHHCCCCCCEEEEECCCCCEEEEECCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHCCC
FYEARLVADAVMLGTGQRFVGDYSDNEAQYIGLMKAGVTFAQQFHLTVGTELTPDQMAAL
HHHHHHHHHHHHCCCCCCEECCCCCCCEEEEEEHHHCHHEEEEEEEEECCCCCHHHHHHH
TSDMVWLVEKTVTLPDGSTQKVLVPQVYLMSHVGELKADGTLISANNVGIQTTGDVNNTG
HHHHEEEEEEEEECCCCCCCEEECHHHHHHHHHHCCCCCCEEEECCCCCEEECCCCCCCC
TISGRKLAVIDAQNINNIGGTLNGGTLVLNAQQDINNLAGKITGGNVAAQAGRDINFTTT
CCCCCEEEEEECCCCCCCCCEECCCEEEEECHHHHHHHCCEECCCCEECCCCCCEEEEEE
TTTATGVAGEAVHSRTVISGVSELNADNATLLAGRDLTATAASIATTGDLGLGAGRNVNL
CCCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHEECCCCCCCCCCCCCC
GTVEIAERRDSVADDKNRTSVARSTEIGTQIQAGGDATLLAGQDVNAKAAYVSAGGAIGV
CEEEEHHHHCCCCCCCCCHHHHHHHCCCCEEECCCCEEEEECCCCCCEEEEEECCCEEEC
GAGHDINIRAGQASVSIRDEQSRTSGGFLSSQSTHTIDQKASTDAVGSTFSGNTVDMQAK
CCCCEEEEECCCEEEEEECCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEC
HDLTLAGSTVAGTKDVNLSAGHNLEIGTTETQSSAYSFKEEKKSGFGATGNGISYGSRDQ
CCEEEECCEECCCCCCEECCCCCEEECCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCC
KDTTHDAGTQQVGSMVGSTDGSVHLNAGNTLSVKGSSLIAAQDITGKGADINIEAAQNAQ
CCCCCCCCHHHHHHHHCCCCCEEEECCCCEEEECCCEEEEEECCCCCCCEEEEEECCCCC
HHDETHEVKQSGFTLGVGGTVGQVMSAAQKINNASKSQDGRASALWGIAAARDAYDGASA
CCHHHHHHHHCCEEEEECCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCHHH
IGSMVGGSGGGSPASGQQPSATVQLSWGSSQSKQTLTQDSTSHNGSRVSAGGTASFQATG
HHHHHCCCCCCCCCCCCCCCEEEEEECCCCCCHHHHHCCCCCCCCCEEECCCCCEEEEEC
VDADGNKTAGNLNVIGSDINASKVALQAKHDVNIVSATDTDESHSTNKSSSVSVGVSVGT
CCCCCCCCCCEEEEEECCCCCCEEEEEECCCEEEEEECCCCCCCCCCCCCCEEEEEEECC
TGYGVSASGSMAKGNSDSTGTSQANSHVRGSESVTIVSENDTNILGGTVSGGHVAMDVGG
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCEEEEECCCCCEEEEEECC
NLNLASRQDTQQMHADQQSASGGVSFSTMGGFSGNASYMQGKANGSYANVGEQTGVYAGQ
CCEECCCHHHHHHHHCHHHCCCCEEEEECCCCCCCCCEEECCCCCCCCCCCCCCCEEECC
GGFDINVKGNTDLKGAVIASDATKDRNNLSTGTLTYSDLQNHSGYSATSVGTSVGTSPSA
CCEEEEECCCCCCCEEEEECCCCCCCCCCCCCEEEHHHHHCCCCCCHHHCCCCCCCCCCC
MSPMIPQHKSQSENGVAQSAIADGAITIKDQAKQSQDVAGLKRDTTSTNSQVGNNPNLKN
CCCCCCCCCCCCCCCHHHHHHCCCEEEEHHHHCCHHHHCCCCCCCCCCCCCCCCCCCHHH
VLDSQADTMAAAQAAGAAVARTVGDIAQSKQDDAQKRMETAGQALKQDPSPENQAAFDAA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHH
KADYEGWNEGGQYRAGLQAAGGALIGGLGGGSALTAAGGAAGAGAASLAATKLEDVANNV
HCCCCCCCCCCCEECCHHHCCCEEEECCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHH
SKAVGSDNPVLNQAIGNLVSSVAAGGLGALVGGVAGAATGANVDIYNRQLHPDEQTLANR
HHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCEEEEECCCCCCHHHHHHH
LSAASGGKYTPQQVGDAMRAASNGKYKEDVTAGMVVDGVSNPNGVYDNGAMFTVPSQDGR
HHHCCCCCCCHHHHHHHHHHHCCCCCHHCCCCCEEEECCCCCCCCCCCCEEEEECCCCCC
TLVQTIPNSVDPNLAGYIQSMTGGKSSPYSWNDETLGKTAPTMASTPTNPFTPAPNGCIT
HHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE
AECAAGLGQQGRGLIPDYATGGVSVLSGSASATVNLYDGTSYVAGGVTQNFPSVSWKPGV
HHHHHCCCCCCCCCCCCCCCCCEEEEECCCEEEEEEECCCCEEECCCCCCCCCCCCCCCC
TGTVGWIFGANDAKAANSFLNGDGNQAFVSIPTPFKFNVVGAVTHAYGGSTAIEFGLGSP
CEEEEEEEECCCHHHHHHHHCCCCCEEEEECCCCEEEEEEEEEEECCCCCEEEEECCCCC
GTIGYGVTPWSHGVPVTNGGK
CCCCCCCCCCCCCCCCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 2388559; 12910271; 2539596; 1696934; 1791761 [H]