Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

The map label for this gene is eaeH [H]

Identifier: 157159804

GI number: 157159804

Start: 361814

End: 366067

Strand: Direct

Name: eaeH [H]

Synonym: EcHS_A0351

Alternate gene names: 157159804

Gene position: 361814-366067 (Clockwise)

Preceding gene: 157159800

Following gene: 157159806

Centisome position: 7.79
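
The gene length and centisome position can be cross-checked from the coordinates above. The sketch below assumes inclusive 1-based coordinates, and assumes the centisome position is the start coordinate expressed as a percentage of the genome length (4,643,538). The function names are illustrative and not part of the record.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from this record.
print(gene_length(361814, 366067))            # 4254
print(round(centisome(361814, 4643538), 2))   # 7.79
```

Under these assumptions the results agree with the 4254-base gene sequence and the listed centisome position of 7.79.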

GC content: 52.26
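
A GC content figure like the one above is normally the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation. The function name is hypothetical, and the record does not state how ambiguous bases, if any, were handled, so they are simply counted in the denominator here.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as in FASTA-style blocks) are ignored.
    """
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Toy input; paste the full gene sequence to check the record's value.
print(gc_content("ATGC"))  # 50.0
```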

Gene sequence:

>4254_bases
ATGTCGCGTTATAAAACAGATCATAAACAACCACGATTTCGTTATTCAGTTCTGGCCCGCTGCGTGGCGTGGGCAAATAT
CTCTGTTCAGGTTCTTTTTCCACTCGCTGTCACCTTTACCCCAGTAATGGCGGCACGTGCGCAGCATGCGGTTCAGCCAC
GGTTGAGCATGGGAAATACTACGGTAACTGCTGATAATAACGTGGAGAAAAATGTCGCGTCGTTTGCCGCAAATGCCGGG
ACATTTTTAAGCAGTCAGCCAGATAGCGATGCGACACGTAACTTTATTACCGGAATGGCCACAGCTAAAGCTAACCAGGA
AATACAGGAGTGGCTCGGGAAATATGGTACTGCGCGCGTCAAACTGAATGTCGATAAAGATTTCTCGCTGAAGGATTCTT
CGCTGGAAATGCTTTATCCGATTTATGATACGCCAACAAATATGTTGTTCACTCAGGGAGCAATACATCGTACCGACGAT
CGTACTCAGTCAAATATTGGTTTTGGCTGGCGTCATTTTTCAGGAAATGACTGGATGGCGGGGGTGAACACCTTTATCGA
CCATGATTTATCCCGTAGTCATACCCGCATTGGTGTTGGTGCGGAATACTGGCGCGATTATCTGAAACTGAGCGCCAATG
GTTATATTCGGGCTTCTGGCTGGAAAAAATCGCCGGATATTGAGGATTATCAGGAACGCCCGGCGAATGGTTGGGATATC
CGCGCAGAGGGCTATTTACCTGCCTGGCCGCAGCTTGGCGCAAGCCTGATGTATGAACAGTATTATGGCGATGAAGTCGG
GCTGTTTGGTAAAGATAAGCGCCAGAAAGACCCGCATGCTATTTCTGCCGAGGTGACCTATACGCCAGTGCCTCTTCTGA
CACTGAGCGCCGGGCATAAGCAGGGCAAGAGCGGTGAGAATGACACTCGCTTTGGCCTGGAAGTTAATTATCGGATTGGC
GAACCTCTGGAGAAACAACTCGATACAGACAGCATTCGCGAGCGTCGAATGCTGGCAGGCAGCCGCTATGACCTGGTTGA
GCGTAATAACAACATCGTTCTTGAGTACCGCAAATCTGAAGTGATCCGTATTGCTCTGCCTGAGCGTATTGAAGGTAAGG
GCGGTCAGACACTTTCCCTGGGGCTTGTGGTCAGCAAAGCAACTCACGGACTGAAAAATGTGCAGTGGGAAGCGCCGTCA
TTACTGGCTGAAGGTGGCAAAATTACCGGTCAGGGTAGTCAGTGGCAAGTAACGCTCCCGGCTTATCGTCCAGGCAAAGA
CAATTATTATGCGATTTCTGCGGTTGCCTACGATAACAAAGGCAATGCCTCAAAACGCGTGCAGACAGAGGTGGTCATTA
CCGGAGCAGGTATGAGTGCCGATCGCACGGCGTTAACGCTTGACGGTCAGAGCCGTATTCAAATGCTTGCTAACGGTAAT
GAGCAAAGACCGCTGGTGCTGTCTCTGCGCGACGCCGAGGGCCAGCCAGTCACGGGCATGAAAGATCAGATCAAGACTGA
ACTAACTTTCAAACCGGCTGGAAATATTGTGACTCGTACCCTGAAGGCCACTAAATCACAGGCAAAGCCAACACTGGGTG
AGTTCACCGAAACTGAAGCCGGGGTGTATCAGTCTGTCTTTACTACCGGAACGCAGTCAGGTGAGGCAACGATTACTGTT
AGTGTTGATGGCATGAGCAAAACCGTCACTGCAGAACTGCGGGCCACGATGATGGATGTGGCAAACTCCACCCTGAGCGC
TAACGAGCCGTCAGGTGACGTGGTTGCTGATGGTCAGCAAGCCTACACGCTGACGCTGACTGCGGTGGATACTGATGGTA
ACCCGGTGACGGGAGAGGCCAGCCGCTTGCGATTTGTTCCGCAAGACACTAATGGTGTCACCATTGGTACAATTTCGGAG
ATAAAACCAGGCGTTTACAGCGCCACGGTTTCTTCGACCCGTGCCGGAAACGTTGTTGTGCGTGCTTTCAGCGAGCAGTA
TCAGCTGGGCACATTACAACAAACGCTGAAGTTTGTTGCCGGGCCGCTTGATGCAGCACATTCGTCCATCACACTGAATC
CTGATAAACCGGTGGTTGGCGGTACAGTTACGGCAATCTGGACGGCAAAAGATGCTAATGACAACCCTGTAACTGGTCTC
AATCCGGATGCACCGTCATTATCGGGCGCAGCTGCTGCTGGTTCTACGGCATCAGGCTGGACGGATAATGGCGACGGGAC
CTGGACTGCGCAGATTTCTCTCGGCACTACGGCGGGTGAATTAGAGGTTATTCCGAAGCTAAATGGACAGGATGCGGCAG
CAAATGCGGCAAAAGTAACCGTGGTGGCTGATGCGTTATCTTCAAACCAGTCGAAAGTCTCTGTCGCAGAAGATCACGTA
AAAGCCGGCGAAAGCACAACCGTGACGCTTATTGCAAAAGATGCACATGGCAACGCTATCAGTGGTCTTTCCCTGTCGGC
AAGCCTGACGGGTGCTGCGTCTGAAGGGGCGACTGTTTCTGGTTGGACCGAAAAAGGTGATGGTTCCTATGTCGCTACGC
TGACAACAGGTGGAAAGACGGGTGAGCTTCTCGTCATGCCGCTATTCAACGGCCAGCCAGCAGCCACCGAAGCCGCGCAG
TTGACTGTCATTGCGGGGGAGATGTCATCAGCGAACTCTACGCTTGTTGCTGACAATAAGGCTCCGACCGTCAAAACGAC
GACGAAACTCACCTTCACCGTGAAGGATGCGTACGGGAACCTTGTCACCGGGCTGAAGCCAGATGCACCGCAGTTTAGTG
GTGCCGCCAGCACGGGGACAGAGCGACCTTCAACAGGAGACTGGACAGAAACAAGTAATGGGGTCTACGTGGCGACCTTG
ACTCTGGGATCTGCCGCGGGCCAGTTGTCTGTGATGCCGCGAGTGAACGGCCAAAATGCCGTTGCTCAGCCACTGGTGCT
GAATGTTGCTGGTGACGCATCTAAGGCTGAGATTCGTGATATGACGGTGAAGGTTGATAACCAGCTGGCTAATGGACAAT
CGACTAACCAGGTAACCCTGACCGTTGTGGACACCTATGGTAACCCGTTGCAGGGACAAAATGTGACGCTGACTCTGCCG
AAAGGTGTGACCAGCAAGACGGGGAATACGGTAACAACCGATGCGGCAGGTAAAGCCGACATTGAGCTGATGTCAACGGT
TGCCGGGGAACACAGCATCACGGCCTCAGTGAATAATGCTCAGAAGACGGTTACGGTGAAATTCAAGGCGGATTTCAGTA
CCGGTCAGGCGAGTCTGGAGGTTGATAGCGCCGCGCCAAAAGTAGCAAACGGCAAAGATGCCTTTACGCTGACGGCGACC
GTTGAGGATAAAAATGGTAACCCTGTTCCAGGGAGCCTGGTGACCTTTAATCTGCCCCGGGGTGTCAAGCCGCTTACAGG
CGATAATGTCTGGGTGAAAGCCAACGATGAGGGGAAAGCAGAGTTGCAGGTGGTTTCAGTGACTGCCGGAACGTATGAGA
TCACGGCATCGGCGGGGAATAGCCAGCCTTCGGATACGCAGACTATAACGTTTGTAGCCGATAAGGCTACCGCAACCGTC
TCCGGTATTGAGGTGATTGGCAACTATGCGCTGGCGGACGGCAAAGCCAAACAAACGTATAAAGTTACGGTGACTGATGC
CAATAACAATTTGGTGAAAGATAGCGACGTGACGCTGACTGCCAGCCCGGCTTCGTTAAACCTGGAACCGAATGGCACTG
CGACAACGAATGAGCAAGGGCAGGCTATTTTCACCGCTACCACTACTGTTGCGGCGACATACACACTCAAGGCGCAAGTG
AGTCAGACCAACGGTCAGGTATCAACGAAAACTGCCGAATCTAAATTCGTTGCGGATGATAAAAACGCGGTACTCACCGC
ATCATCTGATATGCAATCTCTGGTGGCGGATGGGAAATCGACTGCGAAGCTGGAGGTGACACTGATGTCGGCAAACAACC
CCGTTGGCGGGAATATGTGGGTCGACATTCAGACGCCGGAAGGGGTGACGGAGAAGGATTATCAGTTCCTGCCGTCGAAA
AATGACCATTTCGTGAGCGGAAAAATCACGCGTAAATTTAGTACCAGCAAGCCTGGTGTCTATACGTTCACATTTAACGC
CCTGACCTATGGCGGGTACGAAATGAAGCCAGTGACGGTGACCATTACCGCGGTGGATGCCGATACGGCAAAGGACGAGG
AGGCGATGAAATAA

Upstream 100 bases:

>100_bases
CGGAAAAGGAAATCGGGAAATCCCCGGTTTTTCTGACAAGCAGACGCCATTATTTGTGTCTGCCTATGTTCGTTAATTCG
TTCATCAGGAAATTATCTCA

Downstream 100 bases:

>100_bases
TTTTAAGAATATGAAAGTAACATTCTAATAAACAATAATGGCGTTGACATCTTCAACGCCATTATTTATTGAATTTCCTG
ACAGTAGTCTCGTTATAAAT

Product: putative intimin

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 1417; Mature: 1416
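
The two residue counts follow from the 4254-base gene sequence. This sketch assumes the last codon is a stop codon and that "Mature" means the initiator methionine has been removed, which is consistent with the mature sequence in this record starting at S rather than M. The function name is illustrative.

```python
from typing import Tuple


def residue_counts(cds_bases: int) -> Tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes the sequence ends in one stop codon and that the mature
    protein lacks only the N-terminal Met (an assumption, see lead-in).
    """
    if cds_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = cds_bases // 3
    translated = codons - 1   # stop codon encodes no residue
    mature = translated - 1   # initiator Met removed (assumed)
    return translated, mature


print(residue_counts(4254))  # (1417, 1416)
```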

Protein sequence:

>1417_residues
MSRYKTDHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAG
TFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD
RTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI
RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLLTLSAGHKQGKSGENDTRFGLEVNYRIG
EPLEKQLDTDSIRERRMLAGSRYDLVERNNNIVLEYRKSEVIRIALPERIEGKGGQTLSLGLVVSKATHGLKNVQWEAPS
LLAEGGKITGQGSQWQVTLPAYRPGKDNYYAISAVAYDNKGNASKRVQTEVVITGAGMSADRTALTLDGQSRIQMLANGN
EQRPLVLSLRDAEGQPVTGMKDQIKTELTFKPAGNIVTRTLKATKSQAKPTLGEFTETEAGVYQSVFTTGTQSGEATITV
SVDGMSKTVTAELRATMMDVANSTLSANEPSGDVVADGQQAYTLTLTAVDTDGNPVTGEASRLRFVPQDTNGVTIGTISE
IKPGVYSATVSSTRAGNVVVRAFSEQYQLGTLQQTLKFVAGPLDAAHSSITLNPDKPVVGGTVTAIWTAKDANDNPVTGL
NPDAPSLSGAAAAGSTASGWTDNGDGTWTAQISLGTTAGELEVIPKLNGQDAAANAAKVTVVADALSSNQSKVSVAEDHV
KAGESTTVTLIAKDAHGNAISGLSLSASLTGAASEGATVSGWTEKGDGSYVATLTTGGKTGELLVMPLFNGQPAATEAAQ
LTVIAGEMSSANSTLVADNKAPTVKTTTKLTFTVKDAYGNLVTGLKPDAPQFSGAASTGTERPSTGDWTETSNGVYVATL
TLGSAAGQLSVMPRVNGQNAVAQPLVLNVAGDASKAEIRDMTVKVDNQLANGQSTNQVTLTVVDTYGNPLQGQNVTLTLP
KGVTSKTGNTVTTDAAGKADIELMSTVAGEHSITASVNNAQKTVTVKFKADFSTGQASLEVDSAAPKVANGKDAFTLTAT
VEDKNGNPVPGSLVTFNLPRGVKPLTGDNVWVKANDEGKAELQVVSVTAGTYEITASAGNSQPSDTQTITFVADKATATV
SGIEVIGNYALADGKAKQTYKVTVTDANNNLVKDSDVTLTASPASLNLEPNGTATTNEQGQAIFTATTTVAATYTLKAQV
SQTNGQVSTKTAESKFVADDKNAVLTASSDMQSLVADGKSTAKLEVTLMSANNPVGGNMWVDIQTPEGVTEKDYQFLPSK
NDHFVSGKITRKFSTSKPGVYTFTFNALTYGGYEMKPVTVTITAVDADTAKDEEAMK
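
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. As a rough check, the sketch below translates DNA with the standard genetic code, whose amino acid assignments are the same as those of the bacterial code (NCBI table 11). The helper names are hypothetical, and unknown codons are reported as X by choice.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGTCGCGTTATAAAACAGATCATAAA"))  # MSRYKTDHK
print(translate("GCGATGAAATAA"))                 # AMK
```

Both fragments agree with the start (MSRYKTDHK) and end (AMK) of the listed protein sequence.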

Sequences:

>Mature_1416_residues
SRYKTDHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGT
FLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDR
TQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIR
AEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLLTLSAGHKQGKSGENDTRFGLEVNYRIGE
PLEKQLDTDSIRERRMLAGSRYDLVERNNNIVLEYRKSEVIRIALPERIEGKGGQTLSLGLVVSKATHGLKNVQWEAPSL
LAEGGKITGQGSQWQVTLPAYRPGKDNYYAISAVAYDNKGNASKRVQTEVVITGAGMSADRTALTLDGQSRIQMLANGNE
QRPLVLSLRDAEGQPVTGMKDQIKTELTFKPAGNIVTRTLKATKSQAKPTLGEFTETEAGVYQSVFTTGTQSGEATITVS
VDGMSKTVTAELRATMMDVANSTLSANEPSGDVVADGQQAYTLTLTAVDTDGNPVTGEASRLRFVPQDTNGVTIGTISEI
KPGVYSATVSSTRAGNVVVRAFSEQYQLGTLQQTLKFVAGPLDAAHSSITLNPDKPVVGGTVTAIWTAKDANDNPVTGLN
PDAPSLSGAAAAGSTASGWTDNGDGTWTAQISLGTTAGELEVIPKLNGQDAAANAAKVTVVADALSSNQSKVSVAEDHVK
AGESTTVTLIAKDAHGNAISGLSLSASLTGAASEGATVSGWTEKGDGSYVATLTTGGKTGELLVMPLFNGQPAATEAAQL
TVIAGEMSSANSTLVADNKAPTVKTTTKLTFTVKDAYGNLVTGLKPDAPQFSGAASTGTERPSTGDWTETSNGVYVATLT
LGSAAGQLSVMPRVNGQNAVAQPLVLNVAGDASKAEIRDMTVKVDNQLANGQSTNQVTLTVVDTYGNPLQGQNVTLTLPK
GVTSKTGNTVTTDAAGKADIELMSTVAGEHSITASVNNAQKTVTVKFKADFSTGQASLEVDSAAPKVANGKDAFTLTATV
EDKNGNPVPGSLVTFNLPRGVKPLTGDNVWVKANDEGKAELQVVSVTAGTYEITASAGNSQPSDTQTITFVADKATATVS
GIEVIGNYALADGKAKQTYKVTVTDANNNLVKDSDVTLTASPASLNLEPNGTATTNEQGQAIFTATTTVAATYTLKAQVS
QTNGQVSTKTAESKFVADDKNAVLTASSDMQSLVADGKSTAKLEVTLMSANNPVGGNMWVDIQTPEGVTEKDYQFLPSKN
DHFVSGKITRKFSTSKPGVYTFTFNALTYGGYEMKPVTVTITAVDADTAKDEEAMK

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the intimin/invasin family [H]

Homologues:

Organism=Escherichia coli, GI145693153, Length=1361, Percent_Identity=32.6965466568699, Blast_Score=563, Evalue=1e-161
Organism=Escherichia coli, GI145693120, Length=362, Percent_Identity=34.8066298342541, Blast_Score=194, Evalue=5e-50

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR003535 [H]

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 149888; Mature: 149757

Theoretical pI: Translated: 5.31; Mature: 5.31

Prosite motif: PS51127 BIG1 ; PS00178 AA_TRNA_LIGASE_I

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.1 %Cys     (Translated Protein)
1.8 %Met     (Translated Protein)
1.8 %Cys+Met (Translated Protein)
0.1 %Cys     (Mature Protein)
1.7 %Met     (Mature Protein)
1.8 %Cys+Met (Mature Protein)
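
Percentages like these can be recomputed from the protein sequences by counting residues. This is a minimal sketch with a hypothetical function name. It assumes the figures are residue counts divided by sequence length, which the record does not state explicitly.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any of `residues`.

    `residues` is a string of distinct one-letter codes, e.g. "C" or "CM".
    """
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    hits = sum(protein.count(r) for r in residues.upper())
    return 100.0 * hits / len(protein)


# First ten residues of the translated protein, used as a toy input.
print(residue_percent("MSRYKTDHKQ", "M"))   # 10.0
print(residue_percent("MSRYKTDHKQ", "CM"))  # 10.0
```

Because each listed figure is rounded separately, the combined Cys+Met value need not equal the sum of the two rounded values.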

Secondary structure:

>Translated Secondary Structure
MSRYKTDHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT
CCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEHHHHHHHHHHHHHHHHCCCCCCCCCE
TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV
EEEECCCCHHHHHHHHCCCCCEECCCCCCHHHHHHHHHHHHHHCHHHHHHHHHCCCCEEE
KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA
EEEECCCCCCCCCCCEEEEEEECCCCCEEEECCCEECCCCCCCCCCCEEEEEECCCCCCH
GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI
HHHHHHHCCCCCCCEEECCCHHHHHHHHEECCCCEEEECCCCCCCCCHHHHHCCCCCCEE
RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLLTLSAGHK
EECCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEEEEECCCCEEEEECCCC
QGKSGENDTRFGLEVNYRIGEPLEKQLDTDSIRERRMLAGSRYDLVERNNNIVLEYRKSE
CCCCCCCCCEEEEEEEEECCCCHHHHCCHHHHHHHHHHCCCCEEEEECCCCEEEEEECCC
VIRIALPERIEGKGGQTLSLGLVVSKATHGLKNVQWEAPSLLAEGGKITGQGSQWQVTLP
EEEEECCCCCCCCCCCEEEEEEEEECHHCCCCCCCCCCCHHHHCCCEEECCCCEEEEECC
AYRPGKDNYYAISAVAYDNKGNASKRVQTEVVITGAGMSADRTALTLDGQSRIQMLANGN
CCCCCCCCEEEEEEEEECCCCCCCCEEEEEEEEEECCCCCCCEEEEECCHHHEEEEECCC
EQRPLVLSLRDAEGQPVTGMKDQIKTELTFKPAGNIVTRTLKATKSQAKPTLGEFTETEA
CCCCEEEEEECCCCCCCCCCHHHHEEEEEECCCCCHHHHHHHHHHHHCCCCCCCCCCCHH
GVYQSVFTTGTQSGEATITVSVDGMSKTVTAELRATMMDVANSTLSANEPSGDVVADGQQ
HHHHHHHHCCCCCCCEEEEEEECCCCEEEHHHHHHHHHHHHCCCCCCCCCCCCEEECCCE
AYTLTLTAVDTDGNPVTGEASRLRFVPQDTNGVTIGTISEIKPGVYSATVSSTRAGNVVV
EEEEEEEEEECCCCCCCCCHHEEEEEECCCCCEEEEEHHHCCCCEEEEEECCCCCCCEEE
RAFSEQYQLGTLQQTLKFVAGPLDAAHSSITLNPDKPVVGGTVTAIWTAKDANDNPVTGL
EEECCCCCHHHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCEEEEEEEECCCCCCCCCCC
NPDAPSLSGAAAAGSTASGWTDNGDGTWTAQISLGTTAGELEVIPKLNGQDAAANAAKVT
CCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCEEEEECCCCCCCCCCCEEEE
VVADALSSNQSKVSVAEDHVKAGESTTVTLIAKDAHGNAISGLSLSASLTGAASEGATVS
EEEECCCCCCCEEEEEHHHHHCCCCEEEEEEEECCCCCEECCEEEEEEECCCCCCCCEEE
GWTEKGDGSYVATLTTGGKTGELLVMPLFNGQPAATEAAQLTVIAGEMSSANSTLVADNK
CCCCCCCCCEEEEEECCCCCCCEEEEEECCCCCCCCCCEEEEEEEECCCCCCCEEEECCC
APTVKTTTKLTFTVKDAYGNLVTGLKPDAPQFSGAASTGTERPSTGDWTETSNGVYVATL
CCCEEEEEEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEE
TLGSAAGQLSVMPRVNGQNAVAQPLVLNVAGDASKAEIRDMTVKVDNQLANGQSTNQVTL
EECCCCCEEEEEECCCCCCCCCCCEEEEECCCCCCCEEEEEEEEECCEECCCCCCCEEEE
TVVDTYGNPLQGQNVTLTLPKGVTSKTGNTVTTDAAGKADIELMSTVAGEHSITASVNNA
EEEECCCCCCCCCEEEEECCCCCCCCCCCEEEECCCCCCCHHHHHHHCCCCEEEEEECCC
QKTVTVKFKADFSTGQASLEVDSAAPKVANGKDAFTLTATVEDKNGNPVPGSLVTFNLPR
CEEEEEEEEECCCCCCEEEEECCCCCCCCCCCCEEEEEEEEECCCCCCCCCCEEEEECCC
GVKPLTGDNVWVKANDEGKAELQVVSVTAGTYEITASAGNSQPSDTQTITFVADKATATV
CCCCCCCCEEEEEECCCCCEEEEEEEEECCEEEEEEECCCCCCCCCEEEEEEECCCEEEE
SGIEVIGNYALADGKAKQTYKVTVTDANNNLVKDSDVTLTASPASLNLEPNGTATTNEQG
EEEEEEECEEECCCCCCEEEEEEEEECCCCEEECCCEEEEECCCEEEECCCCCCCCCCCC
QAIFTATTTVAATYTLKAQVSQTNGQVSTKTAESKFVADDKNAVLTASSDMQSLVADGKS
CEEEEEEEEEEEEEEEEEEEECCCCEEEEEECCCEEEECCCCEEEEECHHHHHHHHCCCC
TAKLEVTLMSANNPVGGNMWVDIQTPEGVTEKDYQFLPSKNDHFVSGKITRKFSTSKPGV
CEEEEEEEEECCCCCCCCEEEEEECCCCCCCCCCEECCCCCCEEEEEEEEEEECCCCCCE
YTFTFNALTYGGYEMKPVTVTITAVDADTAKDEEAMK
EEEEEEEEEECCEEEEEEEEEEEEEECCCCCCHHHCC
>Mature Secondary Structure 
SRYKTDHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT
CCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEHHHHHHHHHHHHHHHHCCCCCCCCCE
TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV
EEEECCCCHHHHHHHHCCCCCEECCCCCCHHHHHHHHHHHHHHCHHHHHHHHHCCCCEEE
KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA
EEEECCCCCCCCCCCEEEEEEECCCCCEEEECCCEECCCCCCCCCCCEEEEEECCCCCCH
GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI
HHHHHHHCCCCCCCEEECCCHHHHHHHHEECCCCEEEECCCCCCCCCHHHHHCCCCCCEE
RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLLTLSAGHK
EECCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEEEEECCCCEEEEECCCC
QGKSGENDTRFGLEVNYRIGEPLEKQLDTDSIRERRMLAGSRYDLVERNNNIVLEYRKSE
CCCCCCCCCEEEEEEEEECCCCHHHHCCHHHHHHHHHHCCCCEEEEECCCCEEEEEECCC
VIRIALPERIEGKGGQTLSLGLVVSKATHGLKNVQWEAPSLLAEGGKITGQGSQWQVTLP
EEEEECCCCCCCCCCCEEEEEEEEECHHCCCCCCCCCCCHHHHCCCEEECCCCEEEEECC
AYRPGKDNYYAISAVAYDNKGNASKRVQTEVVITGAGMSADRTALTLDGQSRIQMLANGN
CCCCCCCCEEEEEEEEECCCCCCCCEEEEEEEEEECCCCCCCEEEEECCHHHEEEEECCC
EQRPLVLSLRDAEGQPVTGMKDQIKTELTFKPAGNIVTRTLKATKSQAKPTLGEFTETEA
CCCCEEEEEECCCCCCCCCCHHHHEEEEEECCCCCHHHHHHHHHHHHCCCCCCCCCCCHH
GVYQSVFTTGTQSGEATITVSVDGMSKTVTAELRATMMDVANSTLSANEPSGDVVADGQQ
HHHHHHHHCCCCCCCEEEEEEECCCCEEEHHHHHHHHHHHHCCCCCCCCCCCCEEECCCE
AYTLTLTAVDTDGNPVTGEASRLRFVPQDTNGVTIGTISEIKPGVYSATVSSTRAGNVVV
EEEEEEEEEECCCCCCCCCHHEEEEEECCCCCEEEEEHHHCCCCEEEEEECCCCCCCEEE
RAFSEQYQLGTLQQTLKFVAGPLDAAHSSITLNPDKPVVGGTVTAIWTAKDANDNPVTGL
EEECCCCCHHHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCEEEEEEEECCCCCCCCCCC
NPDAPSLSGAAAAGSTASGWTDNGDGTWTAQISLGTTAGELEVIPKLNGQDAAANAAKVT
CCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCEEEEECCCCCCCCCCCEEEE
VVADALSSNQSKVSVAEDHVKAGESTTVTLIAKDAHGNAISGLSLSASLTGAASEGATVS
EEEECCCCCCCEEEEEHHHHHCCCCEEEEEEEECCCCCEECCEEEEEEECCCCCCCCEEE
GWTEKGDGSYVATLTTGGKTGELLVMPLFNGQPAATEAAQLTVIAGEMSSANSTLVADNK
CCCCCCCCCEEEEEECCCCCCCEEEEEECCCCCCCCCCEEEEEEEECCCCCCCEEEECCC
APTVKTTTKLTFTVKDAYGNLVTGLKPDAPQFSGAASTGTERPSTGDWTETSNGVYVATL
CCCEEEEEEEEEEEECCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEE
TLGSAAGQLSVMPRVNGQNAVAQPLVLNVAGDASKAEIRDMTVKVDNQLANGQSTNQVTL
EECCCCCEEEEEECCCCCCCCCCCEEEEECCCCCCCEEEEEEEEECCEECCCCCCCEEEE
TVVDTYGNPLQGQNVTLTLPKGVTSKTGNTVTTDAAGKADIELMSTVAGEHSITASVNNA
EEEECCCCCCCCCEEEEECCCCCCCCCCCEEEECCCCCCCHHHHHHHCCCCEEEEEECCC
QKTVTVKFKADFSTGQASLEVDSAAPKVANGKDAFTLTATVEDKNGNPVPGSLVTFNLPR
CEEEEEEEEECCCCCCEEEEECCCCCCCCCCCCEEEEEEEEECCCCCCCCCCEEEEECCC
GVKPLTGDNVWVKANDEGKAELQVVSVTAGTYEITASAGNSQPSDTQTITFVADKATATV
CCCCCCCCEEEEEECCCCCEEEEEEEEECCEEEEEEECCCCCCCCCEEEEEEECCCEEEE
SGIEVIGNYALADGKAKQTYKVTVTDANNNLVKDSDVTLTASPASLNLEPNGTATTNEQG
EEEEEEECEEECCCCCCEEEEEEEEECCCCEEECCCEEEEECCCEEEECCCCCCCCCCCC
QAIFTATTTVAATYTLKAQVSQTNGQVSTKTAESKFVADDKNAVLTASSDMQSLVADGKS
CEEEEEEEEEEEEEEEEEEEECCCCEEEEEECCCEEEECCCCEEEEECHHHHHHHHCCCC
TAKLEVTLMSANNPVGGNMWVDIQTPEGVTEKDYQFLPSKNDHFVSGKITRKFSTSKPGV
CEEEEEEEEECCCCCCCCEEEEEECCCCCCCCCCEECCCCCCEEEEEEEEEEECCCCCCE
YTFTFNALTYGGYEMKPVTVTITAVDADTAKDEEAMK
EEEEEEEEEECCEEEEEEEEEEEEEECCCCCCHHHCC
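
Each secondary structure block alternates a line of residues with a line of per-residue states. The sketch below summarises such a block as state fractions. It assumes the usual convention that H is helix, E is strand and C is coil, which the record does not define, and the function name is hypothetical.

```python
from collections import Counter
from typing import Dict, List


def state_fractions(block_lines: List[str]) -> Dict[str, float]:
    """Fraction of each state in a block of alternating lines.

    Even-indexed lines hold residues, odd-indexed lines hold states.
    """
    residues = [line.strip() for line in block_lines[0::2]]
    states = [line.strip() for line in block_lines[1::2]]
    if len(residues) != len(states):
        raise ValueError("block must contain residue/state line pairs")
    for res, st in zip(residues, states):
        if len(res) != len(st):
            raise ValueError("residue and state lines differ in length")
    joined = "".join(states)
    if not joined:
        raise ValueError("no state annotations found")
    counts = Counter(joined)
    return {s: counts[s] / len(joined) for s in sorted(counts)}


# Toy block: four residues, two helix, one strand, one coil.
print(state_fractions(["AAAA", "HHEC"]))  # {'C': 0.25, 'E': 0.25, 'H': 0.5}
```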

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9278503 [H]