| Definition | Shigella flexneri 2a str. 2457T, complete genome. |
|---|---|
| Accession | NC_004741 |
| Length | 4,599,354 |
Click here to switch to the map view.
The map label for this gene is yehB
Identifier: 30063545
GI number: 30063545
Start: 2175483
End: 2177963
Strand: Reverse
Name: yehB
Synonym: S2296
Alternate gene names: 30063545
Gene position: 2177963-2175483 (Counterclockwise)
Preceding gene: 30063546
Following gene: 30063544
Centisome position: 47.35
GC content: 45.55
Gene sequence:
>2481_bases ATGTTGAGAATGACCCCACTTGCATCAGCAATCGTAGCGTTATTGCTCGGCATTGAAGCTTATGCAGCTGAAGAAACCTT TGATACCCATTTTATGATAGGTGGAATGAAAGACCAGCAGGTTGCAAATATTCGTCTTGATGATAATCAACCCTTACCGG GGCAGTATGACATCGATATTTATGTCAATAAGCAATGGCGCGGGAAATATGAGATTATTGTTAAAGACAACCCGCAAGAA ACATGTTTATCAAGAGAAGTTATCAAGCGGTTAGGCATTAATAGCGATAACTTCGCCAGCGGTAAGCAATGTTTAACATT TGAGCAACTTGTTCAGGGTGGGAGCTATAGCTGGGATATCGGGGTTTTTCGTCTCGATTTCAGTGTCCCGCAGGCTTGGG TGGAAGAACTGGAAAGTGGCTATGTTCCACCGGAAAACTGGGAGCGGGGTATTAATGCGTTTTATACTTCTTATTATGTG AGTCAGTATTACAGCGACTATAAAGCGTCGGGTAACAACAAGAGTACATATGTACGTTTTAACAGCGGGTTAAATTTACT GGAGTGGCAACTGCATTCTGATGCCAGTTTCAGTAAAACAAATAACAATCCAGGGGTGTGGAAAAGCAATACCCTGTATC TGGAACGTGGATTTGCCCAATTTCTCGGCACGCTTCGCGTGGGTGATATGTACACATCAAGCGATATTTTTGATTCTGTT CGCTTCAGCGGTGTGCGGTTGTTTCGTGATATGCAGATGTTGCCTAACTCGAAACAAAATTTTACGCCACGGGTGCAGGG GATTGCTCAGAGTAACGCGCTGGTAACTATTGAACAGAATGGTTTTGTGGTTTATCAGAAAGAGGTTCCTCCTGGCCCGT TTGCGATTACAGATTTGCAGTTGGCCGGTGGTGGAGCAGATCTTGATGTTAGCGTGAAAGAGGCGGACGGCTCGGTAACC ACCTATCTGGTGCCTTATGCAGCGGTGCCAAATATGCTGCAACCCGGCGTGTCGAAATATGATTTTGCGGCGGGCCGTAG CCATATTGAAGGGGCGAGCAAGCAAAGTGATTTTGTCCAGGCAGGTTATCAGTATGGTTTTAATAACTTATTGACGCTGT ATGGCGGCACGATGGTTGCTAATAATTACTATGCGTTTACCCTCGGAACGGGTTGGAACACGCGCATTGGCGCAATTTCC GTCGATGCCACGAAGTCGCATAGTAAACAAGACAACGGCGATGTGTTTGACGGGCAAAGTTATCAAATTGCCTACAACAA ATTTGTGAGCCAAACGTCGACGCGTTTTGGTCTGGCGGCCTGGCGTTATTCGTCGCGTGATTACCGGACATTTAACGATC ACGTTTGGGCAAACAATAAAGATAATTATCGCCGTGATGAAAACGATATCTATGACATTGCCGATTATTACCAGAACGAT TTTGGCCGTAAAAATAGCTTTTCTGCCAATATGAGCCAGTCATTGCCAGAAGGCTGGGGTTCTGTGTCCTTAAGTACGTT ATGGCGAGATTACTGGGGGCGTAGCGGCAGCAGTAAGGATTATCAGTTGAGTTATTCCAATAACTGGCGGCGGATAAGCT ATACCCTCGCGGCAAGCCAGGCGTATGGCGAGAATCATCATGAAGAGAAACGTTTTAATATTTTTATATCAATTCCCTGT GATTGGGGTGATGACGTTACGACGCCTCGTCGGCAAATATATATGTCTAACTCAACGACGTTTGATGATCAGGGTTTTGC CTCAAATAATACGGGATTATCAGGAACAGTAGGGAGTCGGGATCAGTTCAATTATGGTGTCAACCTGAGTCATCAACATC AGGGAAATGAAACGACAGCTGGGGCGAATTTGACCTGGAACGCGCCGGTTGCGACAGTGAATGGCAGTTATAGTCAGTCG AGTACTTATCGACAGACTGGAGCCAGTGTTTCAGGGGGCATTGTCGCCTGGTCGGGTGGCGTTAATCTGGCGAACCGTCT TTCCGAAACGTTTGCTGTGATGAATGCGCCAGGAATTAAAGATGCTTATGTCAATGGGCAAAAATATCGCACAACAAACC GTAATGGAGTGGTGGTATACGACGGAATGACACCTTATCGGGAAAATCACCTGATGCTGGATGTGTCGCAAAGCGATAGC GAAGCAGAATTACGTGGCAACCGGAAAATTGCCGCCCCTTATCGCGGCGCGGTTGTACTGGTTAATTTTGATACCGATCA GCGCAAGCCCTGGTTTATAAAAGCGTTAAGAGCGGATGGTCAACCATTAACGTTTGGTTATGAAGTCAATGATATCCATG GTCATAATATTGGCGTTGTCGGCCAGGGAAGCCAGTTATTTATTCGCACCAATGAAATACCGCCATCGGTTAATGTAGCA ATTGATAAGCAACAAGGACTTTCATGCACAATCACCTTCGGTAAAGAGATTGATGAAAGTAGAAATTATATTTGCCAGTA A
Upstream 100 bases:
>100_bases TGTTAATATCAAAAGTAATAATGCAAATAACTGGTATCTGACCATTATCAATGACCACGGCAACTATATTAGTGACAAAA TTTAATTGCAGGAGCTGCCT
Downstream 100 bases:
>100_bases TAGTCAGGTGGTTCTATGGAAATTCGCATAATGCTATTTATATTAATGATGATGGTTATGCCTGTGAGCTATGCGGCATG TTATAGTGAGTTATCTGTTC
Product: putative outer membrane protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 826; Mature: 826
Protein sequence:
>826_residues MLRMTPLASAIVALLLGIEAYAAEETFDTHFMIGGMKDQQVANIRLDDNQPLPGQYDIDIYVNKQWRGKYEIIVKDNPQE TCLSREVIKRLGINSDNFASGKQCLTFEQLVQGGSYSWDIGVFRLDFSVPQAWVEELESGYVPPENWERGINAFYTSYYV SQYYSDYKASGNNKSTYVRFNSGLNLLEWQLHSDASFSKTNNNPGVWKSNTLYLERGFAQFLGTLRVGDMYTSSDIFDSV RFSGVRLFRDMQMLPNSKQNFTPRVQGIAQSNALVTIEQNGFVVYQKEVPPGPFAITDLQLAGGGADLDVSVKEADGSVT TYLVPYAAVPNMLQPGVSKYDFAAGRSHIEGASKQSDFVQAGYQYGFNNLLTLYGGTMVANNYYAFTLGTGWNTRIGAIS VDATKSHSKQDNGDVFDGQSYQIAYNKFVSQTSTRFGLAAWRYSSRDYRTFNDHVWANNKDNYRRDENDIYDIADYYQND FGRKNSFSANMSQSLPEGWGSVSLSTLWRDYWGRSGSSKDYQLSYSNNWRRISYTLAASQAYGENHHEEKRFNIFISIPC DWGDDVTTPRRQIYMSNSTTFDDQGFASNNTGLSGTVGSRDQFNYGVNLSHQHQGNETTAGANLTWNAPVATVNGSYSQS STYRQTGASVSGGIVAWSGGVNLANRLSETFAVMNAPGIKDAYVNGQKYRTTNRNGVVVYDGMTPYRENHLMLDVSQSDS EAELRGNRKIAAPYRGAVVLVNFDTDQRKPWFIKALRADGQPLTFGYEVNDIHGHNIGVVGQGSQLFIRTNEIPPSVNVA IDKQQGLSCTITFGKEIDESRNYICQ
Sequences:
>Translated_826_residues MLRMTPLASAIVALLLGIEAYAAEETFDTHFMIGGMKDQQVANIRLDDNQPLPGQYDIDIYVNKQWRGKYEIIVKDNPQE TCLSREVIKRLGINSDNFASGKQCLTFEQLVQGGSYSWDIGVFRLDFSVPQAWVEELESGYVPPENWERGINAFYTSYYV SQYYSDYKASGNNKSTYVRFNSGLNLLEWQLHSDASFSKTNNNPGVWKSNTLYLERGFAQFLGTLRVGDMYTSSDIFDSV RFSGVRLFRDMQMLPNSKQNFTPRVQGIAQSNALVTIEQNGFVVYQKEVPPGPFAITDLQLAGGGADLDVSVKEADGSVT TYLVPYAAVPNMLQPGVSKYDFAAGRSHIEGASKQSDFVQAGYQYGFNNLLTLYGGTMVANNYYAFTLGTGWNTRIGAIS VDATKSHSKQDNGDVFDGQSYQIAYNKFVSQTSTRFGLAAWRYSSRDYRTFNDHVWANNKDNYRRDENDIYDIADYYQND FGRKNSFSANMSQSLPEGWGSVSLSTLWRDYWGRSGSSKDYQLSYSNNWRRISYTLAASQAYGENHHEEKRFNIFISIPC DWGDDVTTPRRQIYMSNSTTFDDQGFASNNTGLSGTVGSRDQFNYGVNLSHQHQGNETTAGANLTWNAPVATVNGSYSQS STYRQTGASVSGGIVAWSGGVNLANRLSETFAVMNAPGIKDAYVNGQKYRTTNRNGVVVYDGMTPYRENHLMLDVSQSDS EAELRGNRKIAAPYRGAVVLVNFDTDQRKPWFIKALRADGQPLTFGYEVNDIHGHNIGVVGQGSQLFIRTNEIPPSVNVA IDKQQGLSCTITFGKEIDESRNYICQ >Mature_826_residues MLRMTPLASAIVALLLGIEAYAAEETFDTHFMIGGMKDQQVANIRLDDNQPLPGQYDIDIYVNKQWRGKYEIIVKDNPQE TCLSREVIKRLGINSDNFASGKQCLTFEQLVQGGSYSWDIGVFRLDFSVPQAWVEELESGYVPPENWERGINAFYTSYYV SQYYSDYKASGNNKSTYVRFNSGLNLLEWQLHSDASFSKTNNNPGVWKSNTLYLERGFAQFLGTLRVGDMYTSSDIFDSV RFSGVRLFRDMQMLPNSKQNFTPRVQGIAQSNALVTIEQNGFVVYQKEVPPGPFAITDLQLAGGGADLDVSVKEADGSVT TYLVPYAAVPNMLQPGVSKYDFAAGRSHIEGASKQSDFVQAGYQYGFNNLLTLYGGTMVANNYYAFTLGTGWNTRIGAIS VDATKSHSKQDNGDVFDGQSYQIAYNKFVSQTSTRFGLAAWRYSSRDYRTFNDHVWANNKDNYRRDENDIYDIADYYQND FGRKNSFSANMSQSLPEGWGSVSLSTLWRDYWGRSGSSKDYQLSYSNNWRRISYTLAASQAYGENHHEEKRFNIFISIPC DWGDDVTTPRRQIYMSNSTTFDDQGFASNNTGLSGTVGSRDQFNYGVNLSHQHQGNETTAGANLTWNAPVATVNGSYSQS STYRQTGASVSGGIVAWSGGVNLANRLSETFAVMNAPGIKDAYVNGQKYRTTNRNGVVVYDGMTPYRENHLMLDVSQSDS EAELRGNRKIAAPYRGAVVLVNFDTDQRKPWFIKALRADGQPLTFGYEVNDIHGHNIGVVGQGSQLFIRTNEIPPSVNVA IDKQQGLSCTITFGKEIDESRNYICQ
Specific function: Involved in the export and assembly of a fimbrial subunit across the outer membrane [H]
COG id: COG3188
COG function: function code NU; P pilus assembly protein, porin PapC
Gene ontology:
Cell location: Cell outer membrane; Multi-pass membrane protein [H]
Metaboloic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the fimbrial export usher family [H]
Homologues:
Organism=Escherichia coli, GI1788427, Length=826, Percent_Identity=97.9418886198547, Blast_Score=1680, Evalue=0.0, Organism=Escherichia coli, GI1786332, Length=842, Percent_Identity=32.0665083135392, Blast_Score=400, Evalue=1e-112, Organism=Escherichia coli, GI1790772, Length=858, Percent_Identity=29.2540792540793, Blast_Score=357, Evalue=1e-99, Organism=Escherichia coli, GI1787172, Length=774, Percent_Identity=31.7829457364341, Blast_Score=346, Evalue=3e-96, Organism=Escherichia coli, GI1786744, Length=827, Percent_Identity=29.9879081015719, Blast_Score=320, Evalue=2e-88, Organism=Escherichia coli, GI1789533, Length=790, Percent_Identity=27.3417721518987, Blast_Score=293, Evalue=3e-80, Organism=Escherichia coli, GI87081778, Length=570, Percent_Identity=27.719298245614, Blast_Score=196, Evalue=7e-51, Organism=Escherichia coli, GI1789610, Length=838, Percent_Identity=24.1050119331742, Blast_Score=126, Evalue=6e-30,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000015 - InterPro: IPR018030 [H]
Pfam domain/function: PF00577 Usher [H]
EC number: NA
Molecular weight: Translated: 92409; Mature: 92409
Theoretical pI: Translated: 5.69; Mature: 5.69
Prosite motif: PS01151 FIMBRIAL_USHER
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.6 %Cys (Translated Protein) 1.7 %Met (Translated Protein) 2.3 %Cys+Met (Translated Protein) 0.6 %Cys (Mature Protein) 1.7 %Met (Mature Protein) 2.3 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MLRMTPLASAIVALLLGIEAYAAEETFDTHFMIGGMKDQQVANIRLDDNQPLPGQYDIDI CCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCEEEEEEECCCCCCCCEEEEEE YVNKQWRGKYEIIVKDNPQETCLSREVIKRLGINSDNFASGKQCLTFEQLVQGGSYSWDI EECCCCCCEEEEEEECCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHCCCCCEEEE GVFRLDFSVPQAWVEELESGYVPPENWERGINAFYTSYYVSQYYSDYKASGNNKSTYVRF EEEEEECCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEE NSGLNLLEWQLHSDASFSKTNNNPGVWKSNTLYLERGFAQFLGTLRVGDMYTSSDIFDSV CCCCCEEEEEECCCCCCCCCCCCCCCCCCCEEEEECCHHHHHHHEEECCEECCHHHHHHH RFSGVRLFRDMQMLPNSKQNFTPRVQGIAQSNALVTIEQNGFVVYQKEVPPGPFAITDLQ HHHHHHHHHHHHHCCCCCCCCCCHHHEEECCCEEEEEECCCEEEEECCCCCCCEEEEEEE LAGGGADLDVSVKEADGSVTTYLVPYAAVPNMLQPGVSKYDFAAGRSHIEGASKQSDFVQ EECCCCCEEEEEEECCCCEEEEEEEHHHCCHHHCCCCCCHHHHCCCHHHCCCCCCHHHHH AGYQYGFNNLLTLYGGTMVANNYYAFTLGTGWNTRIGAISVDATKSHSKQDNGDVFDGQS HHHHHCCCCEEEEECCEEEECCEEEEEEECCCCCEEEEEEECCCCCCCCCCCCCEECCCC YQIAYNKFVSQTSTRFGLAAWRYSSRDYRTFNDHVWANNKDNYRRDENDIYDIADYYQND EEEHHHHHHHHCCCCCEEEEEEECCCCCCCCCCEEECCCCCCCCCCCCCHHHHHHHHHHH FGRKNSFSANMSQSLPEGWGSVSLSTLWRDYWGRSGSSKDYQLSYSNNWRRISYTLAASQ CCCCCCCCCCHHHHCCCCCCCEEHHHHHHHHHCCCCCCCCEEEEECCCCEEEEEEEEEHH AYGENHHEEKRFNIFISIPCDWGDDVTTPRRQIYMSNSTTFDDQGFASNNTGLSGTVGSR HCCCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCC DQFNYGVNLSHQHQGNETTAGANLTWNAPVATVNGSYSQSSTYRQTGASVSGGIVAWSGG CCCCCCCCCCCCCCCCCCCCCCEEEECCCEEEECCCCCCCCHHHHCCCCCCCCEEEEECC VNLANRLSETFAVMNAPGIKDAYVNGQKYRTTNRNGVVVYDGMTPYRENHLMLDVSQSDS CHHHHHHHHHHHHHCCCCCCHHEECCCEEEECCCCCEEEECCCCCCCCCEEEEEECCCCC EAELRGNRKIAAPYRGAVVLVNFDTDQRKPWFIKALRADGQPLTFGYEVNDIHGHNIGVV CHHHCCCCEEECCCCCEEEEEECCCCCCCCEEEEEEECCCCEEEEEEEECCCCCCEEEEE GQGSQLFIRTNEIPPSVNVAIDKQQGLSCTITFGKEIDESRNYICQ ECCCEEEEEECCCCCCEEEEEECCCCCEEEEEECCCCCCCCCCCCC >Mature Secondary Structure MLRMTPLASAIVALLLGIEAYAAEETFDTHFMIGGMKDQQVANIRLDDNQPLPGQYDIDI CCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCEEEEEEECCCCCCCCEEEEEE YVNKQWRGKYEIIVKDNPQETCLSREVIKRLGINSDNFASGKQCLTFEQLVQGGSYSWDI EECCCCCCEEEEEEECCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHCCCCCEEEE GVFRLDFSVPQAWVEELESGYVPPENWERGINAFYTSYYVSQYYSDYKASGNNKSTYVRF EEEEEECCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEE NSGLNLLEWQLHSDASFSKTNNNPGVWKSNTLYLERGFAQFLGTLRVGDMYTSSDIFDSV CCCCCEEEEEECCCCCCCCCCCCCCCCCCCEEEEECCHHHHHHHEEECCEECCHHHHHHH RFSGVRLFRDMQMLPNSKQNFTPRVQGIAQSNALVTIEQNGFVVYQKEVPPGPFAITDLQ HHHHHHHHHHHHHCCCCCCCCCCHHHEEECCCEEEEEECCCEEEEECCCCCCCEEEEEEE LAGGGADLDVSVKEADGSVTTYLVPYAAVPNMLQPGVSKYDFAAGRSHIEGASKQSDFVQ EECCCCCEEEEEEECCCCEEEEEEEHHHCCHHHCCCCCCHHHHCCCHHHCCCCCCHHHHH AGYQYGFNNLLTLYGGTMVANNYYAFTLGTGWNTRIGAISVDATKSHSKQDNGDVFDGQS HHHHHCCCCEEEEECCEEEECCEEEEEEECCCCCEEEEEEECCCCCCCCCCCCCEECCCC YQIAYNKFVSQTSTRFGLAAWRYSSRDYRTFNDHVWANNKDNYRRDENDIYDIADYYQND EEEHHHHHHHHCCCCCEEEEEEECCCCCCCCCCEEECCCCCCCCCCCCCHHHHHHHHHHH FGRKNSFSANMSQSLPEGWGSVSLSTLWRDYWGRSGSSKDYQLSYSNNWRRISYTLAASQ CCCCCCCCCCHHHHCCCCCCCEEHHHHHHHHHCCCCCCCCEEEEECCCCEEEEEEEEEHH AYGENHHEEKRFNIFISIPCDWGDDVTTPRRQIYMSNSTTFDDQGFASNNTGLSGTVGSR HCCCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCC DQFNYGVNLSHQHQGNETTAGANLTWNAPVATVNGSYSQSSTYRQTGASVSGGIVAWSGG CCCCCCCCCCCCCCCCCCCCCCEEEECCCEEEECCCCCCCCHHHHCCCCCCCCEEEEECC VNLANRLSETFAVMNAPGIKDAYVNGQKYRTTNRNGVVVYDGMTPYRENHLMLDVSQSDS CHHHHHHHHHHHHHCCCCCCHHEECCCEEEECCCCCEEEECCCCCCCCCEEEEEECCCCC EAELRGNRKIAAPYRGAVVLVNFDTDQRKPWFIKALRADGQPLTFGYEVNDIHGHNIGVV CHHHCCCCEEECCCCCEEEEEECCCCCCCCEEEEEEECCCCEEEEEEEECCCCCCEEEEE GQGSQLFIRTNEIPPSVNVAIDKQQGLSCTITFGKEIDESRNYICQ ECCCEEEEEECCCCCCEEEEEECCCCCEEEEEECCCCCCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9278503; 9097040 [H]