Definition | Shigella boydii Sb227, complete genome. |
---|---|
Accession | NC_007613 |
Length | 4,519,823 |
Click here to switch to the map view.
The map label for this gene is yfcU [H]
Identifier: 82544820
GI number: 82544820
Start: 2362690
End: 2365332
Strand: Reverse
Name: yfcU [H]
Synonym: SBO_2376
Alternate gene names: 82544820
Gene position: 2365332-2362690 (Counterclockwise)
Preceding gene: 82544821
Following gene: 82544819
Centisome position: 52.33
GC content: 51.15
Gene sequence:
>2643_bases ATGCCTGACCATTCTCTTTTTCGATTACGGATTCTTCCGTGGTGCATTGCGCTGGCAATGTCAGGGAGTTATAGCAGTGT CTGGGCTGAAGACGACATTCAGTTTGATTCCCGTTTTCTGGAATTAAAAGGCGACACAAAAATTGATCTGAAGCGTTTTT CCAGCCAGGGATATGTTGAGCCCGGAAAATACAATTTACAGGTTCAACTAAATAAACAGCCATTGGCGGAAGAGTACGAT ATTTACTGGTATGCTGGTGAAGATGACGCGAGCAAAAGCTATGCTTGTCTGACACCGGAACTGGTAGCGCAGTTTGGTTT AAAAGAAGACGTGGCGAAAAATCTGCAATGGAGCCACGATGCTAAATGCCTGAAATCCGGTCAACTGGAAGGCATGGAAA TTAAGGCTGATTTAAGCCAGTCCGCATTAGTCATTTCACTGCCACAGGCTTACCTCGAATATACTTATCCCGACTGGGAT CCGCCTTCACGTTGGGATGACGGCATCTCCGGGATCGTCGCGGACTACAGCATCAACGCACAAACCCGGCACGAAGAAAA TGGCGGTGATGATAGTAACGAGATCAGCGGCAACGGGACGGTCGGGGTTAACCTGGGGCCGTGGCGTATGCGTGCTGACT GGCAGACTAACTATCAACATACTCGCAGTAATGATGACGATGAATTCAGCGGCGATGAAACTCAAAAAAAATGGGAGTGG AGTCGCTACTATGCCTGGCGGGCGTTACCATCATTAAAAGCCAAACTGGCGCTGGGCGAGGATTACCTCAGATCCGATAT TTTTGATGGTTTTAACTATGTTGGTGGTAGTGTCAGTACTGACGATCAAATGTTGCCTCCCAATCTGCGCGGCTACGCGC CAGACATTTCCGGCGTGGCACACACCACAGCAAAAGTGACCGTCAGCCAGATGGGGCGTGTGATTTACGAAACGCAGGTT CCGGCTGGACCGTTTCGTATTCAGGATCTTGGTGATTCTGTCTCCGGTACGTTGCATATTCGCATTGAAGAACAGAACGG CCAGGTGCAGGAATATGACATCAGCACCGCCTCGATGCCATACCTCACTCGTCCAGGTCAGATTCGCTATAAGATCATGA TGGGCCGTCCGCAAGAGTGGGGACACCATGTCGAGGGTGAATTTTTTTCTGGTGCTGAAGCTTCCTGGGGGATCGCTAAC GGCTGGTCGTTATATGGCGGCGCACTGGGAGATGAAAACTATCAGTCTGCGGCGCTTGGCGTCGGTCGCGATTTGTCTAC ATTCGGCGCGGTCGCGTTTGATGTTACTCACTCGCACACCAAACTGGATAAAGACACCGCTTATGGCAAAGGTTCGCTGG ACGGTAACTCCTTCCGTGTGAGTTATTCCAAAGACTTTGACCAGCTCAACAGCCGCGTTACTTTTGCTGGATATCGCTTC TCGGAAGAGAACTTTATGACCATGAGCGAGTATCTGGATGCCAGTGACAGCGGAATGGTACGCACGGGCAACGACAAAGA GATGTGCACCGCCACTTATAACCAGAACTTCCGCGATGCGGGTGTTTCGGTTTATCTCAACTATACCCGCCATACCTACT GGGATCGCGAGGAGCAGATAAACTACAACATCATGCTCTCGCACTATTTCAATATGGGCAGTATTCGTAATGTCAGCATC TCGATGACTGGCTACCGTTACGAGTATGACAACCAGGCCGACAAAGGCATGTACATTTCGCTCAGTATGCCGTGGGGCGA CAACAGTACCGTTAGCTATAACGGTAACTATGGCAGTGGGACGGACAGCAGTCAGGTCGGTTATTTCAGCCGTGTCGATG ACGCGACTCACTATCAGTTGAACGTCGGCACCAGTGACAAACACACCAGCGTTGACGGCTATTACAGCCATGATGGTTCG CTGGCGCAGGTTGACCTCAGTGCGAACTACCATGAAGGGCAATACACCTCTGCGGGCTTGTCGTTACAGGGCGGCGCAAC GCTTACTGCCCACGGTGGCGCACTTCACCGTACCCAGAATATGGGCGGGACACGCTTGTTGATTGATGCCGATGGCGTTG CCGATGTTCCGGTGGAAGGTAACGGGGCTGCTGTTTATACCAATATGTTTGGTAAAGCCGTCGTTTCTGACGTCAATAAC TATTACCGCAATCAGGCGTATATCGACCTCAACAGATTGCCTGAAAACGCTGAAGCAACCCAGTCGGTGGTGCAAGCCAC GCTAACTGAAGGTGCCATTGGCTACCGTAAATTTGCTGTCATTAGCGGACAAAAAGCGATGGCGGTGCTGCGCCTGAGCG ACGGCAGCCATCCTCCGTTTGGCGCAGAAGTAAAAAATGATAACGAGCAGACAGTGGGCCTTGTCGATGATGACGGCAAT GTTTATCTGGCTGGGGTGAAACCTGGCGAACACATGAGTGTGTTCTGGAGTGGTGTTGCGCATTGCGATATCAACCTGCC GGACCCGCTGCCTGCCGATCTGTTTAACGGCTTGTTACTACCATGCCAGCATAAAGGCAATGTAGCACCTATCACTTCGC CGGCGGTCAAACCGGCGATTCAGGAACAGACACAGCGGGTGACGCCAACGGAACCTCCGACTTCAATTTCAGTAAACCAG TAA
Upstream 100 bases:
>100_bases CCAGATTACTTACCTGTAAGGCATACCCCCATAGCGACACTGCTATGGGGGATTTAAATGGATTTTGACCGAACAAGAAT AAAAACGGATAGAAACGTGT
Downstream 100 bases:
>100_bases CGTGATTAAGGAATGATCCATGTTTAATCTGACTAACACTGCAAAAATCGTTGTCCCGGCACTGGCGCTGCTGGCGACAG CGGTCAGTTTCTCCAGCCAC
Product: PapC-like porin protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 880; Mature: 879
Protein sequence:
>880_residues MPDHSLFRLRILPWCIALAMSGSYSSVWAEDDIQFDSRFLELKGDTKIDLKRFSSQGYVEPGKYNLQVQLNKQPLAEEYD IYWYAGEDDASKSYACLTPELVAQFGLKEDVAKNLQWSHDAKCLKSGQLEGMEIKADLSQSALVISLPQAYLEYTYPDWD PPSRWDDGISGIVADYSINAQTRHEENGGDDSNEISGNGTVGVNLGPWRMRADWQTNYQHTRSNDDDEFSGDETQKKWEW SRYYAWRALPSLKAKLALGEDYLRSDIFDGFNYVGGSVSTDDQMLPPNLRGYAPDISGVAHTTAKVTVSQMGRVIYETQV PAGPFRIQDLGDSVSGTLHIRIEEQNGQVQEYDISTASMPYLTRPGQIRYKIMMGRPQEWGHHVEGEFFSGAEASWGIAN GWSLYGGALGDENYQSAALGVGRDLSTFGAVAFDVTHSHTKLDKDTAYGKGSLDGNSFRVSYSKDFDQLNSRVTFAGYRF SEENFMTMSEYLDASDSGMVRTGNDKEMCTATYNQNFRDAGVSVYLNYTRHTYWDREEQINYNIMLSHYFNMGSIRNVSI SMTGYRYEYDNQADKGMYISLSMPWGDNSTVSYNGNYGSGTDSSQVGYFSRVDDATHYQLNVGTSDKHTSVDGYYSHDGS LAQVDLSANYHEGQYTSAGLSLQGGATLTAHGGALHRTQNMGGTRLLIDADGVADVPVEGNGAAVYTNMFGKAVVSDVNN YYRNQAYIDLNRLPENAEATQSVVQATLTEGAIGYRKFAVISGQKAMAVLRLSDGSHPPFGAEVKNDNEQTVGLVDDDGN VYLAGVKPGEHMSVFWSGVAHCDINLPDPLPADLFNGLLLPCQHKGNVAPITSPAVKPAIQEQTQRVTPTEPPTSISVNQ
Sequences:
>Translated_880_residues MPDHSLFRLRILPWCIALAMSGSYSSVWAEDDIQFDSRFLELKGDTKIDLKRFSSQGYVEPGKYNLQVQLNKQPLAEEYD IYWYAGEDDASKSYACLTPELVAQFGLKEDVAKNLQWSHDAKCLKSGQLEGMEIKADLSQSALVISLPQAYLEYTYPDWD PPSRWDDGISGIVADYSINAQTRHEENGGDDSNEISGNGTVGVNLGPWRMRADWQTNYQHTRSNDDDEFSGDETQKKWEW SRYYAWRALPSLKAKLALGEDYLRSDIFDGFNYVGGSVSTDDQMLPPNLRGYAPDISGVAHTTAKVTVSQMGRVIYETQV PAGPFRIQDLGDSVSGTLHIRIEEQNGQVQEYDISTASMPYLTRPGQIRYKIMMGRPQEWGHHVEGEFFSGAEASWGIAN GWSLYGGALGDENYQSAALGVGRDLSTFGAVAFDVTHSHTKLDKDTAYGKGSLDGNSFRVSYSKDFDQLNSRVTFAGYRF SEENFMTMSEYLDASDSGMVRTGNDKEMCTATYNQNFRDAGVSVYLNYTRHTYWDREEQINYNIMLSHYFNMGSIRNVSI SMTGYRYEYDNQADKGMYISLSMPWGDNSTVSYNGNYGSGTDSSQVGYFSRVDDATHYQLNVGTSDKHTSVDGYYSHDGS LAQVDLSANYHEGQYTSAGLSLQGGATLTAHGGALHRTQNMGGTRLLIDADGVADVPVEGNGAAVYTNMFGKAVVSDVNN YYRNQAYIDLNRLPENAEATQSVVQATLTEGAIGYRKFAVISGQKAMAVLRLSDGSHPPFGAEVKNDNEQTVGLVDDDGN VYLAGVKPGEHMSVFWSGVAHCDINLPDPLPADLFNGLLLPCQHKGNVAPITSPAVKPAIQEQTQRVTPTEPPTSISVNQ >Mature_879_residues PDHSLFRLRILPWCIALAMSGSYSSVWAEDDIQFDSRFLELKGDTKIDLKRFSSQGYVEPGKYNLQVQLNKQPLAEEYDI YWYAGEDDASKSYACLTPELVAQFGLKEDVAKNLQWSHDAKCLKSGQLEGMEIKADLSQSALVISLPQAYLEYTYPDWDP PSRWDDGISGIVADYSINAQTRHEENGGDDSNEISGNGTVGVNLGPWRMRADWQTNYQHTRSNDDDEFSGDETQKKWEWS RYYAWRALPSLKAKLALGEDYLRSDIFDGFNYVGGSVSTDDQMLPPNLRGYAPDISGVAHTTAKVTVSQMGRVIYETQVP AGPFRIQDLGDSVSGTLHIRIEEQNGQVQEYDISTASMPYLTRPGQIRYKIMMGRPQEWGHHVEGEFFSGAEASWGIANG WSLYGGALGDENYQSAALGVGRDLSTFGAVAFDVTHSHTKLDKDTAYGKGSLDGNSFRVSYSKDFDQLNSRVTFAGYRFS EENFMTMSEYLDASDSGMVRTGNDKEMCTATYNQNFRDAGVSVYLNYTRHTYWDREEQINYNIMLSHYFNMGSIRNVSIS MTGYRYEYDNQADKGMYISLSMPWGDNSTVSYNGNYGSGTDSSQVGYFSRVDDATHYQLNVGTSDKHTSVDGYYSHDGSL AQVDLSANYHEGQYTSAGLSLQGGATLTAHGGALHRTQNMGGTRLLIDADGVADVPVEGNGAAVYTNMFGKAVVSDVNNY YRNQAYIDLNRLPENAEATQSVVQATLTEGAIGYRKFAVISGQKAMAVLRLSDGSHPPFGAEVKNDNEQTVGLVDDDGNV YLAGVKPGEHMSVFWSGVAHCDINLPDPLPADLFNGLLLPCQHKGNVAPITSPAVKPAIQEQTQRVTPTEPPTSISVNQ
Specific function: Involved in the export and assembly of a fimbrial subunit across the outer membrane [H]
COG id: NA
COG function: NA
Gene ontology:
Cell location: Cell outer membrane; Multi-pass membrane protein [H]
Metaboloic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the fimbrial export usher family [H]
Homologues:
Organism=Escherichia coli, GI87081778, Length=849, Percent_Identity=34.8645465253239, Blast_Score=495, Evalue=1e-141, Organism=Escherichia coli, GI1790772, Length=889, Percent_Identity=26.6591676040495, Blast_Score=282, Evalue=5e-77, Organism=Escherichia coli, GI1787172, Length=870, Percent_Identity=28.0459770114943, Blast_Score=275, Evalue=7e-75, Organism=Escherichia coli, GI1789533, Length=862, Percent_Identity=27.3781902552204, Blast_Score=273, Evalue=3e-74, Organism=Escherichia coli, GI1786332, Length=860, Percent_Identity=28.4883720930233, Blast_Score=265, Evalue=7e-72, Organism=Escherichia coli, GI1786744, Length=867, Percent_Identity=26.0668973471742, Blast_Score=248, Evalue=1e-66, Organism=Escherichia coli, GI1788427, Length=773, Percent_Identity=24.8382923673997, Blast_Score=211, Evalue=2e-55, Organism=Escherichia coli, GI1789610, Length=852, Percent_Identity=22.4178403755869, Blast_Score=121, Evalue=2e-28,
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR000015 - InterPro: IPR018030 [H]
Pfam domain/function: PF00577 Usher [H]
EC number: NA
Molecular weight: Translated: 97342; Mature: 97211
Theoretical pI: Translated: 4.57; Mature: 4.57
Prosite motif: PS00133 CARBOXYPEPT_ZN_2 ; PS01151 FIMBRIAL_USHER
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.7 %Cys (Translated Protein) 2.5 %Met (Translated Protein) 3.2 %Cys+Met (Translated Protein) 0.7 %Cys (Mature Protein) 2.4 %Met (Mature Protein) 3.1 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MPDHSLFRLRILPWCIALAMSGSYSSVWAEDDIQFDSRFLELKGDTKIDLKRFSSQGYVE CCCCCEEEHHHHHHHHHHHHCCCCCEEECCCCCCCCCEEEEECCCCEEEHEECCCCCCCC PGKYNLQVQLNKQPLAEEYDIYWYAGEDDASKSYACLTPELVAQFGLKEDVAKNLQWSHD CCCEEEEEEECCCCCCCCCEEEEEECCCCCCCCEEEECHHHHHHCCCHHHHHHCCCCCCC AKCLKSGQLEGMEIKADLSQSALVISLPQAYLEYTYPDWDPPSRWDDGISGIVADYSINA HHHHHCCCCCCEEEEECCCCCEEEEEECHHHHEECCCCCCCCCCCCCCCCCEEEEEEECC QTRHEENGGDDSNEISGNGTVGVNLGPWRMRADWQTNYQHTRSNDDDEFSGDETQKKWEW CCCCCCCCCCCCCCCCCCCEEEEECCCEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHH SRYYAWRALPSLKAKLALGEDYLRSDIFDGFNYVGGSVSTDDQMLPPNLRGYAPDISGVA HHEEEECCCCCHHEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC HTTAKVTVSQMGRVIYETQVPAGPFRIQDLGDSVSGTLHIRIEEQNGQVQEYDISTASMP HHHHEEEHHHCCCEEEEECCCCCCEEEEECCCCCCCEEEEEEECCCCCEEEEECCCCCCC YLTRPGQIRYKIMMGRPQEWGHHVEGEFFSGAEASWGIANGWSLYGGALGDENYQSAALG EEECCCCEEEEEEECCCHHHCCCCCCCEECCCCCCCCCCCCCEEECCCCCCCCCCHHHHC VGRDLSTFGAVAFDVTHSHTKLDKDTAYGKGSLDGNSFRVSYSKDFDQLNSRVTFAGYRF CCCCHHHHCEEEEEEECCCCCCCCCCCCCCCCCCCCEEEEEECCCHHHHCCEEEEEEEEE SEENFMTMSEYLDASDSGMVRTGNDKEMCTATYNQNFRDAGVSVYLNYTRHTYWDREEQI CCCCCEEHHHHCCCCCCCEEEECCCCCEEEEECCCCCCCCCEEEEEEECCCCCCCHHHCC NYNIMLSHYFNMGSIRNVSISMTGYRYEYDNQADKGMYISLSMPWGDNSTVSYNGNYGSG CEEEEEEEEECCCCEEEEEEEEEEEEEECCCCCCCCEEEEEECCCCCCCEEEECCCCCCC TDSSQVGYFSRVDDATHYQLNVGTSDKHTSVDGYYSHDGSLAQVDLSANYHEGQYTSAGL CCCCCCCEEEECCCCEEEEEECCCCCCCCCCCCEECCCCCEEEEEECCCCCCCCEEECCE SLQGGATLTAHGGALHRTQNMGGTRLLIDADGVADVPVEGNGAAVYTNMFGKAVVSDVNN EECCCEEEEECCCEEEEECCCCCEEEEEECCCCEECCCCCCCCEEEECHHHHHHHHHHHH YYRNQAYIDLNRLPENAEATQSVVQATLTEGAIGYRKFAVISGQKAMAVLRLSDGSHPPF HHCCEEEEECCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEEECCCEEEEEEECCCCCCCC GAEVKNDNEQTVGLVDDDGNVYLAGVKPGEHMSVFWSGVAHCDINLPDPLPADLFNGLLL CCEECCCCCEEEEEEECCCCEEEEECCCCCCEEEEECCEEEEECCCCCCCCHHHHCCCEE PCQHKGNVAPITSPAVKPAIQEQTQRVTPTEPPTSISVNQ EECCCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCEEECCC >Mature Secondary Structure PDHSLFRLRILPWCIALAMSGSYSSVWAEDDIQFDSRFLELKGDTKIDLKRFSSQGYVE CCCCEEEHHHHHHHHHHHHCCCCCEEECCCCCCCCCEEEEECCCCEEEHEECCCCCCCC PGKYNLQVQLNKQPLAEEYDIYWYAGEDDASKSYACLTPELVAQFGLKEDVAKNLQWSHD CCCEEEEEEECCCCCCCCCEEEEEECCCCCCCCEEEECHHHHHHCCCHHHHHHCCCCCCC AKCLKSGQLEGMEIKADLSQSALVISLPQAYLEYTYPDWDPPSRWDDGISGIVADYSINA HHHHHCCCCCCEEEEECCCCCEEEEEECHHHHEECCCCCCCCCCCCCCCCCEEEEEEECC QTRHEENGGDDSNEISGNGTVGVNLGPWRMRADWQTNYQHTRSNDDDEFSGDETQKKWEW CCCCCCCCCCCCCCCCCCCEEEEECCCEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHH SRYYAWRALPSLKAKLALGEDYLRSDIFDGFNYVGGSVSTDDQMLPPNLRGYAPDISGVA HHEEEECCCCCHHEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC HTTAKVTVSQMGRVIYETQVPAGPFRIQDLGDSVSGTLHIRIEEQNGQVQEYDISTASMP HHHHEEEHHHCCCEEEEECCCCCCEEEEECCCCCCCEEEEEEECCCCCEEEEECCCCCCC YLTRPGQIRYKIMMGRPQEWGHHVEGEFFSGAEASWGIANGWSLYGGALGDENYQSAALG EEECCCCEEEEEEECCCHHHCCCCCCCEECCCCCCCCCCCCCEEECCCCCCCCCCHHHHC VGRDLSTFGAVAFDVTHSHTKLDKDTAYGKGSLDGNSFRVSYSKDFDQLNSRVTFAGYRF CCCCHHHHCEEEEEEECCCCCCCCCCCCCCCCCCCCEEEEEECCCHHHHCCEEEEEEEEE SEENFMTMSEYLDASDSGMVRTGNDKEMCTATYNQNFRDAGVSVYLNYTRHTYWDREEQI CCCCCEEHHHHCCCCCCCEEEECCCCCEEEEECCCCCCCCCEEEEEEECCCCCCCHHHCC NYNIMLSHYFNMGSIRNVSISMTGYRYEYDNQADKGMYISLSMPWGDNSTVSYNGNYGSG CEEEEEEEEECCCCEEEEEEEEEEEEEECCCCCCCCEEEEEECCCCCCCEEEECCCCCCC TDSSQVGYFSRVDDATHYQLNVGTSDKHTSVDGYYSHDGSLAQVDLSANYHEGQYTSAGL CCCCCCCEEEECCCCEEEEEECCCCCCCCCCCCEECCCCCEEEEEECCCCCCCCEEECCE SLQGGATLTAHGGALHRTQNMGGTRLLIDADGVADVPVEGNGAAVYTNMFGKAVVSDVNN EECCCEEEEECCCEEEEECCCCCEEEEEECCCCEECCCCCCCCEEEECHHHHHHHHHHHH YYRNQAYIDLNRLPENAEATQSVVQATLTEGAIGYRKFAVISGQKAMAVLRLSDGSHPPF HHCCEEEEECCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEEECCCEEEEEEECCCCCCCC GAEVKNDNEQTVGLVDDDGNVYLAGVKPGEHMSVFWSGVAHCDINLPDPLPADLFNGLLL CCEECCCCCEEEEEEECCCCEEEEECCCCCCEEEEECCEEEEECCCCCCCCHHHHCCCEE PCQHKGNVAPITSPAVKPAIQEQTQRVTPTEPPTSISVNQ EECCCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCEEECCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9205837; 9278503 [H]