| Definition | Escherichia coli O157:H7 str. EC4115, complete genome. |
|---|---|
| Accession | NC_011353 |
| Length | 5,572,075 |
The map label for this gene is ssaV [H]
Identifier: 209395993
GI number: 209395993
Start: 4710122
End: 4712149
Strand: Reverse
Name: ssaV [H]
Synonym: ECH74115_5063
Alternate gene names: 209395993
Gene position: 4712149-4710122 (Counterclockwise)
Preceding gene: 209397924
Following gene: 209396453
Centisome position: 84.57
GC content: 40.53
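The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates. It also assumes the centisome position is the gene's first transcribed base expressed as a percentage of the genome length (here the End coordinate, because the gene lies on the reverse strand). The function names are illustrative and not part of the source database.

```python
GENOME_LENGTH = 5_572_075          # Length field for NC_011353
START, END = 4_710_122, 4_712_149  # Start and End fields of this gene

def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1

def centisome(position: int, genome_length: int) -> float:
    """Position on the chromosome as a percentage of its length (assumed definition)."""
    return round(100 * position / genome_length, 2)

length = gene_length(START, END)
print(length)                         # 2028
print(length // 3 - 1)                # 675 codons once the stop codon is excluded
print(centisome(END, GENOME_LENGTH))  # 84.57
```

Under these assumptions the results agree with the 2028-base gene sequence, the 675-residue product, and the listed centisome position.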
Gene sequence:
>2028_bases ATGAATAAACTCTTAAATATATTTAAAAAAGCAGAGTCATATCACGATCTTATTCTGGCTCTCTTCTTCTTTATGGCTGT AATGATGATGATTATTCCACTACCAACAGTTGTGGTGGATATCATTATCGCGATTAATATTTCGACAGCTTTACTTTTAT TAATGCTCTCAATCTATATAAAAAATCCTCTTGAACTAACTTCTTTCCCCACAATCCTGTTGATTACGACGTTGATGCGC CTGTCGCTCAGTGTTAGTACAACCAGACTTATTCTGCTGCATCACGATGCGGGTGACATTATTTACTCCTTCGGTAACTT TGTTGTTGGCGGTAATATTGTTGTTGGCCTGGTTATTTTTACTATTATCACCATTGTTCAATTTATGGTTATTACAAAAG GGGCCGAGCGTGTTGCTGAAGTGAGTGCTCGTTTTTCCCTTGATGGTATGCCAGGAAAGCAAATGAGTATTGATGGTGAT ATGCGCGCAGGCGTTATTGACCCGCTTGAGGCAAAAGTTCTTCGCTCCCGAGTGCAAAAGGAAAGCCAGTTTTATGGTTC AATGGACGGGGCGATGAAGTTTGTAAAAGGGGACGCTATTGCTGGTATCATTATTGTTTTAGTTAACCTTTTTGGTGGCG TGCTCATTGGTATGTGGCAATTTGACATGCCATTTAGTGCGGCACTTAGCCTGTTTTCTGTATTGTCTGTCGGTGATGCT TTGGTTGCCCAGATCCCTGCGCTTATTATTTCTGTCACCGCAGGCGTGGTTGTTACTCGTGTGCCCGGTGAAAGCGAAAA AGAAGAAAACCTTGCAGGTGATATTGTTCAACAGGTTTCTGTTAATAGTCGCCCTTTTTTAATCAGTGCCGCACTAATGC TCGTAATGGCGATTATTCCGGGCTTTCCTGCATTGGTCTTCTTATTTCTGGCGGTTTGCCTGTTGGGGATAGCCTGGAAA TTACAGAAGAAAAGAACATTTGGAACTGGCAATAATAAGGATGCTATGGGAGCTGATTTGTCTAATAGCCAAAACATCTC ACCGGGCGCTGAGCCATTGATTTTAAACTTAAGTAGTAATATTTATAGTTCAGACATTACACAGCAAATTGAGGTCATGC GTTGGAATTTCTTTGAGGAAAGTGGAATTCCATTGCCTAAGATTATTGTTAATCCGGTTAAAAATAATGATAGCGCAATA GAATTTTTGCTCTATCAAGAGTCAATATACAAAGATACTCTTATAGATGATACTGTCTATTTTGAGGCTGGGCATGCAGA GATATCATTCGAATTTGTCCAGGAAAAGCTTTCGACTAACTCTATCGTATATAAGACGAATAAGACTAATCAACAGCTCG CTCACCTTACAGGTATGGATGTTTATGCAACAACAAATGATAAGATAACGTTTTTGCTAAAGAAACTTGTATTGTCTAAT GCCAAAGAGTTCATCGGCGTACAAGAAACGCGTTATTTGATGGACATCATGGAGAGAAAATATAACGAGCTTGTGAAAGA GCTGCAGCGCCAGCTTGGTTTGAGCAAAATTGTTGACATCCTACAACGTCTCGTAGAGGAAAATGTCTCAATTAGAGACC TGAGAACTATCTTTGAGACGCTTATTTTTTGGTCAACAAAAGAAAAGGATGTGGTTATCTTGTGCGAATACGTTCGTATT GCCCTGCGTCGGCATATTTTAGGTCGCTATAGCGTTAGCGGTACACTTCTGAACGTTTGGCTTATTGGCTCTGATATTGA AAATGAGCTACGAGAGTCGATCAGACAAACGTCATCAGGTTCGTATCTGAATATCTCACCGGAGCGAACTGAGCAGATAA TTGGCTTCTTAAAAAATATCATGAATCCAACGGGGAACGGCGTCATTCTGACCGCTTTAGATATCAGGCGCTATGTGAAG 
AAAATGATTGAAGGTTCGTTCCCGTCAGTCCCCGTGCTCTCTTTTCAGGAGGTTGGGAATAATATCGAACTTAAAGTATT AGGAACGGTAAATGATTTCAGAGCATGA
Upstream 100 bases:
>100_bases TGTGCTTAAGTTGTTTGTTAACCAATATCGATGTTTTTTTTCTAATGAATACTTTTCAACAGCATGTGCAGATTATTGAG CGCGTTCGCAGGATGACATC
Downstream 100 bases:
>100_bases TTCTGTATTGGAAAAATACCCACGTATTCAGAAAGTGCTCAATAGCACTGTGCCGGCATTATCATTAAATTCGTCTACCA GATATGAAGGCAAGATTATC
Product: type III secretion protein, HrcV family
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 675; Mature: 675
Protein sequence:
>675_residues MNKLLNIFKKAESYHDLILALFFFMAVMMMIIPLPTVVVDIIIAINISTALLLLMLSIYIKNPLELTSFPTILLITTLMR LSLSVSTTRLILLHHDAGDIIYSFGNFVVGGNIVVGLVIFTIITIVQFMVITKGAERVAEVSARFSLDGMPGKQMSIDGD MRAGVIDPLEAKVLRSRVQKESQFYGSMDGAMKFVKGDAIAGIIIVLVNLFGGVLIGMWQFDMPFSAALSLFSVLSVGDA LVAQIPALIISVTAGVVVTRVPGESEKEENLAGDIVQQVSVNSRPFLISAALMLVMAIIPGFPALVFLFLAVCLLGIAWK LQKKRTFGTGNNKDAMGADLSNSQNISPGAEPLILNLSSNIYSSDITQQIEVMRWNFFEESGIPLPKIIVNPVKNNDSAI EFLLYQESIYKDTLIDDTVYFEAGHAEISFEFVQEKLSTNSIVYKTNKTNQQLAHLTGMDVYATTNDKITFLLKKLVLSN AKEFIGVQETRYLMDIMERKYNELVKELQRQLGLSKIVDILQRLVEENVSIRDLRTIFETLIFWSTKEKDVVILCEYVRI ALRRHILGRYSVSGTLLNVWLIGSDIENELRESIRQTSSGSYLNISPERTEQIIGFLKNIMNPTGNGVILTALDIRRYVK KMIEGSFPSVPVLSFQEVGNNIELKVLGTVNDFRA
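As a sanity check on how the protein sequence relates to the gene sequence, the following minimal sketch translates the first 30 and last 15 bases of the gene. It assumes the standard codon assignments, which bacterial translation table 11 shares for sense and stop codons. The `translate` helper is hypothetical and written only for this illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (standard genetic code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the product
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGAATAAACTCTTAAATATATTTAAAAAA"))  # MNKLLNIFKK
print(translate("GATTTCAGAGCATGA"))                 # DFRA
```

Both fragments match the corresponding ends of the 675-residue sequence, which is consistent with the gene sequence being reported on its coding strand.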
Sequences:
>Translated_675_residues MNKLLNIFKKAESYHDLILALFFFMAVMMMIIPLPTVVVDIIIAINISTALLLLMLSIYIKNPLELTSFPTILLITTLMR LSLSVSTTRLILLHHDAGDIIYSFGNFVVGGNIVVGLVIFTIITIVQFMVITKGAERVAEVSARFSLDGMPGKQMSIDGD MRAGVIDPLEAKVLRSRVQKESQFYGSMDGAMKFVKGDAIAGIIIVLVNLFGGVLIGMWQFDMPFSAALSLFSVLSVGDA LVAQIPALIISVTAGVVVTRVPGESEKEENLAGDIVQQVSVNSRPFLISAALMLVMAIIPGFPALVFLFLAVCLLGIAWK LQKKRTFGTGNNKDAMGADLSNSQNISPGAEPLILNLSSNIYSSDITQQIEVMRWNFFEESGIPLPKIIVNPVKNNDSAI EFLLYQESIYKDTLIDDTVYFEAGHAEISFEFVQEKLSTNSIVYKTNKTNQQLAHLTGMDVYATTNDKITFLLKKLVLSN AKEFIGVQETRYLMDIMERKYNELVKELQRQLGLSKIVDILQRLVEENVSIRDLRTIFETLIFWSTKEKDVVILCEYVRI ALRRHILGRYSVSGTLLNVWLIGSDIENELRESIRQTSSGSYLNISPERTEQIIGFLKNIMNPTGNGVILTALDIRRYVK KMIEGSFPSVPVLSFQEVGNNIELKVLGTVNDFRA >Mature_675_residues MNKLLNIFKKAESYHDLILALFFFMAVMMMIIPLPTVVVDIIIAINISTALLLLMLSIYIKNPLELTSFPTILLITTLMR LSLSVSTTRLILLHHDAGDIIYSFGNFVVGGNIVVGLVIFTIITIVQFMVITKGAERVAEVSARFSLDGMPGKQMSIDGD MRAGVIDPLEAKVLRSRVQKESQFYGSMDGAMKFVKGDAIAGIIIVLVNLFGGVLIGMWQFDMPFSAALSLFSVLSVGDA LVAQIPALIISVTAGVVVTRVPGESEKEENLAGDIVQQVSVNSRPFLISAALMLVMAIIPGFPALVFLFLAVCLLGIAWK LQKKRTFGTGNNKDAMGADLSNSQNISPGAEPLILNLSSNIYSSDITQQIEVMRWNFFEESGIPLPKIIVNPVKNNDSAI EFLLYQESIYKDTLIDDTVYFEAGHAEISFEFVQEKLSTNSIVYKTNKTNQQLAHLTGMDVYATTNDKITFLLKKLVLSN AKEFIGVQETRYLMDIMERKYNELVKELQRQLGLSKIVDILQRLVEENVSIRDLRTIFETLIFWSTKEKDVVILCEYVRI ALRRHILGRYSVSGTLLNVWLIGSDIENELRESIRQTSSGSYLNISPERTEQIIGFLKNIMNPTGNGVILTALDIRRYVK KMIEGSFPSVPVLSFQEVGNNIELKVLGTVNDFRA
Specific function: Required for formation of the rod structure of the flagellar apparatus. Together with FliI and FliH, may constitute the export apparatus of flagellin. [C]
COG id: COG4789
COG function: function code U; Type III secretory pathway, component EscV
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein (Potential) [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the FHIPEP (flagella/HR/invasion proteins export pore) family [H]
Homologues:
Organism=Escherichia coli, GI1788187, Length=689, Percent_Identity=32.510885341074, Blast_Score=320, Evalue=3e-88
Paralogues:
None
Copy number: 10-20 (rich media) [C]
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR001712
- InterPro: IPR006302 [H]
Pfam domain/function: PF00771 FHIPEP [H]
EC number: NA
Molecular weight: Translated: 75098; Mature: 75098
Theoretical pI: Translated: 5.82; Mature: 5.82
Prosite motif: PS00994 FHIPEP
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.3 %Cys (Translated Protein)
3.6 %Met (Translated Protein)
3.9 %Cys+Met (Translated Protein)
0.3 %Cys (Mature Protein)
3.6 %Met (Mature Protein)
3.9 %Cys+Met (Mature Protein)
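The percentages above are residue counts divided by the protein length. The sketch below shows that calculation. The helper names are illustrative, and the counts of 2 Cys and 24 Met among 675 residues are a hand tally of the sequence in this record, so treat them as an assumption rather than a database value.

```python
def percent(count: int, total: int) -> float:
    """Share of residues as a percentage, rounded to one decimal place."""
    return round(100 * count / total, 1)

def cys_met_percent(protein: str) -> dict:
    """Percent Cys, Met and Cys+Met in a one-letter protein sequence."""
    seq = protein.replace(" ", "").upper()
    cys, met = seq.count("C"), seq.count("M")
    return {
        "Cys": percent(cys, len(seq)),
        "Met": percent(met, len(seq)),
        "Cys+Met": percent(cys + met, len(seq)),
    }

# First ten residues of the protein: one Met, no Cys.
print(cys_met_percent("MNKLLNIFKK"))

# Hand-tallied counts for the full 675-residue sequence (assumption).
print(percent(2, 675), percent(24, 675), percent(26, 675))  # 0.3 3.6 3.9
```

Passing the full sequence from the Protein sequence field to `cys_met_percent` should give the same three values if the tally is right.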
Secondary structure:
>Translated Secondary Structure MNKLLNIFKKAESYHDLILALFFFMAVMMMIIPLPTVVVDIIIAINISTALLLLMLSIYI CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH KNPLELTSFPTILLITTLMRLSLSVSTTRLILLHHDAGDIIYSFGNFVVGGNIVVGLVIF CCCCCHHCCCHHHHHHHHHHHHCCCCEEEEEEEEECCCHHHHHHCCEEECCHHHHHHHHH TIITIVQFMVITKGAERVAEVSARFSLDGMPGKQMSIDGDMRAGVIDPLEAKVLRSRVQK HHHHHHHHHHHHCCHHHHHHHHHHEEECCCCCCCEECCCCCCCCCCCHHHHHHHHHHHHH ESQFYGSMDGAMKFVKGDAIAGIIIVLVNLFGGVLIGMWQFDMPFSAALSLFSVLSVGDA HHHHHCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH LVAQIPALIISVTAGVVVTRVPGESEKEENLAGDIVQQVSVNSRPFLISAALMLVMAIIP HHHHHHHHHHHHHHCEEEEECCCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHC GFPALVFLFLAVCLLGIAWKLQKKRTFGTGNNKDAMGADLSNSQNISPGAEPLILNLSSN CHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCC IYSSDITQQIEVMRWNFFEESGIPLPKIIVNPVKNNDSAIEFLLYQESIYKDTLIDDTVY CHHHHHHHHHHHHHHCCCCCCCCCCCHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEE FEAGHAEISFEFVQEKLSTNSIVYKTNKTNQQLAHLTGMDVYATTNDKITFLLKKLVLSN EECCCCEEHHHHHHHHHCCCCEEEEECCCHHHHHHHCCCEEEEECCCHHHHHHHHHHHHC AKEFIGVQETRYLMDIMERKYNELVKELQRQLGLSKIVDILQRLVEENVSIRDLRTIFET HHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCHHHHHHHHHH LIFWSTKEKDVVILCEYVRIALRRHILGRYSVSGTLLNVWLIGSDIENELRESIRQTSSG HHHCCCCCCCEEEEHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCC SYLNISPERTEQIIGFLKNIMNPTGNGVILTALDIRRYVKKMIEGSFPSVPVLSFQEVGN CEEECCHHHHHHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHHHCCCCCCCCCEEHHHCCC NIELKVLGTVNDFRA CEEEEEEECCHHCCC >Mature Secondary Structure MNKLLNIFKKAESYHDLILALFFFMAVMMMIIPLPTVVVDIIIAINISTALLLLMLSIYI CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHH KNPLELTSFPTILLITTLMRLSLSVSTTRLILLHHDAGDIIYSFGNFVVGGNIVVGLVIF CCCCCHHCCCHHHHHHHHHHHHCCCCEEEEEEEEECCCHHHHHHCCEEECCHHHHHHHHH TIITIVQFMVITKGAERVAEVSARFSLDGMPGKQMSIDGDMRAGVIDPLEAKVLRSRVQK HHHHHHHHHHHHCCHHHHHHHHHHEEECCCCCCCEECCCCCCCCCCCHHHHHHHHHHHHH ESQFYGSMDGAMKFVKGDAIAGIIIVLVNLFGGVLIGMWQFDMPFSAALSLFSVLSVGDA HHHHHCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH LVAQIPALIISVTAGVVVTRVPGESEKEENLAGDIVQQVSVNSRPFLISAALMLVMAIIP 
HHHHHHHHHHHHHHCEEEEECCCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHC GFPALVFLFLAVCLLGIAWKLQKKRTFGTGNNKDAMGADLSNSQNISPGAEPLILNLSSN CHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCC IYSSDITQQIEVMRWNFFEESGIPLPKIIVNPVKNNDSAIEFLLYQESIYKDTLIDDTVY CHHHHHHHHHHHHHHCCCCCCCCCCCHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEE FEAGHAEISFEFVQEKLSTNSIVYKTNKTNQQLAHLTGMDVYATTNDKITFLLKKLVLSN EECCCCEEHHHHHHHHHCCCCEEEEECCCHHHHHHHCCCEEEEECCCHHHHHHHHHHHHC AKEFIGVQETRYLMDIMERKYNELVKELQRQLGLSKIVDILQRLVEENVSIRDLRTIFET HHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCHHHHHHHHHH LIFWSTKEKDVVILCEYVRIALRRHILGRYSVSGTLLNVWLIGSDIENELRESIRQTSSG HHHCCCCCCCEEEEHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCHHHHHHHHHHHCCCC SYLNISPERTEQIIGFLKNIMNPTGNGVILTALDIRRYVKKMIEGSFPSVPVLSFQEVGN CEEECCHHHHHHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHHHCCCCCCCCCEEHHHCCC NIELKVLGTVNDFRA CEEEEEEECCHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 9140973; 11677609 [H]