| Definition | Vibrio cholerae M66-2 chromosome I, complete genome. |
|---|---|
| Accession | NC_012578 |
| Length | 2,892,523 bp |
The map label for this gene is yqfA [H]
Identifier: 227080277
GI number: 227080277
Start: 37847
End: 38494
Strand: Direct
Name: yqfA [H]
Synonym: VCM66_0040
Alternate gene names: 227080277
Gene position: 37847-38494 (Clockwise)
Preceding gene: 227080276
Following gene: 227080284
Centisome position: 1.31
GC content: 47.22
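The coordinate fields above can be rechecked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes that the centisome position is the start coordinate expressed as a percentage of the chromosome length (2,892,523 in the header table). The function and variable names are illustrative and not part of the record.

```python
# Values copied from the record; all names here are illustrative only.
CHROMOSOME_LENGTH = 2_892_523
START, END = 37_847, 38_494
REPORTED_GC_PERCENT = 47.22


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, chromosome_length: int) -> float:
    """Start position as a percentage of chromosome length (assumed definition)."""
    return 100.0 * start / chromosome_length


def gc_percent(gc_bases: int, total_bases: int) -> float:
    """Percentage of G or C bases in a sequence of total_bases bases."""
    return 100.0 * gc_bases / total_bases


length = gene_length(START, END)
position = round(centisome(START, CHROMOSOME_LENGTH), 2)
# Number of G/C bases implied by the reported GC content at this gene length.
implied_gc_bases = round(REPORTED_GC_PERCENT / 100 * length)

print(length, position, implied_gc_bases)
```

Under these assumptions the gene spans 648 bases, which agrees with the `>648_bases` header of the gene sequence, and the reported GC content corresponds to 306 G or C bases.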
Gene sequence:
>648_bases
GTGAGTATGTCGAACAGTTATGGCTTTAAAGAAGAAGTGGCCAATGCGATAAGTCATGGCGTTGGCCTTATCTTAGGGAT
AGTTGGTTTAGTGCTGCTGTTGGTCAAAGCGGTGGATCAGCAAGCCGATGCATTGACTATTACCAGCATGAGCATTTATG
GCGGCAGTATGATTGCGCTATTTTTGGCTTCTACGCTGTACCACGCTATCCCTTATCAGCGTGCAAAGCGTTGGCTAAAA
ACCTTTGATCACTGTGCTATTTATTTACTGATTGCGGGCAGTTATACCCCATTTTTACTGGTCAGTTTGCGTACACCACT
GGCGGTTGGTTTGATGATAGTGATCTGGTCGCTCGCGCTTATTGGCATTCTGATGAAAATTGCTTTTGTCTACCGCTTCA
AAAAGCTCTCTTTGGTGACGTATCTGACCATGGGTTGGCTTTCGCTGATCGTGATTTACCAGCTTGCCATTCATCTTGAG
GTGGGTGGGCTCACTCTGCTGGCGGCAGGTGGACTTATCTATTCACTCGGGGTGATTTTCTACGTCGCCAAACGGATCCC
TTACAACCATGCCATTTGGCACGCTTTTGTGCTGGCGGGATGCGCATGCCATTTCTTAGCCATTTATCTGTATGTCGAGC
CGATTTAG
Upstream 100 bases:
>100_bases
CGTTTCTTGATCGCAGCTGCTGATCACGCGGTCTGAAATTTTAGCTAACACCTGTAAGCTCAATGTGCGATACTAAAGCC
AAGTTTCAGACTCGGATTGA
Downstream 100 bases:
>100_bases
CTTTGACGGATAAGATGGGTAAAAGAGGAAACGCGCATTTAGCGCGTTTCAATCTGTTTGATGGTAGCGTGTAGTGTGTA
CTGACTCTGCTCTGCGAGGT
Product: putative hemolysin
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 215; Mature: 214
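The two residue counts follow from the gene length. The sketch below assumes that the final codon of the 648-base gene is a stop codon (the listed sequence ends in TAG) and that the mature form differs from the translated form only by removal of the initiator methionine (the mature sequence listed in this record starts with SMSNS rather than MSMSNS). The function name is hypothetical.

```python
def residue_counts(gene_length_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a protein-coding gene.

    Assumes the last codon is a stop codon and that maturation only
    removes the initiator methionine.
    """
    if gene_length_bases % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    codons = gene_length_bases // 3
    translated = codons - 1   # the stop codon encodes no residue
    mature = translated - 1   # initiator Met removed (assumption)
    return translated, mature


translated, mature = residue_counts(648)
print(translated, mature)
```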
Protein sequence:
>215_residues
MSMSNSYGFKEEVANAISHGVGLILGIVGLVLLLVKAVDQQADALTITSMSIYGGSMIALFLASTLYHAIPYQRAKRWLK
TFDHCAIYLLIAGSYTPFLLVSLRTPLAVGLMIVIWSLALIGILMKIAFVYRFKKLSLVTYLTMGWLSLIVIYQLAIHLE
VGGLTLLAAGGLIYSLGVIFYVAKRIPYNHAIWHAFVLAGCACHFLAIYLYVEPI
Sequences:
>Translated_215_residues
MSMSNSYGFKEEVANAISHGVGLILGIVGLVLLLVKAVDQQADALTITSMSIYGGSMIALFLASTLYHAIPYQRAKRWLK
TFDHCAIYLLIAGSYTPFLLVSLRTPLAVGLMIVIWSLALIGILMKIAFVYRFKKLSLVTYLTMGWLSLIVIYQLAIHLE
VGGLTLLAAGGLIYSLGVIFYVAKRIPYNHAIWHAFVLAGCACHFLAIYLYVEPI
>Mature_214_residues
SMSNSYGFKEEVANAISHGVGLILGIVGLVLLLVKAVDQQADALTITSMSIYGGSMIALFLASTLYHAIPYQRAKRWLKT
FDHCAIYLLIAGSYTPFLLVSLRTPLAVGLMIVIWSLALIGILMKIAFVYRFKKLSLVTYLTMGWLSLIVIYQLAIHLEV
GGLTLLAAGGLIYSLGVIFYVAKRIPYNHAIWHAFVLAGCACHFLAIYLYVEPI
Specific function: Unknown
COG id: COG1272
COG function: function code R; Predicted membrane protein, hemolysin III homolog
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein [H]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the UPF0073 (Hly-III) family [H]
Homologues:
Organism=Homo sapiens, GI52630445, Length=214, Percent_Identity=35.981308411215, Blast_Score=109, Evalue=2e-24
Organism=Homo sapiens, GI154759275, Length=208, Percent_Identity=33.1730769230769, Blast_Score=105, Evalue=2e-23
Organism=Homo sapiens, GI154759277, Length=232, Percent_Identity=29.7413793103448, Blast_Score=92, Evalue=2e-19
Organism=Escherichia coli, GI1789266, Length=210, Percent_Identity=72.8571428571428, Blast_Score=312, Evalue=1e-86
Organism=Caenorhabditis elegans, GI71996356, Length=211, Percent_Identity=33.175355450237, Blast_Score=92, Evalue=2e-19
Organism=Drosophila melanogaster, GI24640324, Length=228, Percent_Identity=32.4561403508772, Blast_Score=94, Evalue=9e-20
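The long decimal identities in the homologue hits are consistent with whole-number counts of identical positions. The sketch below assumes that `Length` is the aligned length and that `Percent_Identity` is identical positions divided by that length, times 100; the record does not state this explicitly. The tuple layout and function name are illustrative.

```python
# (organism, GI, Length, Percent_Identity) copied from the homologue hits.
HITS = [
    ("Homo sapiens", "GI52630445", 214, 35.981308411215),
    ("Homo sapiens", "GI154759275", 208, 33.1730769230769),
    ("Homo sapiens", "GI154759277", 232, 29.7413793103448),
    ("Escherichia coli", "GI1789266", 210, 72.8571428571428),
    ("Caenorhabditis elegans", "GI71996356", 211, 33.175355450237),
    ("Drosophila melanogaster", "GI24640324", 228, 32.4561403508772),
]


def identical_positions(length: int, percent_identity: float) -> int:
    """Recover the identical-position count from length and percent identity
    (assumes percent identity = identical / length * 100)."""
    return round(length * percent_identity / 100)


identities = [identical_positions(length, pct) for _, _, length, pct in HITS]
# Hit with the highest percent identity.
closest = max(HITS, key=lambda hit: hit[3])

for (organism, gi, length, _), n in zip(HITS, identities):
    print(f"{organism} {gi}: {n}/{length} identical")
```

Under that reading, the Escherichia coli hit has 153 identical positions out of 210 and is the closest homologue listed.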
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR004254
- InterPro: IPR005744 [H]
Pfam domain/function: PF03006 HlyIII [H]
EC number: NA
Molecular weight: Translated: 23664; Mature: 23533
Theoretical pI: Translated: 9.37; Mature: 9.37
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.4 %Cys (Translated Protein)
3.3 %Met (Translated Protein)
4.7 %Cys+Met (Translated Protein)
1.4 %Cys (Mature Protein)
2.8 %Met (Mature Protein)
4.2 %Cys+Met (Mature Protein)
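These percentages can be recomputed from the protein sequence by counting C and M residues. The sketch below copies the translated sequence from this record and assumes the mature form is the same sequence without the initiator methionine, which matches the mature sequence listed here. Rounding to one decimal place is also an assumption about how the values were reported, and the function name is illustrative.

```python
# Translated protein sequence copied from the record (215 residues).
TRANSLATED = (
    "MSMSNSYGFKEEVANAISHGVGLILGIVGLVLLLVKAVDQQADALTITSMSIYGGSMIALFLASTLYHAIPYQRAKRWLK"
    "TFDHCAIYLLIAGSYTPFLLVSLRTPLAVGLMIVIWSLALIGILMKIAFVYRFKKLSLVTYLTMGWLSLIVIYQLAIHLE"
    "VGGLTLLAAGGLIYSLGVIFYVAKRIPYNHAIWHAFVLAGCACHFLAIYLYVEPI"
)
# Mature form: initiator Met removed (assumption consistent with the record).
MATURE = TRANSLATED[1:]


def cys_met_percent(protein: str) -> tuple[float, float, float]:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    n = len(protein)
    cys = 100.0 * protein.count("C") / n
    met = 100.0 * protein.count("M") / n
    return round(cys, 1), round(met, 1), round(cys + met, 1)


print(len(TRANSLATED), cys_met_percent(TRANSLATED))
print(len(MATURE), cys_met_percent(MATURE))
```

The translated sequence contains 3 Cys and 7 Met residues; the mature form has one Met fewer, which accounts for the drop from 3.3 to 2.8 %Met.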
Secondary structure:
>Translated Secondary Structure
MSMSNSYGFKEEVANAISHGVGLILGIVGLVLLLVKAVDQQADALTITSMSIYGGSMIAL
CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEHHHCHHHHHH
FLASTLYHAIPYQRAKRWLKTFDHCAIYLLIAGSYTPFLLVSLRTPLAVGLMIVIWSLAL
HHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
IGILMKIAFVYRFKKLSLVTYLTMGWLSLIVIYQLAIHLEVGGLTLLAAGGLIYSLGVIF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCHHHHHHHHHHHHHHHHH
YVAKRIPYNHAIWHAFVLAGCACHFLAIYLYVEPI
HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
SMSNSYGFKEEVANAISHGVGLILGIVGLVLLLVKAVDQQADALTITSMSIYGGSMIAL
CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEHHHCHHHHHH
FLASTLYHAIPYQRAKRWLKTFDHCAIYLLIAGSYTPFLLVSLRTPLAVGLMIVIWSLAL
HHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHH
IGILMKIAFVYRFKKLSLVTYLTMGWLSLIVIYQLAIHLEVGGLTLLAAGGLIYSLGVIF
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCHHHHHHHHHHHHHHHHH
YVAKRIPYNHAIWHAFVLAGCACHFLAIYLYVEPI
HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 7.0
TargetDB status: NA
Availability: NA
References: 11206551; 11258796 [H]