Definition | Yersinia pestis Nepal516, complete genome. |
---|---|
Accession | NC_008149 |
Length | 4,534,590 |
The map label for this gene is yfhM [C]
Identifier: 108811221
GI number: 108811221
Start: 1214070
End: 1220048
Strand: Reverse
Name: yfhM [C]
Synonym: YPN_1057
Alternate gene names: 108811221
Gene position: 1220048-1214070 (Counterclockwise)
Preceding gene: 108811222
Following gene: 108811219
Centisome position: 26.91
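The coordinate fields above agree with one another, which a short calculation can confirm. The Python sketch below uses illustrative names that are not part of the record. It derives the gene length from Start and End, and the protein length assuming a single terminal stop codon. It also computes a centisome value, assuming this is the position of the gene's first transcribed base (End, for a reverse-strand gene) as a percentage of the genome length. That centisome convention is inferred from the numbers in this record and is not documented here.

```python
# Values copied from the record; the names are illustrative only.
GENOME_LENGTH = 4_534_590           # "Length" of NC_008149
START, END = 1_214_070, 1_220_048   # "Start" / "End" of the gene


def gene_length(start: int, end: int) -> int:
    """Number of bases in an inclusive, 1-based coordinate range."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    return n_bases // 3 - 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed convention)."""
    return round(100.0 * position / genome_length, 2)


n_bases = gene_length(START, END)
print(n_bases, protein_length(n_bases), centisome(END, GENOME_LENGTH))
```

With the values in this record, the sketch gives 5979 bases, 1992 residues and a centisome position of 26.91, matching the Gene sequence, Number of amino acids and Centisome position fields.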
GC content: 50.01
Gene sequence:
>5979_bases ATGGATTTGTTGAGGTTTCTGCTGATCTCCCCCTTTGCCCTTATCAAGGGACTCTATCGGTTAAGCGCTTATCTTTTACG GTTAGTGGGCCGCCTATTGCGCCCAGTCGTCGGTAACCTGAATTGGCGCGCGCCACAATGGATGACGAAAACCGCCAACG GGCTACACTGTGCCTTCAACCGTTCAGAGCAATGGGTCGCTAAGCATCCCAAAGGGATCAGTGCGGCAATAGTGCTGTTG ATGGCTGCCGCCAGCGCCGCCTTCTATGGCTATCACTGGTATCTTAACCGGCCCCAACCGATTGAACCGGCACCTATGGT TTATCAGGAAACCAGTATCAGAGTCTCGGCGCCAAGAACGGTTAACTACCAGGCGCAAAAACCAGAAGCCCAGCCGCTTA GTCTTAATTTTATGCATTCAGCCGCACCTATCACCGCGATGGGTCAGGTCGTCGATCAGGGCATCTCATTAACCCCCGCG ATAGAGGGTGAATGGAAATGGGCAACTGAACGCACGTTGGTGTTTACTCCCAAAAAAGCCTGGCCGATGGGCGCTAACTA CCAAATTACGATCGATACAGAAAAACTTTTAGCACCACAGATTAAGCTCAATCAGACCGAACTTAATTTCACAACGCCAG CTTTTGCTTATCAATTGGAAAAAGCGGAATATTATCAGGATCCGCAAGAGGCCCAAAAGCGCAGCACTATTTTCCATGTG CAATTTAATGCCCCAGTTGATGTTGCCAGCTTTGAAAAACAGATCCTCTTGGGATTGGTCGAAGGTAAATCCAAGTCAGA GAAGAAACTTAATTTCTCCGTCGTTTATGATGAGAAAAAGCTTAATGCCTGGATACATTCGCAACCCTTGATGCCAATGG ATAAAGGCGGTTCGGTCCATCTATCGATTAATAAAGGGGTGAATGCCAGTGTCGCCGCCACGCCTACGACACAGGCACAG AATAAATGGGTATCCGTCCCTAACCTATATAGCCTGGCGGTTAATAGTATCAATGCCACGTTGGTCGAGTCAGATAACAA TAATGGTGAGCGGGCCTTAATTATTGCTATCAGCGACGCGGTTAAAGATAAAGAGATCAAAAATGCGGTCAAAGCCTGGT TACTGCCGCAACATAATTTTCAAGCGAAAGAGAGCGCCAAAACATCAACCGATTTCTATCCTTGGGATATGGATGATATT GACGATAATCTGCTGCAACAATCAACGCCGCTGGCGCTGACCCTCAATGAGGCCGAGCAAGAGTATCAGCCAATATTCAG CTTTAAGTTTGATGCCCCTTCCTATCGCACACTGCTGATCGAGGTTAACAATAGCCTGACATCGGTGGGCGGTTATAAAA TGCCGGAAAAAATCTACCAAATAGTCAGGGTTCCCGATTACCCTAAGACGCTGCGCTTTATGTCACAAGGCTCGTTATTA TCGATGCAGGGTGATAAGCAGATCAGCGTCGCCGCCCGTAATATGACTGGCATGAAACTGGATATTAAGCGGGTTATTCC TAGCCAGTTACAACATATTGTGTCATTTAAAAGCAGCGAATATTCATCAGCTCACTTTAACCGCCTGAGTGATGAATATT TTACTGAACACTTCCAGTACCAAACCGCGCTGAATAATGACAACCCCGGCGAGATCAATTATCAAGGGGTCGATCTGTCC CGTTATCTTGCAAATAATCCGAGTGCTCGGCGTGGGGTGTTCTTACTCACCCTGTCAGCTTGGGATCCGGAGAAAAGGGA TAATCAGCAACACAGCGAGGAAGACTACGACGAAGACCAGGAATGGGTCGGCGATTCACGCTTTGTGGTGATCACGGACT TAGGCATTATCACCAAGCAATCGCAGGATAGATCCCGTGATGTGTTTGTGCAATCCATTCACTCGGGTCTGCCCGCCGCC 
GATGCTAAAGTCTCTGTGGTGGCAAAAAATGGTGTGGTCTTACTGAGCCAAATCACCGATAGCAAAGGGCATGTCCATTT TCCTGCGCTGGACGCCTTTAAAAATGAACGCCAACCGGTCATGTTCCTGGTGGAAAAAGAAGGGGATGTCTCCTTCCTGC CCACCCGAGCCACCTATGACCGTAACCTTGATTTCTCACGTTTTGATATTGATGGCGAAGAGACCCCGTCCGACCCACGT ACTCTAAGCAGCTATCTGTTTTCTGACCGGGGAGTTTATCGCCCAGGCGACCGCTTCAATATTGGTCTGATCACCCGGAC CGCCAACTGGGCTACCGCACTCGATGGCGTCCCCCTGCGGGCGGAGATCCGTGACCCACGAGATACCTTGATGAGTACCC TGCCGATAACCTTGGACAGCAGTGGTTTCAATGAGCTCAGCTATACGACCGGTGAAAACTCACCTACCGGTGAATGGAAC GTCTATCTCTATCTGGTTGGTAAGAATAATGAAACGTCGATGTTGCTGGGGCACACCACCGTAAATGTTAAAGAGTTCGA GCCTGATCGCTTAAAAGTGCAACTGCAACTGACGCCAGAGCGTCAACAAGGCTGGGTTAAACCGCAGGAGCTGCAAGCCA ATATCAATGTACAAAATCTATTCGGTACACCAGCACAGGAGCGCCGTGTCACCTCTAGACTGATCTTGCGGCCAATGTAC CCGAGTTTTGCCCCGTTCCCTGATTACCTGTTCTATGAGAATCGCCATAACAGCGATGGTTTTGAGACCGAACTGGAAGA GCAAACGACCGATCTACAGGGGATGGCGACCATTCCATTGGATCTGAAATCCTATGCTGACGCCACCTATCAACTGCAAT TGCTGTCGGAAGCCTTTGAAGCGGGTGGAGGCCGCTCTGTGGCCGCGACTGCGCGGGTTCTGGTCTCACCTTACGACTCT CTGGTTGGGGTGAAAGCCGATGGCGATCTGAGTTATATCAACCGTGATGCCGTGCGTAAGCTGAATATTATTGCCGTTGA CCCGAGCCTGAATAAAATTGCGCTGCCAGACTTGAGTCTGTCATTGATTGAGCAGAAGTATATTTCAGTGCTAACCAAAC AGGATTCAGGCGTTTATAAATATCAATCACGGCTAAAGGAGCAGTTGGTCTCAGAGCAACCGCTACAAATCAGCCCGACA GGGACGGATTTCACCCTGGTGACCCAGCAGCCTGGTGATTTTATTCTGGTGGTTAAGGACAGTCAGGGGCAGGTTCTGAA CCGTATTAGTTATACGGTGGCGGGTAACGCAAACCTGACCCGCTCACTGGATCGCAACACCGAATTAAAGCTAAAACTGA ATCAGGCCGAATATCTGCAAGGCGAAGAAATTGAGATTGCGATTAATGCACCTTATGCCGGTAGCGGTCTGATCACGATA GAAAAAGATAAAGTGTATAGCTGGCAGTGGTTCCACAGTGATACCACCAGCTCTGTGCAGAGAATCCGCATCCCACCGGC AATGGAAGGCAATGGCTATATCAACGTACAATTCGTGCGTGATGTGAATTCCGATGAGATCTTTATGAGCCCACTGAGTT ACGGTGTGATGCCATTTAAGATCAGTACCAAAGCGCGTCAGGCGGCTATCGAGTTAGCGTCGCCGTCAGTCATTAAACCG GGTGAAGTGTTACCGATTAAAGTGACCACCGATTCACCACAGCGCGTGGTGGTGTTTGCCGTCGATGAAGGTATTTTGCA GGTGGCACGCTATCGCCTGAAAGATCCACTGGATTACTTCTTCCGTAAACGTGAACTGAGTGTACAGAGTGCACAAATTC TCGATTTGATCCTGCCGGAATTCAGCAAGCTGATGGCACTGACCTCCGCACCTGGAGGCGACGCCGGGGAAGGGCTGGAT 
CTGCACCTCAATCCGTTTAAACGCAAACAAGACAAGCCGGTGGCTTATTGGTCTGGTATCACCGAAGTGAATGGTGAAAC CACCTTCAATTACCCGATTCCCGACTATTTCAATGGTAAAATTCGCGTGATGGCCATCTCTGCGACCCCTGATCGCATTG GTAAAGTCCAGACCTCGACCACCGTGCGGGATAACTTTATTCTGACGCCGAATGTCCCCGCGATGGTAGCACCGGGAGAT GAATTTGATGTCACCGTGGGTGTGAGTAACAACCTGCAAGGATTGAAGGGTAAAGCGGTTGATATCACCGTGCGTCTGAC ACCACCGCCACAACTGGAAGTGGTGGGTGAAGCGCAACACAGCCTGTCGCTGGCAGAAAAACGTGAAACGCTTGTCAGCT TCCGCCTACGCGCCCGTTCAGCATTGGGTGATGCTCCACTGGTGTTTGATGCCAGCTATGGCTCTCAATCCAGCCGCCGG ACGGTCAGTACCTCGGTACGCCCGGCGATGCCATTCCGAACGCAATCGGTGATGGGCCGGATGGAGGGTAACAAGCATAC TGTGACCAATCTGCGCCAGATGTTTGATAATTATGCTCAACGTCAGGCGACCGCTTCCCACTCACCGTTGGTCTTAACCC AAGGTCTGGCGCGGTACCTGGCTGATTACCCGTACTACAGTTCTGAGCAAATTGTCAGCCGCTCGATTCCGTTGATTATG CAAAGCAAACATCCTGAAATGGACAGTGCCCTCAATCAGAATGAGGTCCGTGATCAACTGAAAAACATGCTACGTATCCT GAGCTCTCGGCAGAATAGCACTGGTGCAATCGGTTTGTGGCACGCCTCCCCTACCCCTGATCCGTTTGTCACACCTTATG TCGTGCAATTTCTGCTGGAAGCGAAATCTGCCGGTTACAGCTTGCCGAATGACATCTTGGAGGGGGCCAACAACGCACTG CGTCTGTTAGCGGCTCGACCTTATGATGACCTTTACTCTCTGCGTTTGCGGGCCTTTGCTGTTTACCTGTTGACCTTGCA GGGGGAGATCACCACCAATACTCTGGCATCGGTGCAAAGTACGTTACAGCAACTTTATCCTGACAGTTGGCAGACTGATC TGAGTGCCATTTATCTGGCCTCATCATACCGTCTGCTCAAAATGGATGACGAAGCCAATAAACTGCTGCAACCCACCTGG AAACAACTGGGTAAAGCCTACAGCAAGGCCTGGTGGACGCAGAATTATTTTGATCCACTGGTGCAAGATGCAACCCGGTT GTATCTGATCACTCGCCATTTCCCAGAGAAAGTCTCTTCTATTCCGCCACAAGCACTGGAAAATATGGTGCTGGCACTGA GGGATGAGCATTACACGACCTATTCATCCGCGATGAGCATTCTGGCACTGGAAAGTTACACCAGCCAGGTAGCCGCCCAG CAAGATACGCCAGAAACCCTGCAAATCATCGAGATCAGTAAAAGCAAAGGGATCGACCCTAACGTTATCTCAACGCTGAA CGGCCTGTTCGTTCAAGGTGATTTTACCGGTGAGGCTAAAGCGATTCAGTTTAACAACTATGCCTCGGCACCCGCTTGGT ATGTGGTCAATCAATCAGGCTATGACCTTCAGCCACCAAAAGACGCCATCTCTAATGGGCTGGAAATCAGCCGCAGCTAC ACCGATGAGCAGGGTAAGCCGGTGACCCAAGTCACCTTAGGGCAGAAAGTTAACGTGCACCTAAAAATCCGGGCTAACGC TAAACAAGGTCAAAATAATCTGGCGATTGTCGATCTACTGCCGGGCGGTTTTGAAGTGGTACAACAAACGGCACCTGAAC CAGAGTTTTATGATAATCAGGATGATCAGGATGAGGAAACTGGCAGCGGCTGGCAGTCGCCGCTAATGGTATCTGGCTCC 
AGTTGGTACCCTGACTACAGTGATATTCGTGAAGATCGCGTGATCATTTATGGCAGTGCCAGTACCGACGTTAAAGAGTT TATCTACCAAATCAAATCAACCAATACGGGTCGCTTTGTGGTGCCACCGGCTTACGGCGAAGCCATGTATGATCGTAATG TACAGGCGCTGTCGGTCGGTAAAGGGCATATCCTTGTCGTTCCACCTGAGGCAAAATAG
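Because the gene lies on the reverse strand, the sequence above is presumably the reverse complement of genome positions 1214070-1220048: it opens with the start codon ATG and closes with the stop codon TAG. A minimal Python sketch of that conversion follows. The function names are illustrative, and the short forward-strand fragment is derived here from the last twelve bases of the gene sequence rather than copied from the genome record.

```python
# Translation table mapping each base to its complement.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse the string."""
    return seq.translate(_COMPLEMENT)[::-1]


def looks_like_orf(seq: str) -> bool:
    """Loose check: whole codons, ATG start, standard stop codon at the end."""
    seq = seq.upper()
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# Hypothetical forward-strand fragment covering the 3' end of the gene.
forward_fragment = "CTATTTTGCCTC"
coding_end = reverse_complement(forward_fragment)
print(coding_end)  # GAGGCAAAATAG
```

The same `reverse_complement` call applied to the full forward-strand interval would be expected to reproduce the 5979-base sequence listed here, provided the genome coordinates are 1-based and inclusive.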
Upstream 100 bases:
>100_bases AAAATATAACCTGAATGGGGAAACTGTGACTTATCCGTCCGCGGACAGTAACTAACCTCTTGGGTTGTTGTGCTGTCTGG TGTTTAAAGGATTATTATCT
Downstream 100 bases:
>100_bases CTGAAAAATAAATATCCTCCGGCATAGCCGGAGGTTTTTCATATGCGCCTATAAGGCTCTGTTACCAGCCGCGCCCTAAC AGGCGCATCGCGATCTGACA
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 1992; Mature: 1992
Protein sequence:
>1992_residues MDLLRFLLISPFALIKGLYRLSAYLLRLVGRLLRPVVGNLNWRAPQWMTKTANGLHCAFNRSEQWVAKHPKGISAAIVLL MAAASAAFYGYHWYLNRPQPIEPAPMVYQETSIRVSAPRTVNYQAQKPEAQPLSLNFMHSAAPITAMGQVVDQGISLTPA IEGEWKWATERTLVFTPKKAWPMGANYQITIDTEKLLAPQIKLNQTELNFTTPAFAYQLEKAEYYQDPQEAQKRSTIFHV QFNAPVDVASFEKQILLGLVEGKSKSEKKLNFSVVYDEKKLNAWIHSQPLMPMDKGGSVHLSINKGVNASVAATPTTQAQ NKWVSVPNLYSLAVNSINATLVESDNNNGERALIIAISDAVKDKEIKNAVKAWLLPQHNFQAKESAKTSTDFYPWDMDDI DDNLLQQSTPLALTLNEAEQEYQPIFSFKFDAPSYRTLLIEVNNSLTSVGGYKMPEKIYQIVRVPDYPKTLRFMSQGSLL SMQGDKQISVAARNMTGMKLDIKRVIPSQLQHIVSFKSSEYSSAHFNRLSDEYFTEHFQYQTALNNDNPGEINYQGVDLS RYLANNPSARRGVFLLTLSAWDPEKRDNQQHSEEDYDEDQEWVGDSRFVVITDLGIITKQSQDRSRDVFVQSIHSGLPAA DAKVSVVAKNGVVLLSQITDSKGHVHFPALDAFKNERQPVMFLVEKEGDVSFLPTRATYDRNLDFSRFDIDGEETPSDPR TLSSYLFSDRGVYRPGDRFNIGLITRTANWATALDGVPLRAEIRDPRDTLMSTLPITLDSSGFNELSYTTGENSPTGEWN VYLYLVGKNNETSMLLGHTTVNVKEFEPDRLKVQLQLTPERQQGWVKPQELQANINVQNLFGTPAQERRVTSRLILRPMY PSFAPFPDYLFYENRHNSDGFETELEEQTTDLQGMATIPLDLKSYADATYQLQLLSEAFEAGGGRSVAATARVLVSPYDS LVGVKADGDLSYINRDAVRKLNIIAVDPSLNKIALPDLSLSLIEQKYISVLTKQDSGVYKYQSRLKEQLVSEQPLQISPT GTDFTLVTQQPGDFILVVKDSQGQVLNRISYTVAGNANLTRSLDRNTELKLKLNQAEYLQGEEIEIAINAPYAGSGLITI EKDKVYSWQWFHSDTTSSVQRIRIPPAMEGNGYINVQFVRDVNSDEIFMSPLSYGVMPFKISTKARQAAIELASPSVIKP GEVLPIKVTTDSPQRVVVFAVDEGILQVARYRLKDPLDYFFRKRELSVQSAQILDLILPEFSKLMALTSAPGGDAGEGLD LHLNPFKRKQDKPVAYWSGITEVNGETTFNYPIPDYFNGKIRVMAISATPDRIGKVQTSTTVRDNFILTPNVPAMVAPGD EFDVTVGVSNNLQGLKGKAVDITVRLTPPPQLEVVGEAQHSLSLAEKRETLVSFRLRARSALGDAPLVFDASYGSQSSRR TVSTSVRPAMPFRTQSVMGRMEGNKHTVTNLRQMFDNYAQRQATASHSPLVLTQGLARYLADYPYYSSEQIVSRSIPLIM QSKHPEMDSALNQNEVRDQLKNMLRILSSRQNSTGAIGLWHASPTPDPFVTPYVVQFLLEAKSAGYSLPNDILEGANNAL RLLAARPYDDLYSLRLRAFAVYLLTLQGEITTNTLASVQSTLQQLYPDSWQTDLSAIYLASSYRLLKMDDEANKLLQPTW KQLGKAYSKAWWTQNYFDPLVQDATRLYLITRHFPEKVSSIPPQALENMVLALRDEHYTTYSSAMSILALESYTSQVAAQ QDTPETLQIIEISKSKGIDPNVISTLNGLFVQGDFTGEAKAIQFNNYASAPAWYVVNQSGYDLQPPKDAISNGLEISRSY TDEQGKPVTQVTLGQKVNVHLKIRANAKQGQNNLAIVDLLPGGFEVVQQTAPEPEFYDNQDDQDEETGSGWQSPLMVSGS 
SWYPDYSDIREDRVIIYGSASTDVKEFIYQIKSTNTGRFVVPPAYGEAMYDRNVQALSVGKGHILVVPPEAK
Sequences:
>Translated_1992_residues MDLLRFLLISPFALIKGLYRLSAYLLRLVGRLLRPVVGNLNWRAPQWMTKTANGLHCAFNRSEQWVAKHPKGISAAIVLL MAAASAAFYGYHWYLNRPQPIEPAPMVYQETSIRVSAPRTVNYQAQKPEAQPLSLNFMHSAAPITAMGQVVDQGISLTPA IEGEWKWATERTLVFTPKKAWPMGANYQITIDTEKLLAPQIKLNQTELNFTTPAFAYQLEKAEYYQDPQEAQKRSTIFHV QFNAPVDVASFEKQILLGLVEGKSKSEKKLNFSVVYDEKKLNAWIHSQPLMPMDKGGSVHLSINKGVNASVAATPTTQAQ NKWVSVPNLYSLAVNSINATLVESDNNNGERALIIAISDAVKDKEIKNAVKAWLLPQHNFQAKESAKTSTDFYPWDMDDI DDNLLQQSTPLALTLNEAEQEYQPIFSFKFDAPSYRTLLIEVNNSLTSVGGYKMPEKIYQIVRVPDYPKTLRFMSQGSLL SMQGDKQISVAARNMTGMKLDIKRVIPSQLQHIVSFKSSEYSSAHFNRLSDEYFTEHFQYQTALNNDNPGEINYQGVDLS RYLANNPSARRGVFLLTLSAWDPEKRDNQQHSEEDYDEDQEWVGDSRFVVITDLGIITKQSQDRSRDVFVQSIHSGLPAA DAKVSVVAKNGVVLLSQITDSKGHVHFPALDAFKNERQPVMFLVEKEGDVSFLPTRATYDRNLDFSRFDIDGEETPSDPR TLSSYLFSDRGVYRPGDRFNIGLITRTANWATALDGVPLRAEIRDPRDTLMSTLPITLDSSGFNELSYTTGENSPTGEWN VYLYLVGKNNETSMLLGHTTVNVKEFEPDRLKVQLQLTPERQQGWVKPQELQANINVQNLFGTPAQERRVTSRLILRPMY PSFAPFPDYLFYENRHNSDGFETELEEQTTDLQGMATIPLDLKSYADATYQLQLLSEAFEAGGGRSVAATARVLVSPYDS LVGVKADGDLSYINRDAVRKLNIIAVDPSLNKIALPDLSLSLIEQKYISVLTKQDSGVYKYQSRLKEQLVSEQPLQISPT GTDFTLVTQQPGDFILVVKDSQGQVLNRISYTVAGNANLTRSLDRNTELKLKLNQAEYLQGEEIEIAINAPYAGSGLITI EKDKVYSWQWFHSDTTSSVQRIRIPPAMEGNGYINVQFVRDVNSDEIFMSPLSYGVMPFKISTKARQAAIELASPSVIKP GEVLPIKVTTDSPQRVVVFAVDEGILQVARYRLKDPLDYFFRKRELSVQSAQILDLILPEFSKLMALTSAPGGDAGEGLD LHLNPFKRKQDKPVAYWSGITEVNGETTFNYPIPDYFNGKIRVMAISATPDRIGKVQTSTTVRDNFILTPNVPAMVAPGD EFDVTVGVSNNLQGLKGKAVDITVRLTPPPQLEVVGEAQHSLSLAEKRETLVSFRLRARSALGDAPLVFDASYGSQSSRR TVSTSVRPAMPFRTQSVMGRMEGNKHTVTNLRQMFDNYAQRQATASHSPLVLTQGLARYLADYPYYSSEQIVSRSIPLIM QSKHPEMDSALNQNEVRDQLKNMLRILSSRQNSTGAIGLWHASPTPDPFVTPYVVQFLLEAKSAGYSLPNDILEGANNAL RLLAARPYDDLYSLRLRAFAVYLLTLQGEITTNTLASVQSTLQQLYPDSWQTDLSAIYLASSYRLLKMDDEANKLLQPTW KQLGKAYSKAWWTQNYFDPLVQDATRLYLITRHFPEKVSSIPPQALENMVLALRDEHYTTYSSAMSILALESYTSQVAAQ QDTPETLQIIEISKSKGIDPNVISTLNGLFVQGDFTGEAKAIQFNNYASAPAWYVVNQSGYDLQPPKDAISNGLEISRSY TDEQGKPVTQVTLGQKVNVHLKIRANAKQGQNNLAIVDLLPGGFEVVQQTAPEPEFYDNQDDQDEETGSGWQSPLMVSGS 
SWYPDYSDIREDRVIIYGSASTDVKEFIYQIKSTNTGRFVVPPAYGEAMYDRNVQALSVGKGHILVVPPEAK >Mature_1992_residues MDLLRFLLISPFALIKGLYRLSAYLLRLVGRLLRPVVGNLNWRAPQWMTKTANGLHCAFNRSEQWVAKHPKGISAAIVLL MAAASAAFYGYHWYLNRPQPIEPAPMVYQETSIRVSAPRTVNYQAQKPEAQPLSLNFMHSAAPITAMGQVVDQGISLTPA IEGEWKWATERTLVFTPKKAWPMGANYQITIDTEKLLAPQIKLNQTELNFTTPAFAYQLEKAEYYQDPQEAQKRSTIFHV QFNAPVDVASFEKQILLGLVEGKSKSEKKLNFSVVYDEKKLNAWIHSQPLMPMDKGGSVHLSINKGVNASVAATPTTQAQ NKWVSVPNLYSLAVNSINATLVESDNNNGERALIIAISDAVKDKEIKNAVKAWLLPQHNFQAKESAKTSTDFYPWDMDDI DDNLLQQSTPLALTLNEAEQEYQPIFSFKFDAPSYRTLLIEVNNSLTSVGGYKMPEKIYQIVRVPDYPKTLRFMSQGSLL SMQGDKQISVAARNMTGMKLDIKRVIPSQLQHIVSFKSSEYSSAHFNRLSDEYFTEHFQYQTALNNDNPGEINYQGVDLS RYLANNPSARRGVFLLTLSAWDPEKRDNQQHSEEDYDEDQEWVGDSRFVVITDLGIITKQSQDRSRDVFVQSIHSGLPAA DAKVSVVAKNGVVLLSQITDSKGHVHFPALDAFKNERQPVMFLVEKEGDVSFLPTRATYDRNLDFSRFDIDGEETPSDPR TLSSYLFSDRGVYRPGDRFNIGLITRTANWATALDGVPLRAEIRDPRDTLMSTLPITLDSSGFNELSYTTGENSPTGEWN VYLYLVGKNNETSMLLGHTTVNVKEFEPDRLKVQLQLTPERQQGWVKPQELQANINVQNLFGTPAQERRVTSRLILRPMY PSFAPFPDYLFYENRHNSDGFETELEEQTTDLQGMATIPLDLKSYADATYQLQLLSEAFEAGGGRSVAATARVLVSPYDS LVGVKADGDLSYINRDAVRKLNIIAVDPSLNKIALPDLSLSLIEQKYISVLTKQDSGVYKYQSRLKEQLVSEQPLQISPT GTDFTLVTQQPGDFILVVKDSQGQVLNRISYTVAGNANLTRSLDRNTELKLKLNQAEYLQGEEIEIAINAPYAGSGLITI EKDKVYSWQWFHSDTTSSVQRIRIPPAMEGNGYINVQFVRDVNSDEIFMSPLSYGVMPFKISTKARQAAIELASPSVIKP GEVLPIKVTTDSPQRVVVFAVDEGILQVARYRLKDPLDYFFRKRELSVQSAQILDLILPEFSKLMALTSAPGGDAGEGLD LHLNPFKRKQDKPVAYWSGITEVNGETTFNYPIPDYFNGKIRVMAISATPDRIGKVQTSTTVRDNFILTPNVPAMVAPGD EFDVTVGVSNNLQGLKGKAVDITVRLTPPPQLEVVGEAQHSLSLAEKRETLVSFRLRARSALGDAPLVFDASYGSQSSRR TVSTSVRPAMPFRTQSVMGRMEGNKHTVTNLRQMFDNYAQRQATASHSPLVLTQGLARYLADYPYYSSEQIVSRSIPLIM QSKHPEMDSALNQNEVRDQLKNMLRILSSRQNSTGAIGLWHASPTPDPFVTPYVVQFLLEAKSAGYSLPNDILEGANNAL RLLAARPYDDLYSLRLRAFAVYLLTLQGEITTNTLASVQSTLQQLYPDSWQTDLSAIYLASSYRLLKMDDEANKLLQPTW KQLGKAYSKAWWTQNYFDPLVQDATRLYLITRHFPEKVSSIPPQALENMVLALRDEHYTTYSSAMSILALESYTSQVAAQ QDTPETLQIIEISKSKGIDPNVISTLNGLFVQGDFTGEAKAIQFNNYASAPAWYVVNQSGYDLQPPKDAISNGLEISRSY 
TDEQGKPVTQVTLGQKVNVHLKIRANAKQGQNNLAIVDLLPGGFEVVQQTAPEPEFYDNQDDQDEETGSGWQSPLMVSGS SWYPDYSDIREDRVIIYGSASTDVKEFIYQIKSTNTGRFVVPPAYGEAMYDRNVQALSVGKGHILVVPPEAK
Specific function: Unknown
COG id: COG2373
COG function: function code R; Large extracellular alpha-helical protein
Gene ontology:
Cell location: Attached to the membrane by a lipid anchor (Potential) [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the UPF0192 family
Homologues:
Organism=Escherichia coli, GI1788868, Length=1256, Percent_Identity=22.8503184713376, Blast_Score=215, Evalue=3e-56,
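The homologue entry above is a comma-separated run of `key=value` fields plus a bare `GI…` token. A hedged Python sketch for reading such a line into a dictionary is shown below. The function name and the `GI` key are assumptions made for illustration, and the parser assumes that no field value contains a comma.

```python
def parse_homologue(line: str) -> dict:
    """Split a homologue line into fields; values are kept as strings."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:              # skip the empty field after a trailing comma
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]   # bare token such as "GI1788868"
    return record


line = ("Organism=Escherichia coli, GI1788868, Length=1256, "
        "Percent_Identity=22.8503184713376, Blast_Score=215, Evalue=3e-56,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], round(float(hit["Percent_Identity"]), 2))
```

Numeric fields are left as strings so that the caller decides how to convert them, for example `int(hit["Length"])` or `float(hit["Evalue"])`.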
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): Y2573_YERPE (Q8ZDJ2)
Other databases:
- EMBL: AL590842
- EMBL: AE009952
- EMBL: AE017042
- PIR: AC0314
- RefSeq: NP_668470.1
- RefSeq: NP_992509.1
- RefSeq: YP_002347532.1
- ProteinModelPortal: Q8ZDJ2
- IntAct: Q8ZDJ2
- GeneID: 1146090
- GeneID: 1175404
- GeneID: 2767042
- GenomeReviews: AE009952_GR
- GenomeReviews: AE017042_GR
- GenomeReviews: AL590842_GR
- KEGG: ype:YPO2573
- KEGG: ypk:y1143
- KEGG: ypm:YP_1141
- HOGENOM: HBG368529
- OMA: VYAMHFL
- ProtClustDB: CLSK870189
- BioCyc: YPES187410:Y1143-MONOMER
- BioCyc: YPES214092:YPO2573-MONOMER
- InterPro: IPR002890
- InterPro: IPR011625
- InterPro: IPR021868
- InterPro: IPR001599
- InterPro: IPR008930
Pfam domain/function: PF00207 A2M; PF01835 A2M_N; PF07703 A2M_N_2; PF11974 MG1; SSF48239 Terp_cyc_toroid
EC number: NA
Molecular weight: Translated: 222824; Mature: 222824
Theoretical pI: Translated: 5.79; Mature: 5.79
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.1 %Cys (Translated Protein)
1.9 %Met (Translated Protein)
2.0 %Cys+Met (Translated Protein)
0.1 %Cys (Mature Protein)
1.9 %Met (Mature Protein)
2.0 %Cys+Met (Mature Protein)
Secondary structure:
>Translated Secondary Structure MDLLRFLLISPFALIKGLYRLSAYLLRLVGRLLRPVVGNLNWRAPQWMTKTANGLHCAFN CHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHCCCCCEEEEEC RSEQWVAKHPKGISAAIVLLMAAASAAFYGYHWYLNRPQPIEPAPMVYQETSIRVSAPRT CCCHHHHCCCCCHHHHHHHHHHHHHHHEEEEEEECCCCCCCCCCCEEEEECEEEEECCCE VNYQAQKPEAQPLSLNFMHSAAPITAMGQVVDQGISLTPAIEGEWKWATERTLVFTPKKA ECCCCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCEECCCCCCCEEEECCEEEEECCCCC WPMGANYQITIDTEKLLAPQIKLNQTELNFTTPAFAYQLEKAEYYQDPQEAQKRSTIFHV CCCCCCEEEEEEHHHHCCCCEEECCEEEEECCCHHHEEEHHHHHCCCHHHHHHCCEEEEE QFNAPVDVASFEKQILLGLVEGKSKSEKKLNFSVVYDEKKLNAWIHSQPLMPMDKGGSVH EECCCCCHHHHHHHHHEEEECCCCCCCCEEEEEEEEECHHHHHEECCCCCCCCCCCCEEE LSINKGVNASVAATPTTQAQNKWVSVPNLYSLAVNSINATLVESDNNNGERALIIAISDA EEECCCCCCEEEECCCCCCCCCEECCCCHHHHHHCCCEEEEEECCCCCCCEEEEEEECCC VKDKEIKNAVKAWLLPQHNFQAKESAKTSTDFYPWDMDDIDDNLLQQSTPLALTLNEAEQ CCHHHHHHHHHHHCCCCCCCCCHHCCCCCCCCCCCCCCCCCHHHHHCCCCEEEEECCHHH EYQPIFSFKFDAPSYRTLLIEVNNSLTSVGGYKMPEKIYQIVRVPDYPKTLRFMSQGSLL HCCCEEEEEECCCCEEEEEEEECCCEECCCCCCCHHHHHHHHCCCCCHHHHHHHCCCCEE SMQGDKQISVAARNMTGMKLDIKRVIPSQLQHIVSFKSSEYSSAHFNRLSDEYFTEHFQY EECCCCEEEEEECCCCCCEEEHHHHHHHHHHHHHHHCCCCCCCHHHHHCCHHHHHHHHEE QTALNNDNPGEINYQGVDLSRYLANNPSARRGVFLLTLSAWDPEKRDNQQHSEEDYDEDQ EEECCCCCCCEEEEECCCHHHHHCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCHH EWVGDSRFVVITDLGIITKQSQDRSRDVFVQSIHSGLPAADAKVSVVAKNGVVLLSQITD HHCCCCCEEEEEECCEEECCCCCCHHHHHHHHHHCCCCCCCCEEEEEECCCEEEEEEECC SKGHVHFPALDAFKNERQPVMFLVEKEGDVSFLPTRATYDRNLDFSRFDIDGEETPSDPR CCCEEECCCHHHHCCCCCCEEEEEECCCCEEEECCCCCCCCCCCCEEEECCCCCCCCCHH TLSSYLFSDRGVYRPGDRFNIGLITRTANWATALDGVPLRAEIRDPRDTLMSTLPITLDS HHHHHHHCCCCCCCCCCCEEEEEEEECCCHHHHHCCCCEEEECCCHHHHHHHHCCEEECC SGFNELSYTTGENSPTGEWNVYLYLVGKNNETSMLLGHTTVNVKEFEPDRLKVQLQLTPE CCCCCCEEECCCCCCCCCEEEEEEEEECCCCCEEEEEEEEEEEEECCCCEEEEEEEECCH RQQGWVKPQELQANINVQNLFGTPAQERRVTSRLILRPMYPSFAPFPDYLFYENRHNSDG HHHCCCCHHHHEECCEEEEECCCCHHHHHHHHHHHHCCCCCCCCCCCCCEEEECCCCCCC FETELEEQTTDLQGMATIPLDLKSYADATYQLQLLSEAFEAGGGRSVAATARVLVSPYDS CCCHHHHHHCCCCCEEECCCCHHHHCCCCEEHHHHHHHHHCCCCCEEHHHHHEEECCCHH 
LVGVKADGDLSYINRDAVRKLNIIAVDPSLNKIALPDLSLSLIEQKYISVLTKQDSGVYK HEEEECCCCHHHCCHHHHCEEEEEEECCCCCEEECCCCHHHHHHHHHHHHHCCCCCCHHH YQSRLKEQLVSEQPLQISPTGTDFTLVTQQPGDFILVVKDSQGQVLNRISYTVAGNANLT HHHHHHHHHHCCCCEEECCCCCCEEEEEECCCCEEEEEECCCCCEEHHEEEEEECCCCEE RSLDRNTELKLKLNQAEYLQGEEIEIAINAPYAGSGLITIEKDKVYSWQWFHSDTTSSVQ ECCCCCCEEEEEECHHHCCCCCEEEEEEECCCCCCEEEEEECCCEEEEEEECCCCCCCCE RIRIPPAMEGNGYINVQFVRDVNSDEIFMSPLSYGVMPFKISTKARQAAIELASPSVIKP EEECCCCCCCCCEEEEEEEEECCCCCEEECHHHCCCEEEEECCHHHHHHHHHCCCCCCCC GEVLPIKVTTDSPQRVVVFAVDEGILQVARYRLKDPLDYFFRKRELSVQSAQILDLILPE CCEEEEEEECCCCCEEEEEEECHHHHHHHHHHHCCCHHHHHHHHHCCHHHHHHHHHHHHH FSKLMALTSAPGGDAGEGLDLHLNPFKRKQDKPVAYWSGITEVNGETTFNYPIPDYFNGK HHHHHEEECCCCCCCCCCEEEEECCCCCCCCCCEEEECCEEECCCCEEECCCCCCCCCCE IRVMAISATPDRIGKVQTSTTVRDNFILTPNVPAMVAPGDEFDVTVGVSNNLQGLKGKAV EEEEEEECCHHHHCCEEECCEECCCEEECCCCCEEEECCCCEEEEEECCCCCCCCCCCEE DITVRLTPPPQLEVVGEAQHSLSLAEKRETLVSFRLRARSALGDAPLVFDASYGSQSSRR EEEEEECCCCCEEEECCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCC TVSTSVRPAMPFRTQSVMGRMEGNKHTVTNLRQMFDNYAQRQATASHSPLVLTQGLARYL EEECCCCCCCCCCHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH ADYPYYSSEQIVSRSIPLIMQSKHPEMDSALNQNEVRDQLKNMLRILSSRQNSTGAIGLW HHCCCCCCHHHHHCCCCEEEECCCCCHHHHCCHHHHHHHHHHHHHHHHCCCCCCCEEEEE HASPTPDPFVTPYVVQFLLEAKSAGYSLPNDILEGANNALRLLAARPYDDLYSLRLRAFA ECCCCCCCCCCHHHHHHHHHHHHCCCCCCHHHHHCCCCEEEEEEECCCHHHHHHHHHHHE VYLLTLQGEITTNTLASVQSTLQQLYPDSWQTDLSAIYLASSYRLLKMDDEANKLLQPTW EEEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCEEEEEECCHHHHHHHHHH KQLGKAYSKAWWTQNYFDPLVQDATRLYLITRHFPEKVSSIPPQALENMVLALRDEHYTT HHHHHHHHHHHCCCHHHHHHHHCCCEEEEEEHHCHHHHHCCCHHHHHHHHHHHHCCCCEE YSSAMSILALESYTSQVAAQQDTPETLQIIEISKSKGIDPNVISTLNGLFVQGDFTGEAK HHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCCCCHHHHHCCCEEEEECCCCCEE AIQFNNYASAPAWYVVNQSGYDLQPPKDAISNGLEISRSYTDEQGKPVTQVTLGQKVNVH EEEECCCCCCCEEEEECCCCCCCCCCHHHHCCCCEEEECCCCCCCCCEEEEECCCEEEEE LKIRANAKQGQNNLAIVDLLPGGFEVVQQTAPEPEFYDNQDDQDEETGSGWQSPLMVSGS EEEEECCCCCCCCEEEEEECCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCEEEECC 
SWYPDYSDIREDRVIIYGSASTDVKEFIYQIKSTNTGRFVVPPAYGEAMYDRNVQALSVG CCCCCHHHHCCCEEEEEECCCCHHHHHHHHHHCCCCCEEEECCCCCCHHHCCCCEEEEEC KGHILVVPPEAK CCEEEEECCCCC >Mature Secondary Structure MDLLRFLLISPFALIKGLYRLSAYLLRLVGRLLRPVVGNLNWRAPQWMTKTANGLHCAFN CHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHCCCCCEEEEEC RSEQWVAKHPKGISAAIVLLMAAASAAFYGYHWYLNRPQPIEPAPMVYQETSIRVSAPRT CCCHHHHCCCCCHHHHHHHHHHHHHHHEEEEEEECCCCCCCCCCCEEEEECEEEEECCCE VNYQAQKPEAQPLSLNFMHSAAPITAMGQVVDQGISLTPAIEGEWKWATERTLVFTPKKA ECCCCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCEECCCCCCCEEEECCEEEEECCCCC WPMGANYQITIDTEKLLAPQIKLNQTELNFTTPAFAYQLEKAEYYQDPQEAQKRSTIFHV CCCCCCEEEEEEHHHHCCCCEEECCEEEEECCCHHHEEEHHHHHCCCHHHHHHCCEEEEE QFNAPVDVASFEKQILLGLVEGKSKSEKKLNFSVVYDEKKLNAWIHSQPLMPMDKGGSVH EECCCCCHHHHHHHHHEEEECCCCCCCCEEEEEEEEECHHHHHEECCCCCCCCCCCCEEE LSINKGVNASVAATPTTQAQNKWVSVPNLYSLAVNSINATLVESDNNNGERALIIAISDA EEECCCCCCEEEECCCCCCCCCEECCCCHHHHHHCCCEEEEEECCCCCCCEEEEEEECCC VKDKEIKNAVKAWLLPQHNFQAKESAKTSTDFYPWDMDDIDDNLLQQSTPLALTLNEAEQ CCHHHHHHHHHHHCCCCCCCCCHHCCCCCCCCCCCCCCCCCHHHHHCCCCEEEEECCHHH EYQPIFSFKFDAPSYRTLLIEVNNSLTSVGGYKMPEKIYQIVRVPDYPKTLRFMSQGSLL HCCCEEEEEECCCCEEEEEEEECCCEECCCCCCCHHHHHHHHCCCCCHHHHHHHCCCCEE SMQGDKQISVAARNMTGMKLDIKRVIPSQLQHIVSFKSSEYSSAHFNRLSDEYFTEHFQY EECCCCEEEEEECCCCCCEEEHHHHHHHHHHHHHHHCCCCCCCHHHHHCCHHHHHHHHEE QTALNNDNPGEINYQGVDLSRYLANNPSARRGVFLLTLSAWDPEKRDNQQHSEEDYDEDQ EEECCCCCCCEEEEECCCHHHHHCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCHH EWVGDSRFVVITDLGIITKQSQDRSRDVFVQSIHSGLPAADAKVSVVAKNGVVLLSQITD HHCCCCCEEEEEECCEEECCCCCCHHHHHHHHHHCCCCCCCCEEEEEECCCEEEEEEECC SKGHVHFPALDAFKNERQPVMFLVEKEGDVSFLPTRATYDRNLDFSRFDIDGEETPSDPR CCCEEECCCHHHHCCCCCCEEEEEECCCCEEEECCCCCCCCCCCCEEEECCCCCCCCCHH TLSSYLFSDRGVYRPGDRFNIGLITRTANWATALDGVPLRAEIRDPRDTLMSTLPITLDS HHHHHHHCCCCCCCCCCCEEEEEEEECCCHHHHHCCCCEEEECCCHHHHHHHHCCEEECC SGFNELSYTTGENSPTGEWNVYLYLVGKNNETSMLLGHTTVNVKEFEPDRLKVQLQLTPE CCCCCCEEECCCCCCCCCEEEEEEEEECCCCCEEEEEEEEEEEEECCCCEEEEEEEECCH RQQGWVKPQELQANINVQNLFGTPAQERRVTSRLILRPMYPSFAPFPDYLFYENRHNSDG 
HHHCCCCHHHHEECCEEEEECCCCHHHHHHHHHHHHCCCCCCCCCCCCCEEEECCCCCCC FETELEEQTTDLQGMATIPLDLKSYADATYQLQLLSEAFEAGGGRSVAATARVLVSPYDS CCCHHHHHHCCCCCEEECCCCHHHHCCCCEEHHHHHHHHHCCCCCEEHHHHHEEECCCHH LVGVKADGDLSYINRDAVRKLNIIAVDPSLNKIALPDLSLSLIEQKYISVLTKQDSGVYK HEEEECCCCHHHCCHHHHCEEEEEEECCCCCEEECCCCHHHHHHHHHHHHHCCCCCCHHH YQSRLKEQLVSEQPLQISPTGTDFTLVTQQPGDFILVVKDSQGQVLNRISYTVAGNANLT HHHHHHHHHHCCCCEEECCCCCCEEEEEECCCCEEEEEECCCCCEEHHEEEEEECCCCEE RSLDRNTELKLKLNQAEYLQGEEIEIAINAPYAGSGLITIEKDKVYSWQWFHSDTTSSVQ ECCCCCCEEEEEECHHHCCCCCEEEEEEECCCCCCEEEEEECCCEEEEEEECCCCCCCCE RIRIPPAMEGNGYINVQFVRDVNSDEIFMSPLSYGVMPFKISTKARQAAIELASPSVIKP EEECCCCCCCCCEEEEEEEEECCCCCEEECHHHCCCEEEEECCHHHHHHHHHCCCCCCCC GEVLPIKVTTDSPQRVVVFAVDEGILQVARYRLKDPLDYFFRKRELSVQSAQILDLILPE CCEEEEEEECCCCCEEEEEEECHHHHHHHHHHHCCCHHHHHHHHHCCHHHHHHHHHHHHH FSKLMALTSAPGGDAGEGLDLHLNPFKRKQDKPVAYWSGITEVNGETTFNYPIPDYFNGK HHHHHEEECCCCCCCCCCEEEEECCCCCCCCCCEEEECCEEECCCCEEECCCCCCCCCCE IRVMAISATPDRIGKVQTSTTVRDNFILTPNVPAMVAPGDEFDVTVGVSNNLQGLKGKAV EEEEEEECCHHHHCCEEECCEECCCEEECCCCCEEEECCCCEEEEEECCCCCCCCCCCEE DITVRLTPPPQLEVVGEAQHSLSLAEKRETLVSFRLRARSALGDAPLVFDASYGSQSSRR EEEEEECCCCCEEEECCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCC TVSTSVRPAMPFRTQSVMGRMEGNKHTVTNLRQMFDNYAQRQATASHSPLVLTQGLARYL EEECCCCCCCCCCHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH ADYPYYSSEQIVSRSIPLIMQSKHPEMDSALNQNEVRDQLKNMLRILSSRQNSTGAIGLW HHCCCCCCHHHHHCCCCEEEECCCCCHHHHCCHHHHHHHHHHHHHHHHCCCCCCCEEEEE HASPTPDPFVTPYVVQFLLEAKSAGYSLPNDILEGANNALRLLAARPYDDLYSLRLRAFA ECCCCCCCCCCHHHHHHHHHHHHCCCCCCHHHHHCCCCEEEEEEECCCHHHHHHHHHHHE VYLLTLQGEITTNTLASVQSTLQQLYPDSWQTDLSAIYLASSYRLLKMDDEANKLLQPTW EEEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCEEEEEECCHHHHHHHHHH KQLGKAYSKAWWTQNYFDPLVQDATRLYLITRHFPEKVSSIPPQALENMVLALRDEHYTT HHHHHHHHHHHCCCHHHHHHHHCCCEEEEEEHHCHHHHHCCCHHHHHHHHHHHHCCCCEE YSSAMSILALESYTSQVAAQQDTPETLQIIEISKSKGIDPNVISTLNGLFVQGDFTGEAK HHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCCCCHHHHHCCCEEEEECCCCCEE AIQFNNYASAPAWYVVNQSGYDLQPPKDAISNGLEISRSYTDEQGKPVTQVTLGQKVNVH 
EEEECCCCCCCEEEEECCCCCCCCCCHHHHCCCCEEEECCCCCCCCCEEEEECCCEEEEE LKIRANAKQGQNNLAIVDLLPGGFEVVQQTAPEPEFYDNQDDQDEETGSGWQSPLMVSGS EEEEECCCCCCCCEEEEEECCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCEEEECC SWYPDYSDIREDRVIIYGSASTDVKEFIYQIKSTNTGRFVVPPAYGEAMYDRNVQALSVG CCCCCHHHHCCCEEEEEECCCCHHHHHHHHHHCCCCCEEEECCCCCCHHHCCCCEEEEEC KGHILVVPPEAK CCEEEEECCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 11586360; 12142430