| Definition | Yersinia pestis Nepal516, complete genome. |
|---|---|
| Accession | NC_008149 |
| Length | 4,534,590 |
The map label for this gene is 108812243
Identifier: 108812243
GI number: 108812243
Start: 2348272
End: 2350305
Strand: Reverse
Name: 108812243
Synonym: YPN_2081
Alternate gene names: NA
Gene position: 2350305-2348272 (Counterclockwise)
Preceding gene: 108812244
Following gene: 108812242
Centisome position: 51.83
GC content: 51.72
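The coordinate fields above are related by simple arithmetic, and a short Python sketch can cross-check them. The function names are illustrative and not part of the database. The sketch assumes 1-based inclusive coordinates, and it assumes the centisome for a reverse-strand gene is taken from the higher coordinate (the first position listed under Gene position).

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    return abs(end - start) + 1


def centisome(position: int, genome_length: int) -> float:
    """Position expressed as a percentage of the genome length."""
    return round(100.0 * position / genome_length, 2)


GENOME_LENGTH = 4_534_590          # Length field of NC_008149
START, END = 2_348_272, 2_350_305  # Start and End fields of this record

print(gene_length(START, END))        # 2034, matching the 2034-base gene sequence
# Assumption: a reverse-strand gene is anchored at its higher coordinate.
print(centisome(END, GENOME_LENGTH))  # 51.83, matching the Centisome position field
```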
Gene sequence:
>2034_bases GTGGGCCGCGACCTGTTGCTGCAAAGCCAGCAGGACAGCGATAACTATGATGCGAAGCAGCAAAGTAGCAGTGTTGGCGG CAGTTTCAGCCCTGGCTCCATGACGGGCAGTATCAGTATCAATGGCAGCCAGGACAAGCTGAACAGCAACTTTGACTCGG TGCAGGAGCAGACGGGTATCTTTGCCGGTTCGGGCGGCTTTGATATCACGGTAGGTGGACACACTCAGCTTGACGGTGCG GTGATTGGCAGCACCGTGACAGCCGATAAAAACACGCTGGATACCGGGACACTGGGCTTCAGTGATATCGATAATCAAGC CGATTTCAAGGTTGAACATCAAAGTGTGGGTATCAGCACCGGGGGGAATATCGGCAGTCAGTTTGTTGGCAATATGGCCA ACGGCTTGCTGGTCGGGGCCAATAACGAAGGTCACGCCGACAGCACCACCCATGCGGCCGTTTCTGAAGGTACGATCACG GTGCGCGACACGGATAACCAGCAGCAGAATGTTGATGACCTGAGCCGTGATGTGGAGCAGGCCAACAATGCCCTTTCCCC TATCTTTGATAAAGAGAAAGAACAAAACCGGCTGAAGGAAGCGCAGCTTATCGGCGAGATAGGCAGTCAGGTGGGTGATG TGTTCCGCACGCAAGGGCAGATTATCGCCACCCAGGCGGCGAATGAAAAAATGCAGGGGGTGAGTGAGGCTGATCGTGAG GCGGCGAAAGCCAACTGGGAAAAAGCCAATCCGGGTCAGGTTGCAACGGCTGAGGATATCAACGGTCAGGTTTATAAAAC GGCCTATGATCAGGCATTCAATGCGTCGGGTTACGGCACCGGGGGTAAATTCCAGCAGACGGTACAAGCGGCGACAGCGG CCCTCCAGGGGCTGGCGGGCGGAGATATAGCCAAAGCGATAGCGGGAGGCAGTGCGCCGTATCTGGCGGAAGTGATTAAG CAAAGCACGGGTGATAACGAAGAAGCGCGACTGGCGGCACATGCGGTGGTCGGTTCTGTTCTGGCACATCTACAGGGCAA TAGTGCGGTTGCGGGAGGCGCAGGTGCCTTGACGGGTGAGATAGCGGCTGATTTAATCATGCAGCAGTTGTACCCGGGGA AAATGGTTAGCGAACTCAGCGAGACAGAAAAACAGACCATCAGCGCGTTAAGTACATTAGCGGCAGGGCTGGCGGGGGGC TTGACGGGAGACAGCAGCGCCGACGCGGTTGCGGGTGCGCAGGCTGGGAAAAATGCGGTGGAGAATAATGCGCTGGGTCT GGCTCTGAAGGGATGTGGTATCGCTGCACCCTGCCGAAGTCTGATAGCGAAGCAAGTTTTGGAGATCGGTGTTAAAGCGG GTATTACGGGTATTGTCGCCAAAGAGATCGCAGACAAGATATCGGAAGACGATCTTGATCATCTTGTTACTCTGAAGATG ATGGGCAATGATGAGATCACCGAAAAATACCTGAATTCGCTTCAGGATAAATATGCACCTGCTCATACCGGTGGTGATCA GAACGCTGGGTCGGGTCCAACCGATACGGGGGGCAATCAAATAGCGGATAATTCGCCAGATCATACAGGTAATGATCAAT CTACAGGTCAGGGTGCTACTAATACTGGTAATACTGATGGCAAGCCGGATGCGGGTGGGAATGTGTTGGTAAATCCGGGG GCGGATCCTCTCACGAAGAAAGACATAGTTTATCTTTCAGAAAACCCTAACGGTAAGATAGATACTGTTATCAATGAAAC TCTATCGGGTAAGAAAAACTTTACCAGTTCGACAACATTGACTTCAGATGAAGCACTCGCTGCTGGTTTAAAATTCTTAG GAACAGGATATAAAGAGATCGGTAAGTCAGGATCTGGTGTTTACCATAGCGCGGATGGAACAAAAGAATTTAGGATTGAC 
TCAGGTTCAATTGATGGGGCTCATGCGCCAGGGGTTCCACATGTACACTTTGGTGTAAAAAATCCTGAAACAGGTAAGTA TATTTCTAACAATCATGTTCCTTATAAAGATTAA
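The GC content field can be recomputed from the gene sequence as the percentage of G and C bases. The following is a minimal sketch with a hypothetical helper name. It strips whitespace first because the sequence in this record is wrapped into space-separated blocks, and it assumes the FASTA-style header (`>2034_bases`) has already been removed.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Remove spaces and line breaks left over from block wrapping.
    seq = "".join(seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


# Small illustrative inputs, not the full gene sequence.
print(gc_content("ATGC"))       # 50.0
print(gc_content("GTGG GCCG"))  # 87.5
```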
Upstream 100 bases:
>100_bases TTAAGGATCTCGTTCAACCGATCCTTAACTGATCGGCATTGGGCCTAGCGATACCTCACTGGTCGGTGCGCAGGTCAGCG GCGAAACGGTGAAGGTGGAG
Downstream 100 bases:
>100_bases GGAAGCAAAATGATCTTGAAAAAAGGCATCGTTAATGATGGTGAATATGTTGGATGGGAAATCCAACTCATTGATGACAC TAAGGGTGAAACAGGAGGTT
Product: hypothetical protein
Products: NA
Alternate protein names: Filamentous Haemagglutinin Outer Membrane Protein; Filamentous Hemagglutinin; Hemagglutinin-Related Protein; Hemolysin; Filamentous Haemagglutinin; Hemagglutinin/Hemolysin-Related Protein; Adhesin/Hemagglutinin; Adhesin/Hemolysin; Adhesin; Large Exoprotein Involved In Heme Utilization Or Adhesion; ShlA/HecA/FhaA Exofamily Protein; Cell Surface Protein; Adhesin HecA Family; Adhesin/Hemagglutinin/Hemolysin; Hemagglutinin-Like Protein; Filamentous Hemagglutinin Outer Membrane Protein; Haemolysin-Like Protein; Hemolysin/Hemagglutinin-Like Protein; Hemolysin/Hemagglutinin-Like Protein HecA; Contact-Dependent Inhibitor A; FhaB Protein; Haemolysin/Haemagglutinin; AT-2 Family Transporter; Cell Surface Haemagluttinin Protein; Adhesin/Haemagluttinin; Adhesin/Hemolysin Adhesin HecA; Hemagglutinin; Filamentous Hemagglutinin/Adhesin; Filamentous Haemagglutinin Family Protein; Filamentous Haemagglutinin N-TerminalAdhesin HecA; Contact-Dependent Inhibition Of Growth Factor CdiA; Filamentous Hemagglutinin Protein; Filamentous Hemagglutinin / Adhesin; Hemagglutinin/Adhesin Repeat-Containing Protein; Hemagglutinin/Hemolysin Family Protein; Exoprotein Adhesin Or Hemolysin; Surface Adhesin; Filamentous Haemagglutinin Adhesin HecA; Hemagglutinin/Hemolysin-Related Exoprotein; Adhesin/Hemagglutinin/Hemolysin Fragment; Hemagglutination Activity Domain Protein; Fimbrial Adhesin; Hemagglutinin-Like Secreted Protein; Member Of ShlA/HecA/FhaA Exoprotein Family
Number of amino acids: Translated: 677; Mature: 676
Protein sequence:
>677_residues MGRDLLLQSQQDSDNYDAKQQSSSVGGSFSPGSMTGSISINGSQDKLNSNFDSVQEQTGIFAGSGGFDITVGGHTQLDGA VIGSTVTADKNTLDTGTLGFSDIDNQADFKVEHQSVGISTGGNIGSQFVGNMANGLLVGANNEGHADSTTHAAVSEGTIT VRDTDNQQQNVDDLSRDVEQANNALSPIFDKEKEQNRLKEAQLIGEIGSQVGDVFRTQGQIIATQAANEKMQGVSEADRE AAKANWEKANPGQVATAEDINGQVYKTAYDQAFNASGYGTGGKFQQTVQAATAALQGLAGGDIAKAIAGGSAPYLAEVIK QSTGDNEEARLAAHAVVGSVLAHLQGNSAVAGGAGALTGEIAADLIMQQLYPGKMVSELSETEKQTISALSTLAAGLAGG LTGDSSADAVAGAQAGKNAVENNALGLALKGCGIAAPCRSLIAKQVLEIGVKAGITGIVAKEIADKISEDDLDHLVTLKM MGNDEITEKYLNSLQDKYAPAHTGGDQNAGSGPTDTGGNQIADNSPDHTGNDQSTGQGATNTGNTDGKPDAGGNVLVNPG ADPLTKKDIVYLSENPNGKIDTVINETLSGKKNFTSSTTLTSDEALAAGLKFLGTGYKEIGKSGSGVYHSADGTKEFRID SGSIDGAHAPGVPHVHFGVKNPETGKYISNNHVPYKD
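The protein sequence follows from the gene sequence by codon translation: 2034 bases make 678 codons, which is 677 residues plus a stop codon. The sketch below uses the standard codon table and is a hypothetical helper, not the database's own pipeline. It assumes the first codon is a valid initiator and always translates it as methionine, which is why the GTG start codon of this gene yields M rather than V.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        # Assumption: the initiator codon is always read as methionine.
        protein.append("M" if i == 0 else amino_acid)
    return "".join(protein)


# First nine codons of the gene sequence in this record.
print(translate("GTGGGCCGCGACCTGTTGCTGCAAAGC"))  # MGRDLLLQS
```

The mature sequence listed in this record is the same product without the initiator methionine, which accounts for the 676-residue count.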
Sequences:
>Translated_677_residues MGRDLLLQSQQDSDNYDAKQQSSSVGGSFSPGSMTGSISINGSQDKLNSNFDSVQEQTGIFAGSGGFDITVGGHTQLDGA VIGSTVTADKNTLDTGTLGFSDIDNQADFKVEHQSVGISTGGNIGSQFVGNMANGLLVGANNEGHADSTTHAAVSEGTIT VRDTDNQQQNVDDLSRDVEQANNALSPIFDKEKEQNRLKEAQLIGEIGSQVGDVFRTQGQIIATQAANEKMQGVSEADRE AAKANWEKANPGQVATAEDINGQVYKTAYDQAFNASGYGTGGKFQQTVQAATAALQGLAGGDIAKAIAGGSAPYLAEVIK QSTGDNEEARLAAHAVVGSVLAHLQGNSAVAGGAGALTGEIAADLIMQQLYPGKMVSELSETEKQTISALSTLAAGLAGG LTGDSSADAVAGAQAGKNAVENNALGLALKGCGIAAPCRSLIAKQVLEIGVKAGITGIVAKEIADKISEDDLDHLVTLKM MGNDEITEKYLNSLQDKYAPAHTGGDQNAGSGPTDTGGNQIADNSPDHTGNDQSTGQGATNTGNTDGKPDAGGNVLVNPG ADPLTKKDIVYLSENPNGKIDTVINETLSGKKNFTSSTTLTSDEALAAGLKFLGTGYKEIGKSGSGVYHSADGTKEFRID SGSIDGAHAPGVPHVHFGVKNPETGKYISNNHVPYKD >Mature_676_residues GRDLLLQSQQDSDNYDAKQQSSSVGGSFSPGSMTGSISINGSQDKLNSNFDSVQEQTGIFAGSGGFDITVGGHTQLDGAV IGSTVTADKNTLDTGTLGFSDIDNQADFKVEHQSVGISTGGNIGSQFVGNMANGLLVGANNEGHADSTTHAAVSEGTITV RDTDNQQQNVDDLSRDVEQANNALSPIFDKEKEQNRLKEAQLIGEIGSQVGDVFRTQGQIIATQAANEKMQGVSEADREA AKANWEKANPGQVATAEDINGQVYKTAYDQAFNASGYGTGGKFQQTVQAATAALQGLAGGDIAKAIAGGSAPYLAEVIKQ STGDNEEARLAAHAVVGSVLAHLQGNSAVAGGAGALTGEIAADLIMQQLYPGKMVSELSETEKQTISALSTLAAGLAGGL TGDSSADAVAGAQAGKNAVENNALGLALKGCGIAAPCRSLIAKQVLEIGVKAGITGIVAKEIADKISEDDLDHLVTLKMM GNDEITEKYLNSLQDKYAPAHTGGDQNAGSGPTDTGGNQIADNSPDHTGNDQSTGQGATNTGNTDGKPDAGGNVLVNPGA DPLTKKDIVYLSENPNGKIDTVINETLSGKKNFTSSTTLTSDEALAAGLKFLGTGYKEIGKSGSGVYHSADGTKEFRIDS GSIDGAHAPGVPHVHFGVKNPETGKYISNNHVPYKD
Specific function: Unknown
COG id: COG3210
COG function: function code U; Large exoproteins involved in heme utilization or adhesion
Gene ontology:
Cell location: Cytoplasmic
Metabolic importance: NA
Operon status: Not Known
Operon components: None
Similarity: NA
Homologues:
None
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
NA
Pfam domain/function: NA
EC number: NA
Molecular weight: Translated: 69433; Mature: 69301
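A molecular weight of this kind is typically estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below illustrates that approach with standard average residue masses in daltons. It is an assumption that the database used a comparable method, so the result for the full sequence may differ slightly from the values listed here. The translated and mature values in this record differ by roughly the mass of one methionine residue, consistent with removal of the initiator methionine.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528


def molecular_weight(protein: str) -> float:
    """Approximate average molecular weight of an unmodified peptide."""
    protein = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[residue] for residue in protein) + WATER


# Glycylglycine as a small sanity check.
print(round(molecular_weight("GG"), 2))  # 132.12
```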
Theoretical pI: Translated: 4.43; Mature: 4.43
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
Translated protein: 0.3% Cys; 1.2% Met; 1.5% Cys+Met
Mature protein: 0.3% Cys; 1.0% Met; 1.3% Cys+Met
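The Cys/Met percentages are residue counts divided by sequence length. The following minimal sketch uses a hypothetical helper name and assumes one-letter amino acid codes with any wrapping whitespace ignored.

```python
def cys_met_content(protein: str) -> dict:
    """Percentage of Cys, Met, and Cys+Met residues, to one decimal place."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    n = len(protein)
    cys = 100.0 * protein.count("C") / n
    met = 100.0 * protein.count("M") / n
    return {
        "Cys": round(cys, 1),
        "Met": round(met, 1),
        "Cys+Met": round(cys + met, 1),
    }


# Toy ten-residue peptide with one Cys and one Met.
print(cys_met_content("MCAAAAAAAA"))
# {'Cys': 10.0, 'Met': 10.0, 'Cys+Met': 20.0}
```

Applied to the translated and mature sequences, the only expected difference is the Met fraction, since the mature form lacks the initiator methionine.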
Secondary structure:
>Translated Secondary Structure MGRDLLLQSQQDSDNYDAKQQSSSVGGSFSPGSMTGSISINGSQDKLNSNFDSVQEQTGI CCCCCEEECCCCCCCCCHHHHHCCCCCCCCCCCEEEEEEECCCHHHHCCCHHHHHHHHCE FAGSGGFDITVGGHTQLDGAVIGSTVTADKNTLDTGTLGFSDIDNQADFKVEHQSVGIST EECCCCCEEEECCCCCCCCEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEHHEECCCC GGNIGSQFVGNMANGLLVGANNEGHADSTTHAAVSEGTITVRDTDNQQQNVDDLSRDVEQ CCCHHHHHHHHHHCCEEEECCCCCCCCCCHHHEECCCEEEEECCCCCCCCHHHHHHHHHH ANNALSPIFDKEKEQNRLKEAQLIGEIGSQVGDVFRTQGQIIATQAANEKMQGVSEADRE HHCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHCHHHHHHH AAKANWEKANPGQVATAEDINGQVYKTAYDQAFNASGYGTGGKFQQTVQAATAALQGLAG HHHCCCCCCCCCCEEEHHCCCCCEEEEHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCC GDIAKAIAGGSAPYLAEVIKQSTGDNEEARLAAHAVVGSVLAHLQGNSAVAGGAGALTGE CHHHHHHCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCEECCCCCHHHHH IAADLIMQQLYPGKMVSELSETEKQTISALSTLAAGLAGGLTGDSSADAVAGAQAGKNAV HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCHHH ENNALGLALKGCGIAAPCRSLIAKQVLEIGVKAGITGIVAKEIADKISEDDLDHLVTLKM HCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCCCCEEEEEE MGNDEITEKYLNSLQDKYAPAHTGGDQNAGSGPTDTGGNQIADNSPDHTGNDQSTGQGAT CCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC NTGNTDGKPDAGGNVLVNPGADPLTKKDIVYLSENPNGKIDTVINETLSGKKNFTSSTTL CCCCCCCCCCCCCCEEECCCCCCCCCCCEEEEECCCCCCCHHHHHHHHCCCCCCCCCCCC TSDEALAAGLKFLGTGYKEIGKSGSGVYHSADGTKEFRIDSGSIDGAHAPGVPHVHFGVK CCHHHHHHHHHHHCCCHHHHCCCCCCEEECCCCCEEEEECCCCCCCCCCCCCCEEEECCC NPETGKYISNNHVPYKD CCCCCCEECCCCCCCCC >Mature Secondary Structure GRDLLLQSQQDSDNYDAKQQSSSVGGSFSPGSMTGSISINGSQDKLNSNFDSVQEQTGI CCCCEEECCCCCCCCCHHHHHCCCCCCCCCCCEEEEEEECCCHHHHCCCHHHHHHHHCE FAGSGGFDITVGGHTQLDGAVIGSTVTADKNTLDTGTLGFSDIDNQADFKVEHQSVGIST EECCCCCEEEECCCCCCCCEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEHHEECCCC GGNIGSQFVGNMANGLLVGANNEGHADSTTHAAVSEGTITVRDTDNQQQNVDDLSRDVEQ CCCHHHHHHHHHHCCEEEECCCCCCCCCCHHHEECCCEEEEECCCCCCCCHHHHHHHHHH ANNALSPIFDKEKEQNRLKEAQLIGEIGSQVGDVFRTQGQIIATQAANEKMQGVSEADRE HHCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHCHHHHHHH AAKANWEKANPGQVATAEDINGQVYKTAYDQAFNASGYGTGGKFQQTVQAATAALQGLAG 
HHHCCCCCCCCCCEEEHHCCCCCEEEEHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCC GDIAKAIAGGSAPYLAEVIKQSTGDNEEARLAAHAVVGSVLAHLQGNSAVAGGAGALTGE CHHHHHHCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCEECCCCCHHHHH IAADLIMQQLYPGKMVSELSETEKQTISALSTLAAGLAGGLTGDSSADAVAGAQAGKNAV HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCHHH ENNALGLALKGCGIAAPCRSLIAKQVLEIGVKAGITGIVAKEIADKISEDDLDHLVTLKM HCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCCCCEEEEEE MGNDEITEKYLNSLQDKYAPAHTGGDQNAGSGPTDTGGNQIADNSPDHTGNDQSTGQGAT CCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC NTGNTDGKPDAGGNVLVNPGADPLTKKDIVYLSENPNGKIDTVINETLSGKKNFTSSTTL CCCCCCCCCCCCCCEEECCCCCCCCCCCEEEEECCCCCCCHHHHHHHHCCCCCCCCCCCC TSDEALAAGLKFLGTGYKEIGKSGSGVYHSADGTKEFRIDSGSIDGAHAPGVPHVHFGVK CCHHHHHHHHHHHCCCHHHHCCCCCCEEECCCCCEEEEECCCCCCCCCCCCCCEEEECCC NPETGKYISNNHVPYKD CCCCCCEECCCCCCCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: NA