Definition Yersinia pestis Nepal516, complete genome.
Accession NC_008149
Length 4,534,590

The map label for this gene is 108812243

Identifier: 108812243

GI number: 108812243

Start: 2348272

End: 2350305

Strand: Reverse

Name: 108812243

Synonym: YPN_2081

Alternate gene names: NA

Gene position: 2350305-2348272 (Counterclockwise)

Preceding gene: 108812244

Following gene: 108812242

Centisome position: 51.83
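
The coordinate fields above can be cross-checked with a short calculation. This is a minimal sketch, not the database's own method: it assumes the centisome position is the gene's first transcribed base expressed as a percentage of the genome length, which for a reverse-strand gene is the End coordinate. The function names are illustrative.

```python
GENOME_LENGTH = 4_534_590          # Length field of NC_008149
START, END = 2_348_272, 2_350_305  # Start and End fields of this record


def gene_length(start: int, end: int) -> int:
    """Length in bases for inclusive, 1-based coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals."""
    return round(100 * position / genome_length, 2)


# Reverse-strand gene: the coding sequence begins at the higher coordinate (END).
print(gene_length(START, END))        # 2034
print(centisome(END, GENOME_LENGTH))  # 51.83
```

Both values agree with the record: the gene sequence is listed as 2034 bases and the centisome position as 51.83.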

GC content: 51.72
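
A GC content value like the one above is usually the percentage of G and C bases in a sequence. The sketch below assumes that definition and assumes it is applied to the gene sequence itself (the record does not say which region was used). `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (2 decimals)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return round(100 * gc / len(seq), 2)


# Short demo on the first 12 bases of the gene sequence in this record.
# Pass the full 2034-base sequence to compare against the GC content field.
print(gc_content("GTGGGCCGCGAC"))
```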

Gene sequence:

>2034_bases
GTGGGCCGCGACCTGTTGCTGCAAAGCCAGCAGGACAGCGATAACTATGATGCGAAGCAGCAAAGTAGCAGTGTTGGCGG
CAGTTTCAGCCCTGGCTCCATGACGGGCAGTATCAGTATCAATGGCAGCCAGGACAAGCTGAACAGCAACTTTGACTCGG
TGCAGGAGCAGACGGGTATCTTTGCCGGTTCGGGCGGCTTTGATATCACGGTAGGTGGACACACTCAGCTTGACGGTGCG
GTGATTGGCAGCACCGTGACAGCCGATAAAAACACGCTGGATACCGGGACACTGGGCTTCAGTGATATCGATAATCAAGC
CGATTTCAAGGTTGAACATCAAAGTGTGGGTATCAGCACCGGGGGGAATATCGGCAGTCAGTTTGTTGGCAATATGGCCA
ACGGCTTGCTGGTCGGGGCCAATAACGAAGGTCACGCCGACAGCACCACCCATGCGGCCGTTTCTGAAGGTACGATCACG
GTGCGCGACACGGATAACCAGCAGCAGAATGTTGATGACCTGAGCCGTGATGTGGAGCAGGCCAACAATGCCCTTTCCCC
TATCTTTGATAAAGAGAAAGAACAAAACCGGCTGAAGGAAGCGCAGCTTATCGGCGAGATAGGCAGTCAGGTGGGTGATG
TGTTCCGCACGCAAGGGCAGATTATCGCCACCCAGGCGGCGAATGAAAAAATGCAGGGGGTGAGTGAGGCTGATCGTGAG
GCGGCGAAAGCCAACTGGGAAAAAGCCAATCCGGGTCAGGTTGCAACGGCTGAGGATATCAACGGTCAGGTTTATAAAAC
GGCCTATGATCAGGCATTCAATGCGTCGGGTTACGGCACCGGGGGTAAATTCCAGCAGACGGTACAAGCGGCGACAGCGG
CCCTCCAGGGGCTGGCGGGCGGAGATATAGCCAAAGCGATAGCGGGAGGCAGTGCGCCGTATCTGGCGGAAGTGATTAAG
CAAAGCACGGGTGATAACGAAGAAGCGCGACTGGCGGCACATGCGGTGGTCGGTTCTGTTCTGGCACATCTACAGGGCAA
TAGTGCGGTTGCGGGAGGCGCAGGTGCCTTGACGGGTGAGATAGCGGCTGATTTAATCATGCAGCAGTTGTACCCGGGGA
AAATGGTTAGCGAACTCAGCGAGACAGAAAAACAGACCATCAGCGCGTTAAGTACATTAGCGGCAGGGCTGGCGGGGGGC
TTGACGGGAGACAGCAGCGCCGACGCGGTTGCGGGTGCGCAGGCTGGGAAAAATGCGGTGGAGAATAATGCGCTGGGTCT
GGCTCTGAAGGGATGTGGTATCGCTGCACCCTGCCGAAGTCTGATAGCGAAGCAAGTTTTGGAGATCGGTGTTAAAGCGG
GTATTACGGGTATTGTCGCCAAAGAGATCGCAGACAAGATATCGGAAGACGATCTTGATCATCTTGTTACTCTGAAGATG
ATGGGCAATGATGAGATCACCGAAAAATACCTGAATTCGCTTCAGGATAAATATGCACCTGCTCATACCGGTGGTGATCA
GAACGCTGGGTCGGGTCCAACCGATACGGGGGGCAATCAAATAGCGGATAATTCGCCAGATCATACAGGTAATGATCAAT
CTACAGGTCAGGGTGCTACTAATACTGGTAATACTGATGGCAAGCCGGATGCGGGTGGGAATGTGTTGGTAAATCCGGGG
GCGGATCCTCTCACGAAGAAAGACATAGTTTATCTTTCAGAAAACCCTAACGGTAAGATAGATACTGTTATCAATGAAAC
TCTATCGGGTAAGAAAAACTTTACCAGTTCGACAACATTGACTTCAGATGAAGCACTCGCTGCTGGTTTAAAATTCTTAG
GAACAGGATATAAAGAGATCGGTAAGTCAGGATCTGGTGTTTACCATAGCGCGGATGGAACAAAAGAATTTAGGATTGAC
TCAGGTTCAATTGATGGGGCTCATGCGCCAGGGGTTCCACATGTACACTTTGGTGTAAAAAATCCTGAAACAGGTAAGTA
TATTTCTAACAATCATGTTCCTTATAAAGATTAA
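
The gene sequence above is given in coding orientation: it starts with the alternative start codon GTG and ends with the stop codon TAA. Its 2034 bases form 678 codons, and dropping the stop codon leaves the 677 residues reported for the translated protein. The following is a minimal sketch of that translation. It assumes the standard amino-acid assignments and that an initiating GTG or TTG is read as methionine; `translate` and its parameter are illustrative names, not part of the source database.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
START_CODONS = {"ATG", "GTG", "TTG"}  # assumed set of bacterial start codons


def translate(cds: str, initiator_as_met: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        codon = cds[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon at position 0 is decoded as Met.
        if pos == 0 and initiator_as_met and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First eight codons of the gene sequence in this record.
print(translate("GTGGGCCGCGACCTGTTGCTGCAA"))  # MGRDLLLQ
# Last four codons, including the TAA stop.
print(translate("TATAAAGATTAA", initiator_as_met=False))  # YKD
```

The two fragments reproduce the first and last residues of the protein sequence listed in this record.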

Upstream 100 bases:

>100_bases
TTAAGGATCTCGTTCAACCGATCCTTAACTGATCGGCATTGGGCCTAGCGATACCTCACTGGTCGGTGCGCAGGTCAGCG
GCGAAACGGTGAAGGTGGAG

Downstream 100 bases:

>100_bases
GGAAGCAAAATGATCTTGAAAAAAGGCATCGTTAATGATGGTGAATATGTTGGATGGGAAATCCAACTCATTGATGACAC
TAAGGGTGAAACAGGAGGTT

Product: hypothetical protein

Products: NA

Alternate protein names: Filamentous Haemagglutinin Outer Membrane Protein; Filamentous Hemagglutinin; Hemagglutinin-Related Protein; Hemolysin; Filamentous Haemagglutinin; Hemagglutinin/Hemolysin-Related Protein; Adhesin/Hemagglutinin; Adhesin/Hemolysin; Adhesin; Large Exoprotein Involved In Heme Utilization Or Adhesion; ShlA/HecA/FhaA Exofamily Protein; Cell Surface Protein; Adhesin HecA Family; Adhesin/Hemagglutinin/Hemolysin; Hemagglutinin-Like Protein; Filamentous Hemagglutinin Outer Membrane Protein; Haemolysin-Like Protein; Hemolysin/Hemagglutinin-Like Protein; Hemolysin/Hemagglutinin-Like Protein HecA; Contact-Dependent Inhibitor A; FhaB Protein; Haemolysin/Haemagglutinin; AT-2 Family Transporter; Cell Surface Haemagglutinin Protein; Adhesin/Haemagglutinin; Adhesin/Hemolysin Adhesin HecA; Hemagglutinin; Filamentous Hemagglutinin/Adhesin; Filamentous Haemagglutinin Family Protein; Filamentous Haemagglutinin N-Terminal Adhesin HecA; Contact-Dependent Inhibition Of Growth Factor CdiA; Filamentous Hemagglutinin Protein; Filamentous Hemagglutinin / Adhesin; Hemagglutinin/Adhesin Repeat-Containing Protein; Hemagglutinin/Hemolysin Family Protein; Exoprotein Adhesin Or Hemolysin; Surface Adhesin; Filamentous Haemagglutinin Adhesin HecA; Hemagglutinin/Hemolysin-Related Exoprotein; Adhesin/Hemagglutinin/Hemolysin Fragment; Hemagglutination Activity Domain Protein; Fimbrial Adhesin; Hemagglutinin-Like Secreted Protein; Member Of ShlA/HecA/FhaA Exoprotein Family

Number of amino acids: Translated: 677; Mature: 676

Protein sequence:

>677_residues
MGRDLLLQSQQDSDNYDAKQQSSSVGGSFSPGSMTGSISINGSQDKLNSNFDSVQEQTGIFAGSGGFDITVGGHTQLDGA
VIGSTVTADKNTLDTGTLGFSDIDNQADFKVEHQSVGISTGGNIGSQFVGNMANGLLVGANNEGHADSTTHAAVSEGTIT
VRDTDNQQQNVDDLSRDVEQANNALSPIFDKEKEQNRLKEAQLIGEIGSQVGDVFRTQGQIIATQAANEKMQGVSEADRE
AAKANWEKANPGQVATAEDINGQVYKTAYDQAFNASGYGTGGKFQQTVQAATAALQGLAGGDIAKAIAGGSAPYLAEVIK
QSTGDNEEARLAAHAVVGSVLAHLQGNSAVAGGAGALTGEIAADLIMQQLYPGKMVSELSETEKQTISALSTLAAGLAGG
LTGDSSADAVAGAQAGKNAVENNALGLALKGCGIAAPCRSLIAKQVLEIGVKAGITGIVAKEIADKISEDDLDHLVTLKM
MGNDEITEKYLNSLQDKYAPAHTGGDQNAGSGPTDTGGNQIADNSPDHTGNDQSTGQGATNTGNTDGKPDAGGNVLVNPG
ADPLTKKDIVYLSENPNGKIDTVINETLSGKKNFTSSTTLTSDEALAAGLKFLGTGYKEIGKSGSGVYHSADGTKEFRID
SGSIDGAHAPGVPHVHFGVKNPETGKYISNNHVPYKD

Sequences:

>Mature_676_residues
GRDLLLQSQQDSDNYDAKQQSSSVGGSFSPGSMTGSISINGSQDKLNSNFDSVQEQTGIFAGSGGFDITVGGHTQLDGAV
IGSTVTADKNTLDTGTLGFSDIDNQADFKVEHQSVGISTGGNIGSQFVGNMANGLLVGANNEGHADSTTHAAVSEGTITV
RDTDNQQQNVDDLSRDVEQANNALSPIFDKEKEQNRLKEAQLIGEIGSQVGDVFRTQGQIIATQAANEKMQGVSEADREA
AKANWEKANPGQVATAEDINGQVYKTAYDQAFNASGYGTGGKFQQTVQAATAALQGLAGGDIAKAIAGGSAPYLAEVIKQ
STGDNEEARLAAHAVVGSVLAHLQGNSAVAGGAGALTGEIAADLIMQQLYPGKMVSELSETEKQTISALSTLAAGLAGGL
TGDSSADAVAGAQAGKNAVENNALGLALKGCGIAAPCRSLIAKQVLEIGVKAGITGIVAKEIADKISEDDLDHLVTLKMM
GNDEITEKYLNSLQDKYAPAHTGGDQNAGSGPTDTGGNQIADNSPDHTGNDQSTGQGATNTGNTDGKPDAGGNVLVNPGA
DPLTKKDIVYLSENPNGKIDTVINETLSGKKNFTSSTTLTSDEALAAGLKFLGTGYKEIGKSGSGVYHSADGTKEFRIDS
GSIDGAHAPGVPHVHFGVKNPETGKYISNNHVPYKD

Specific function: Unknown

COG id: COG3210

COG function: function code U; Large exoproteins involved in heme utilization or adhesion

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 69433; Mature: 69301

Theoretical pI: Translated: 4.43; Mature: 4.43

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
1.2 %Met     (Translated Protein)
1.5 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
1.0 %Met     (Mature Protein)
1.3 %Cys+Met (Mature Protein)
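
The percentages above follow from residue counts. Counting residues in the translated sequence of this record gives 2 Cys and 8 Met out of 677; the mature form drops the initiator Met, leaving 7 Met out of 676. The sketch below assumes the values are simple percentages rounded to one decimal place. Both helper names are hypothetical.

```python
def percent(count: int, length: int) -> float:
    """Share of `count` residues in a protein of `length`, in percent (1 decimal)."""
    return round(100 * count / length, 1)


def cys_met_content(protein: str) -> dict:
    """Cys, Met and combined Cys+Met percentages for a one-letter protein string."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    cys, met = protein.count("C"), protein.count("M")
    n = len(protein)
    return {
        "Cys": percent(cys, n),
        "Met": percent(met, n),
        "Cys+Met": percent(cys + met, n),
    }


# Translated protein: 2 Cys, 8 Met, 677 residues.
print(percent(2, 677), percent(8, 677), percent(10, 677))  # 0.3 1.2 1.5
# Mature protein: 2 Cys, 7 Met, 676 residues.
print(percent(2, 676), percent(7, 676), percent(9, 676))   # 0.3 1.0 1.3
```

Passing either full sequence from this record to `cys_met_content` performs the counting step as well.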

Secondary structure:

>Translated Secondary Structure
MGRDLLLQSQQDSDNYDAKQQSSSVGGSFSPGSMTGSISINGSQDKLNSNFDSVQEQTGI
CCCCCEEECCCCCCCCCHHHHHCCCCCCCCCCCEEEEEEECCCHHHHCCCHHHHHHHHCE
FAGSGGFDITVGGHTQLDGAVIGSTVTADKNTLDTGTLGFSDIDNQADFKVEHQSVGIST
EECCCCCEEEECCCCCCCCEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEHHEECCCC
GGNIGSQFVGNMANGLLVGANNEGHADSTTHAAVSEGTITVRDTDNQQQNVDDLSRDVEQ
CCCHHHHHHHHHHCCEEEECCCCCCCCCCHHHEECCCEEEEECCCCCCCCHHHHHHHHHH
ANNALSPIFDKEKEQNRLKEAQLIGEIGSQVGDVFRTQGQIIATQAANEKMQGVSEADRE
HHCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHCHHHHHHH
AAKANWEKANPGQVATAEDINGQVYKTAYDQAFNASGYGTGGKFQQTVQAATAALQGLAG
HHHCCCCCCCCCCEEEHHCCCCCEEEEHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCC
GDIAKAIAGGSAPYLAEVIKQSTGDNEEARLAAHAVVGSVLAHLQGNSAVAGGAGALTGE
CHHHHHHCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCEECCCCCHHHHH
IAADLIMQQLYPGKMVSELSETEKQTISALSTLAAGLAGGLTGDSSADAVAGAQAGKNAV
HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCHHH
ENNALGLALKGCGIAAPCRSLIAKQVLEIGVKAGITGIVAKEIADKISEDDLDHLVTLKM
HCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCCCCEEEEEE
MGNDEITEKYLNSLQDKYAPAHTGGDQNAGSGPTDTGGNQIADNSPDHTGNDQSTGQGAT
CCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
NTGNTDGKPDAGGNVLVNPGADPLTKKDIVYLSENPNGKIDTVINETLSGKKNFTSSTTL
CCCCCCCCCCCCCCEEECCCCCCCCCCCEEEEECCCCCCCHHHHHHHHCCCCCCCCCCCC
TSDEALAAGLKFLGTGYKEIGKSGSGVYHSADGTKEFRIDSGSIDGAHAPGVPHVHFGVK
CCHHHHHHHHHHHCCCHHHHCCCCCCEEECCCCCEEEEECCCCCCCCCCCCCCEEEECCC
NPETGKYISNNHVPYKD
CCCCCCEECCCCCCCCC
>Mature Secondary Structure 
GRDLLLQSQQDSDNYDAKQQSSSVGGSFSPGSMTGSISINGSQDKLNSNFDSVQEQTGI
CCCCEEECCCCCCCCCHHHHHCCCCCCCCCCCEEEEEEECCCHHHHCCCHHHHHHHHCE
FAGSGGFDITVGGHTQLDGAVIGSTVTADKNTLDTGTLGFSDIDNQADFKVEHQSVGIST
EECCCCCEEEECCCCCCCCEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEHHEECCCC
GGNIGSQFVGNMANGLLVGANNEGHADSTTHAAVSEGTITVRDTDNQQQNVDDLSRDVEQ
CCCHHHHHHHHHHCCEEEECCCCCCCCCCHHHEECCCEEEEECCCCCCCCHHHHHHHHHH
ANNALSPIFDKEKEQNRLKEAQLIGEIGSQVGDVFRTQGQIIATQAANEKMQGVSEADRE
HHCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHCHHHHHHH
AAKANWEKANPGQVATAEDINGQVYKTAYDQAFNASGYGTGGKFQQTVQAATAALQGLAG
HHHCCCCCCCCCCEEEHHCCCCCEEEEHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCC
GDIAKAIAGGSAPYLAEVIKQSTGDNEEARLAAHAVVGSVLAHLQGNSAVAGGAGALTGE
CHHHHHHCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCEECCCCCHHHHH
IAADLIMQQLYPGKMVSELSETEKQTISALSTLAAGLAGGLTGDSSADAVAGAQAGKNAV
HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCHHH
ENNALGLALKGCGIAAPCRSLIAKQVLEIGVKAGITGIVAKEIADKISEDDLDHLVTLKM
HCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCCCCEEEEEE
MGNDEITEKYLNSLQDKYAPAHTGGDQNAGSGPTDTGGNQIADNSPDHTGNDQSTGQGAT
CCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
NTGNTDGKPDAGGNVLVNPGADPLTKKDIVYLSENPNGKIDTVINETLSGKKNFTSSTTL
CCCCCCCCCCCCCCEEECCCCCCCCCCCEEEEECCCCCCCHHHHHHHHCCCCCCCCCCCC
TSDEALAAGLKFLGTGYKEIGKSGSGVYHSADGTKEFRIDSGSIDGAHAPGVPHVHFGVK
CCHHHHHHHHHHHCCCHHHHCCCCCCEEECCCCCEEEEECCCCCCCCCCCCCCEEEECCC
NPETGKYISNNHVPYKD
CCCCCCEECCCCCCCCC
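
The prediction strings above can be summarized as state fractions. The sketch assumes the common three-state convention (H = helix, E = strand, C = coil), which the record does not spell out. `state_fractions` is an illustrative name.

```python
from collections import Counter


def state_fractions(states: str) -> dict:
    """Percentage of each three-state symbol (H, E, C) in a prediction string."""
    states = states.upper()
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    return {s: round(100 * counts[s] / len(states), 1) for s in "HEC"}


# Demo on the final 17-residue segment of the prediction in this record.
print(state_fractions("CCCCCCEECCCCCCCCC"))  # {'H': 0.0, 'E': 11.8, 'C': 88.2}
```

Joining all prediction lines into one string before calling the function gives the composition for the whole protein.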

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA