Definition Yersinia pestis Nepal516, complete genome.
Accession NC_008149
Length 4,534,590

The map label for this gene is ycjS [H]

Identifier: 108812394

GI number: 108812394

Start: 2521498

End: 2522511

Strand: Reverse

Name: ycjS [H]

Synonym: YPN_2233

Alternate gene names: 108812394

Gene position: 2522511-2521498 (Counterclockwise)

Preceding gene: 108812396

Following gene: 108812393

Centisome position: 55.63
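
The coordinates above are mutually consistent: the gene spans End - Start + 1 = 1014 bases, and the centisome value matches the 5' coordinate of this reverse-strand gene (2522511) expressed as a percentage of the 4,534,590-base genome. The sketch below reproduces both figures. The function names are illustrative, and the choice of the 5' coordinate for the centisome is inferred from the numbers in this record, not stated by it.

```python
GENOME_LENGTH = 4_534_590  # "Length" field of NC_008149


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int = GENOME_LENGTH) -> float:
    """Position as a percentage of the genome length, rounded to 2 decimals."""
    return round(100.0 * position / genome_length, 2)


if __name__ == "__main__":
    print(gene_length(2521498, 2522511))  # Start and End fields
    print(centisome(2522511))             # 5' end of the reverse-strand gene
```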

GC content: 54.44
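
The record does not say how the GC content was derived. A minimal sketch, assuming it is the percentage of G and C bases in the gene sequence, is given below; under that assumption 54.44 would correspond to 552 of the 1014 bases. The function name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence, 2 decimals."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


if __name__ == "__main__":
    # First ten bases of the gene sequence, used only as a small example.
    print(gc_content("ATGACAGATC"))
```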

Gene sequence:

>1014_bases
ATGACAGATCGTGTTATTAAAGTTGGCGTTATTGGCGCGGGTTTTATGGGGGCGATGCATGCCCGAATTTGGCAGCAAAT
GTACGGTGTCGAAGTGGTCGGTGTGGCCGATCCAGATTTATCCCGCAGTGAAGCGCTGAAGCCGTGGATCCCTCAACTGA
ATGGCTATGAAGATTTCAACCAGATGCTGGAGCAGGAAAAACCAGATATTGTCAGCATCTGTACCAAAGATGACTTCCAC
CTTGCGCCCGCACTTGCAGCTGCAAAAGTGGGTGCCCACATTTTCCTCGAGAAACCGATTGCGACTACGGTGGAAGACGG
TGAAGCCATCACGCGTGCTGCACGCGAAGCAAACGTCAAGTTGGGTATTGGTTTCCTGCTGCGTTTCGACCCGCGCTACT
CACGTGCACAAGAGATCCTGGCTTCGGGCAGTGCAGGCGAAGTGAGCCATATTAGTGCTCGTCGTAATAGTCCGGCTATC
GAAGGTCCTGCCCGTTATGGCGGTAGCCTGCCGTTGCCGCTGCACGTCACTGTGCATGACGTTGATATGATCCTGTGGTT
GCTAAAGCATACCCACCCGATCAGCGTCTACGCCCAGACCACTAACAAGCTTCTCGGCCATCTGGGGACGGAAGATTCTG
TCTTTGCCATCATTCGCTTTGCCGACGGCACCGTCGTGAATCTGGAGAGCTCTTGGGCGCTGCCAGCCGGTTCACGCACG
TTGCTCGACGCTAAAATGTCGATCCTGGCAACTAAAGGCCTGATTGAAATTGAATGCGGTGAATCCGGTCTGTATCACGC
GTCTGAAAACATGAACCGTTACATCGACACCCAGCACTGGCCGCTGAGTCAGGGTGAACTCAAAGGCGATCTGCGTGAAG
AGCTGATGGCGTTCCTCGGCGATGTTCGCACCGGCACTACCCGTGTAGCAACCGGCGAAGAAGGCAATGAGGCACTGCGT
ATTACCGTTGCCATTATGGACTCTGCCCGTACCGGCGAAATTGTCCGGGTCTGA

Upstream 100 bases:

>100_bases
TTGGTATTAATCTGGACTCATTTTGGCATGGTAATTGAATTACTGAATATGGACGGGGAACGTTACACAGTTTTTCAAAC
TAGTTACCGAGGAAAGTGAA

Downstream 100 bases:

>100_bases
CATTCTGACTGAATAAACCAGGTGATGAGATGAGCATAAAAAAAATAGGTATCGCAGGTATTATCGGCACGTTGCTGATG
GCGGGTAACGCCAGCGCACA

Product: oxidoreductase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 337; Mature: 336

Protein sequence:

>337_residues
MTDRVIKVGVIGAGFMGAMHARIWQQMYGVEVVGVADPDLSRSEALKPWIPQLNGYEDFNQMLEQEKPDIVSICTKDDFH
LAPALAAAKVGAHIFLEKPIATTVEDGEAITRAAREANVKLGIGFLLRFDPRYSRAQEILASGSAGEVSHISARRNSPAI
EGPARYGGSLPLPLHVTVHDVDMILWLLKHTHPISVYAQTTNKLLGHLGTEDSVFAIIRFADGTVVNLESSWALPAGSRT
LLDAKMSILATKGLIEIECGESGLYHASENMNRYIDTQHWPLSQGELKGDLREELMAFLGDVRTGTTRVATGEEGNEALR
ITVAIMDSARTGEIVRV
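
The residue counts follow from the gene length: 1014 bases give 338 codons, the last of which (TGA) is a stop codon, leaving 337 translated residues. The sketch below shows that relationship with a small translator built on the standard genetic code. It is an illustration, not the method used to produce this record, and the names `CODON_TABLE` and `translate` are assumptions.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and the final in-frame line of the gene sequence.
    print(translate("ATGACAGATCGTGTTATTAAAGTTGGCGTT"))
    print(translate("ATTACCGTTGCCATTATGGACTCTGCCCGTACCGGCGAAATTGTCCGGGTCTGA"))
```

Applied to the first 30 bases this gives MTDRVIKVGV, and applied to the last line of the gene sequence it gives ITVAIMDSARTGEIVRV, matching the start and end of the protein sequence above.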

Sequences:

>Translated_337_residues
MTDRVIKVGVIGAGFMGAMHARIWQQMYGVEVVGVADPDLSRSEALKPWIPQLNGYEDFNQMLEQEKPDIVSICTKDDFH
LAPALAAAKVGAHIFLEKPIATTVEDGEAITRAAREANVKLGIGFLLRFDPRYSRAQEILASGSAGEVSHISARRNSPAI
EGPARYGGSLPLPLHVTVHDVDMILWLLKHTHPISVYAQTTNKLLGHLGTEDSVFAIIRFADGTVVNLESSWALPAGSRT
LLDAKMSILATKGLIEIECGESGLYHASENMNRYIDTQHWPLSQGELKGDLREELMAFLGDVRTGTTRVATGEEGNEALR
ITVAIMDSARTGEIVRV
>Mature_336_residues
TDRVIKVGVIGAGFMGAMHARIWQQMYGVEVVGVADPDLSRSEALKPWIPQLNGYEDFNQMLEQEKPDIVSICTKDDFHL
APALAAAKVGAHIFLEKPIATTVEDGEAITRAAREANVKLGIGFLLRFDPRYSRAQEILASGSAGEVSHISARRNSPAIE
GPARYGGSLPLPLHVTVHDVDMILWLLKHTHPISVYAQTTNKLLGHLGTEDSVFAIIRFADGTVVNLESSWALPAGSRTL
LDAKMSILATKGLIEIECGESGLYHASENMNRYIDTQHWPLSQGELKGDLREELMAFLGDVRTGTTRVATGEEGNEALRI
TVAIMDSARTGEIVRV

Specific function: Unknown

COG id: COG0673

COG function: function code R; Predicted dehydrogenases and related proteins

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the gfo/idh/mocA family [H]

Homologues:

Organism=Escherichia coli, GI1787574, Length=364, Percent_Identity=27.1978021978022, Blast_Score=104, Evalue=9e-24
Organism=Escherichia coli, GI87082405, Length=271, Percent_Identity=25.830258302583, Blast_Score=76, Evalue=3e-15
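
Each homologue line is a comma-separated list of `key=value` fields plus a bare GI identifier. A minimal parsing sketch follows. The function name `parse_homologue` and the choice of numeric types are assumptions, and the parser tolerates an optional trailing comma.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Organism=..., GI..., Length=...' line into a dictionary."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            # The GI identifier has no key, so store it under "GI".
            record["GI"] = field[2:]
    # Convert the numeric fields; all other values stay as strings.
    casts = {"Length": int, "Percent_Identity": float,
             "Blast_Score": float, "Evalue": float}
    for key, cast in casts.items():
        if key in record:
            record[key] = cast(record[key])
    return record


if __name__ == "__main__":
    hit = parse_homologue(
        "Organism=Escherichia coli, GI1787574, Length=364, "
        "Percent_Identity=27.1978021978022, Blast_Score=104, Evalue=9e-24,"
    )
    print(hit["Organism"], hit["GI"], hit["Length"], hit["Evalue"])
```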

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR016040
- InterPro:   IPR000683
- InterPro:   IPR004104 [H]

Pfam domain/function: PF01408 GFO_IDH_MocA; PF02894 GFO_IDH_MocA_C [H]

EC number: 1.-.-.- [C]

Molecular weight: Translated: 36736; Mature: 36605
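
The two molecular weights differ by 131 Da, which is what removal of a single methionine residue would account for. The arithmetic below is a sketch under that assumption, using approximate average masses for free methionine and water that are not taken from this record.

```python
MET_FREE = 149.21   # approximate average mass of free methionine (Da)
WATER = 18.015      # approximate average mass of water (Da)

# Mass of a methionine residue within a peptide chain.
met_residue = MET_FREE - WATER

# Difference between the Translated and Mature molecular weights.
difference = 36736 - 36605

if __name__ == "__main__":
    print(round(met_residue, 2), difference)
```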

Theoretical pI: Translated: 5.58; Mature: 5.58

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.6 %Cys     (Translated Protein)
3.0 %Met     (Translated Protein)
3.6 %Cys+Met (Translated Protein)
0.6 %Cys     (Mature Protein)
2.7 %Met     (Mature Protein)
3.3 %Cys+Met (Mature Protein)
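
The percentages above can be reproduced by counting residues in the protein sequence given earlier in this record. The sketch below assumes the mature protein is the translated protein without its initiator methionine, which is what the Mature_336_residues sequence shows. The helper name `percent` is hypothetical.

```python
TRANSLATED = (
    "MTDRVIKVGVIGAGFMGAMHARIWQQMYGVEVVGVADPDLSRSEALKPWIPQLNGYEDFNQMLEQEKPDIVSICTKDDFH"
    "LAPALAAAKVGAHIFLEKPIATTVEDGEAITRAAREANVKLGIGFLLRFDPRYSRAQEILASGSAGEVSHISARRNSPAI"
    "EGPARYGGSLPLPLHVTVHDVDMILWLLKHTHPISVYAQTTNKLLGHLGTEDSVFAIIRFADGTVVNLESSWALPAGSRT"
    "LLDAKMSILATKGLIEIECGESGLYHASENMNRYIDTQHWPLSQGELKGDLREELMAFLGDVRTGTTRVATGEEGNEALR"
    "ITVAIMDSARTGEIVRV"
)
MATURE = TRANSLATED[1:]  # assumption: initiator Met removed


def percent(seq: str, residues: str) -> float:
    """Percentage of positions in seq occupied by any of the given residues."""
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


if __name__ == "__main__":
    for name, seq in (("Translated", TRANSLATED), ("Mature", MATURE)):
        print(name, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
```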

Secondary structure:

>Translated Secondary Structure
MTDRVIKVGVIGAGFMGAMHARIWQQMYGVEVVGVADPDLSRSEALKPWIPQLNGYEDFN
CCCCEEEEEEECCHHHHHHHHHHHHHHHCEEEEEECCCCCCHHHCCCCCCCCCCCHHHHH
QMLEQEKPDIVSICTKDDFHLAPALAAAKVGAHIFLEKPIATTVEDGEAITRAAREANVK
HHHHHCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEECCCCCCCCCCHHHHHHHHHCCEE
LGIGFLLRFDPRYSRAQEILASGSAGEVSHISARRNSPAIEGPARYGGSLPLPLHVTVHD
EEEEEEEEECCCHHHHHHHHHCCCCCCCHHHHHCCCCCCCCCHHHCCCCCCCEEEEEEHH
VDMILWLLKHTHPISVYAQTTNKLLGHLGTEDSVFAIIRFADGTVVNLESSWALPAGSRT
HHHHHHHHHCCCCEEEEEEHHHHHHHHCCCCCCEEEEEEECCCEEEEECCCCCCCCCCCH
LLDAKMSILATKGLIEIECGESGLYHASENMNRYIDTQHWPLSQGELKGDLREELMAFLG
HHHHHHHHHEECCEEEEEECCCCCEECCCCCHHHCCCCCCCCCCCCCCHHHHHHHHHHHC
DVRTGTTRVATGEEGNEALRITVAIMDSARTGEIVRV
CCCCCCEEEECCCCCCCEEEEEEEEECCCCCCCEEEC
>Mature Secondary Structure
TDRVIKVGVIGAGFMGAMHARIWQQMYGVEVVGVADPDLSRSEALKPWIPQLNGYEDFN
CCCEEEEEEECCHHHHHHHHHHHHHHHCEEEEEECCCCCCHHHCCCCCCCCCCCHHHHH
QMLEQEKPDIVSICTKDDFHLAPALAAAKVGAHIFLEKPIATTVEDGEAITRAAREANVK
HHHHHCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEECCCCCCCCCCHHHHHHHHHCCEE
LGIGFLLRFDPRYSRAQEILASGSAGEVSHISARRNSPAIEGPARYGGSLPLPLHVTVHD
EEEEEEEEECCCHHHHHHHHHCCCCCCCHHHHHCCCCCCCCCHHHCCCCCCCEEEEEEHH
VDMILWLLKHTHPISVYAQTTNKLLGHLGTEDSVFAIIRFADGTVVNLESSWALPAGSRT
HHHHHHHHHCCCCEEEEEEHHHHHHHHCCCCCCEEEEEEECCCEEEEECCCCCCCCCCCH
LLDAKMSILATKGLIEIECGESGLYHASENMNRYIDTQHWPLSQGELKGDLREELMAFLG
HHHHHHHHHEECCEEEEEECCCCCEECCCCCHHHCCCCCCCCCCCCCCHHHHHHHHHHHC
DVRTGTTRVATGEEGNEALRITVAIMDSARTGEIVRV
CCCCCCEEEECCCCCCCEEEEEEEEECCCCCCCEEEC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 9097039; 9278503 [H]