Definition Methanopyrus kandleri AV19, complete genome.
Accession NC_003551
Length 1,694,969

The map label for this gene is 20094302

Identifier: 20094302

GI number: 20094302

Start: 819624

End: 820862

Strand: Direct

Name: 20094302

Synonym: MK0866

Alternate gene names: NA

Gene position: 819624-820862 (Clockwise)

Preceding gene: 20094301

Following gene: 20094304

Centisome position: 48.36
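
The gene length and centisome position follow from the coordinates listed above. The sketch below is a hedged illustration rather than the database's own code: it assumes coordinates are 1-based and inclusive, and that the centisome position is the gene start expressed as a percentage of the genome length. The function names are hypothetical.

```python
# Coordinates copied from this record (assumed 1-based, inclusive).
GENOME_LENGTH = 1_694_969
START = 819_624
END = 820_862


def gene_length(start: int, end: int) -> int:
    """Number of bases in an inclusive interval."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of genome length (assumed definition)."""
    return round(100 * start / genome_length, 2)


length = gene_length(START, END)
position = centisome(START, GENOME_LENGTH)
print(length, position)  # 1239 48.36
```

Under these assumptions the result agrees with the 1239-base gene sequence and the centisome value given in this record.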

GC content: 55.93

Gene sequence:

>1239_bases
TTGACCGACAGGGAGGAAGTGGTAGAACTCCGCGGTCATATTATCGACTCACTCATTTTCTCCCGGGTACTGGACACTAT
CATGGAGATGGGCGGAGATTTTGAAATATTGGAATTCAAGGTAGGGAAGCGTAAAACGGACCCTAGTTTCGCGAAGATCC
TCGTCAAGGGGAAGGATCCTGAGCATCTTAGAGAGATAGTCTCCGAACTACGCAAGTACGGGGCGGTCCCCGTTCACACG
CAGGAAGTCCGACTGGAGCCGGCTCCGGCGGACGGCGTCTGTCCCCGGGGCTTCTACACAACTACGAACCACCGAACGTT
CGTACTTTTCGACGGCGAGTGGATCGAGGTCGAGGACATAGAGATGGACTGCGCGATCGTGGTTTACCCGGAGGAACGCA
GGGCAGTGGCTAAACCCATCCGAGAGGTTCGTGAGGGAGAGTTGGTGGTAGTGGGAGACAGAGGAGTGCGCGTGAAGCCC
CCCGAGAGACCCCGCGGTAGGACTGGAATCTTCGGCTTCATGGAGAGCGAAGTCTCACCTGAAAAGCCCACACCAACGCT
GATCCGAAGAATAGCTGAAGAACTAGAGTGGCACCGAAAGAACGGTAAAATCGTGGTAGTCGTTGGCCCTGCCGTGATTC
ACGCTGGAGCCCGTGATGATCTGGCATGGATGATCAGAGAGGGGTACGTAGACGTACTCTTCGCGGGTAATGCCGTGGCT
ACGCACGACGTTGAAGCTAGTTTATTCGGGACGTCACTCGGTGTGGATTTGGAGACGGGTGAGCCAGTAAAAGGAGGACA
CAGCCACCACCTTTACGCCATCAACGAGATCCGGCGAGTGGGCGGGTTACGCGAAGCCGTCGAGAAAGGAATCCTGAAAG
ATGGGATAATGTACGAGTGCATCGTCAACGATGTTCCGTACGTGTTGGCAGGCTCGATACGTGATGATGGTCCTATCCCG
GACGTAATCACCGACGTCATGGAAGCCCAAGCGGAGATGCGACGTCATCTCAAGGGGGCCACCCTAGTGCTGATGATGGC
GACGATGCTTCACTCGATCGCCACCGGCAACCTCTTGCCTTCCTGGGTCAAGACTATCTGCGTAGATATCAACCCGGCGG
TAGTTACGAAGTTGATGGATCGAGGGACCGCCCAGGCTCTGGGAATAGTGTCCGACGTCGGTGTATTCCTACCGGAACTC
GTGAAGGAGCTCAAGAGGGTCCGCGACGACGAGGCTTAG

Upstream 100 bases:

>100_bases
ATTTCTACGGATCGCCGATCGCCCGTGGTAACCCAAGAATAAAAACTCTCCCATCCTTTCAACCTTGGAGGATGAGGCTA
GCAGCGTGAGGGGGCCTCAC

Downstream 100 bases:

>100_bases
TCCACTCTTCCACCCTGCGCTCGATGAGTTCTTCCATCACCTTTTTAGCCTCTTCCAGCGCATCCTTCGTCCCCCTGAAG
GCTAGGACCGTTGTCTTCGT

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 412; Mature: 411

Protein sequence:

>412_residues
MTDREEVVELRGHIIDSLIFSRVLDTIMEMGGDFEILEFKVGKRKTDPSFAKILVKGKDPEHLREIVSELRKYGAVPVHT
QEVRLEPAPADGVCPRGFYTTTNHRTFVLFDGEWIEVEDIEMDCAIVVYPEERRAVAKPIREVREGELVVVGDRGVRVKP
PERPRGRTGIFGFMESEVSPEKPTPTLIRRIAEELEWHRKNGKIVVVVGPAVIHAGARDDLAWMIREGYVDVLFAGNAVA
THDVEASLFGTSLGVDLETGEPVKGGHSHHLYAINEIRRVGGLREAVEKGILKDGIMYECIVNDVPYVLAGSIRDDGPIP
DVITDVMEAQAEMRRHLKGATLVLMMATMLHSIATGNLLPSWVKTICVDINPAVVTKLMDRGTAQALGIVSDVGVFLPEL
VKELKRVRDDEA

Sequences:

>Translated_412_residues
MTDREEVVELRGHIIDSLIFSRVLDTIMEMGGDFEILEFKVGKRKTDPSFAKILVKGKDPEHLREIVSELRKYGAVPVHT
QEVRLEPAPADGVCPRGFYTTTNHRTFVLFDGEWIEVEDIEMDCAIVVYPEERRAVAKPIREVREGELVVVGDRGVRVKP
PERPRGRTGIFGFMESEVSPEKPTPTLIRRIAEELEWHRKNGKIVVVVGPAVIHAGARDDLAWMIREGYVDVLFAGNAVA
THDVEASLFGTSLGVDLETGEPVKGGHSHHLYAINEIRRVGGLREAVEKGILKDGIMYECIVNDVPYVLAGSIRDDGPIP
DVITDVMEAQAEMRRHLKGATLVLMMATMLHSIATGNLLPSWVKTICVDINPAVVTKLMDRGTAQALGIVSDVGVFLPEL
VKELKRVRDDEA
>Mature_411_residues
TDREEVVELRGHIIDSLIFSRVLDTIMEMGGDFEILEFKVGKRKTDPSFAKILVKGKDPEHLREIVSELRKYGAVPVHTQ
EVRLEPAPADGVCPRGFYTTTNHRTFVLFDGEWIEVEDIEMDCAIVVYPEERRAVAKPIREVREGELVVVGDRGVRVKPP
ERPRGRTGIFGFMESEVSPEKPTPTLIRRIAEELEWHRKNGKIVVVVGPAVIHAGARDDLAWMIREGYVDVLFAGNAVAT
HDVEASLFGTSLGVDLETGEPVKGGHSHHLYAINEIRRVGGLREAVEKGILKDGIMYECIVNDVPYVLAGSIRDDGPIPD
VITDVMEAQAEMRRHLKGATLVLMMATMLHSIATGNLLPSWVKTICVDINPAVVTKLMDRGTAQALGIVSDVGVFLPELV
KELKRVRDDEA

Specific function: Unknown

COG id: COG1915

COG function: function code S; Uncharacterized conserved protein

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

None

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): Y866_METKA (Q8TX14)

Other databases:

- EMBL:   AE009439
- RefSeq:   NP_614149.1
- ProteinModelPortal:   Q8TX14
- SMR:   Q8TX14
- GeneID:   1476967
- GenomeReviews:   AE009439_GR
- KEGG:   mka:MK0866
- NMPDR:   fig|190192.1.peg.862
- HOGENOM:   HBG481668
- OMA:   ATHDIES
- ProtClustDB:   CLSK862676
- BioCyc:   MKAN190192:MK0866-MONOMER
- InterPro:   IPR005239
- InterPro:   IPR007545
- TIGRFAMs:   TIGR00300

Pfam domain/function: PF04455 Saccharop_dh_N

EC number: NA

Molecular weight: Translated: 45747; Mature: 45616

Theoretical pI: Translated: 5.27; Mature: 5.27

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.0 %Cys     (Translated Protein)
3.2 %Met     (Translated Protein)
4.1 %Cys+Met (Translated Protein)
1.0 %Cys     (Mature Protein)
2.9 %Met     (Mature Protein)
3.9 %Cys+Met (Mature Protein)
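
The percentages above can be recomputed from the protein sequence in this record. The following is a minimal sketch, assuming each value is the residue count divided by the sequence length, rounded to one decimal place, and that the mature form is the translated sequence without its initiator Met (as the Mature_411_residues entry shows). The function name `cys_met_content` is hypothetical.

```python
# Translated protein sequence copied from this record (412 residues).
TRANSLATED = (
    "MTDREEVVELRGHIIDSLIFSRVLDTIMEMGGDFEILEFKVGKRKTDPSFAKILVKGKDPEHLREIVSELRKYGAVPVHT"
    "QEVRLEPAPADGVCPRGFYTTTNHRTFVLFDGEWIEVEDIEMDCAIVVYPEERRAVAKPIREVREGELVVVGDRGVRVKP"
    "PERPRGRTGIFGFMESEVSPEKPTPTLIRRIAEELEWHRKNGKIVVVVGPAVIHAGARDDLAWMIREGYVDVLFAGNAVA"
    "THDVEASLFGTSLGVDLETGEPVKGGHSHHLYAINEIRRVGGLREAVEKGILKDGIMYECIVNDVPYVLAGSIRDDGPIP"
    "DVITDVMEAQAEMRRHLKGATLVLMMATMLHSIATGNLLPSWVKTICVDINPAVVTKLMDRGTAQALGIVSDVGVFLPEL"
    "VKELKRVRDDEA"
)
# Mature form as listed in the record: the initiator Met is removed.
MATURE = TRANSLATED[1:]


def cys_met_content(protein: str) -> dict[str, float]:
    """Percent Cys, Met, and Cys+Met, rounded to one decimal place."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100 * cys / n, 1),
        "Met": round(100 * met / n, 1),
        "Cys+Met": round(100 * (cys + met) / n, 1),
    }


print(cys_met_content(TRANSLATED))
print(cys_met_content(MATURE))
```

The translated sequence contains 4 Cys and 13 Met residues; removing the initiator Met leaves 12 Met in the mature form, which accounts for the lower Met percentage.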

Secondary structure:

>Translated Secondary Structure
MTDREEVVELRGHIIDSLIFSRVLDTIMEMGGDFEILEFKVGKRKTDPSFAKILVKGKDP
CCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHEEEEECCCCH
EHLREIVSELRKYGAVPVHTQEVRLEPAPADGVCPRGFYTTTNHRTFVLFDGEWIEVEDI
HHHHHHHHHHHHCCCCCCCHHHEEECCCCCCCCCCCCCEEECCCEEEEEECCCEEEEEEC
EMDCAIVVYPEERRAVAKPIREVREGELVVVGDRGVRVKPPERPRGRTGIFGFMESEVSP
CCCEEEEEECHHHHHHHHHHHHHCCCCEEEECCCCEECCCCCCCCCCCCEEEEHHCCCCC
EKPTPTLIRRIAEELEWHRKNGKIVVVVGPAVIHAGARDDLAWMIREGYVDVLFAGNAVA
CCCCHHHHHHHHHHHHHHHCCCEEEEEECHHHEECCCCCHHHHHHHCCCEEEEEECCEEE
THDVEASLFGTSLGVDLETGEPVKGGHSHHLYAINEIRRVGGLREAVEKGILKDGIMYEC
ECCCHHHHCCCCCCCCCCCCCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHCCCEEEE
IVNDVPYVLAGSIRDDGPIPDVITDVMEAQAEMRRHLKGATLVLMMATMLHSIATGNLLP
HHCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCH
SWVKTICVDINPAVVTKLMDRGTAQALGIVSDVGVFLPELVKELKRVRDDEA
HHHHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCC
>Mature Secondary Structure
TDREEVVELRGHIIDSLIFSRVLDTIMEMGGDFEILEFKVGKRKTDPSFAKILVKGKDP
CCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHEEEEECCCCH
EHLREIVSELRKYGAVPVHTQEVRLEPAPADGVCPRGFYTTTNHRTFVLFDGEWIEVEDI
HHHHHHHHHHHHCCCCCCCHHHEEECCCCCCCCCCCCCEEECCCEEEEEECCCEEEEEEC
EMDCAIVVYPEERRAVAKPIREVREGELVVVGDRGVRVKPPERPRGRTGIFGFMESEVSP
CCCEEEEEECHHHHHHHHHHHHHCCCCEEEECCCCEECCCCCCCCCCCCEEEEHHCCCCC
EKPTPTLIRRIAEELEWHRKNGKIVVVVGPAVIHAGARDDLAWMIREGYVDVLFAGNAVA
CCCCHHHHHHHHHHHHHHHCCCEEEEEECHHHEECCCCCHHHHHHHCCCEEEEEECCEEE
THDVEASLFGTSLGVDLETGEPVKGGHSHHLYAINEIRRVGGLREAVEKGILKDGIMYEC
ECCCHHHHCCCCCCCCCCCCCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHCCCEEEE
IVNDVPYVLAGSIRDDGPIPDVITDVMEAQAEMRRHLKGATLVLMMATMLHSIATGNLLP
HHCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCH
SWVKTICVDINPAVVTKLMDRGTAQALGIVSDVGVFLPELVKELKRVRDDEA
HHHHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11930014