Definition Klebsiella pneumoniae NTUH-K2044 chromosome, complete genome.
Accession NC_012731
Length 5,248,520

The map label for this gene is int [H]

Identifier: 238892770

GI number: 238892770

Start: 566263

End: 567945

Strand: Direct

Name: int [H]

Synonym: KP1_0573

Alternate gene names: 238892770

Gene position: 566263-567945 (Clockwise)

Preceding gene: 238892761

Following gene: 238892771

Centisome position: 10.79
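The gene length and centisome position can be cross-checked from the coordinates and chromosome length listed above. The sketch below assumes the centisome position is the start coordinate expressed as a percentage of the chromosome length. The record does not state its formula, but the listed value is consistent with that reading. The function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


# Values taken from this record: Start, End, and chromosome Length.
length = gene_length(566263, 567945)
position = round(centisome(566263, 5248520), 2)
print(length, position)
```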

GC content: 54.43

Gene sequence:

>1683_bases
ATGGCGCTTACAGATCTCGCAATCCGGCATGCCAGGCCGCTTGGCAAGGCATACAGGCTCTCTGACTGTCACGGTCTTTA
CATTCAGGTTAACCCCAGCGGATCTAAGCTGTGGTATCTGAAATTTCGCTTTGGAAACAAAGAGAACCGTATGGCATTAG
GCCCTTATCCGCTTATTTCTCTCGCACTGGCAAGGGAGAAGCAAGCGGATATAAGAAGGCTTATTTTAGAAGGCATTAAT
CCGGCGGAAAAACGCAGAGAGGAAAAGCGGGGTGGAGAGCCCCTTTATACGTTCGAGTCCGTGGCGCGCGAATGGGTCTC
CAGCAACGTCAACTGGTCGGCGGAACATAAGAAGCGGGTGCTGCGTTATTTCGAGCTTTACGTATTCCCCACGAACGGCA
GCTGGGATATTACCAAAATGAAGGTCAAAGACCTGCTGGTACCCATCAAAGAGGTGGAGAAAGCGGGCAAACTGGACGTA
GCTTCCCGGCTTCAGCAACGTACCGCTTGCGTGATGCGCTATGCCGTCCAGAACGGCATTATCGATCATAACCCCGCATC
AGATTTAACCGGCGCGGTCTCCACGCCCAAAGTACGTCATCACCCGGCGCTGGATCTGAATCTTATCCCTGATTTTCTGG
AAAGAGTCGACGACTTCAAAGGACGTAAGCTGACACAACTGGCGGTAAAGCTGGCGTTGTTATTATTCATCCGCTCCAGC
GAACTGCGCTTTGCCCGCTGGGATGAGATAGACCTGCATAACGCCATGTGGACCATTCCCGCCGAGCGTGAACCGATCCC
CGGCGTTAAATACTCTGCGCGTGGCGCGAAGATGCGATCTCCACACCTGGTACCTCTGTCACATCAGGCCATCGAACTGC
TGCGCGAGGTGCGGCAGCATTGCCGACCGGGAACTGAACTGGTGTTCCCCGGCGACCACAATTACCGCAAACCGATGAGT
GAAAACACCATTAACAAAGCGCTGCGGGTGATGGGCTACGACACCCAGAAAGATGTCTGTGGCCACGGGTTTCGCACCAT
GGCCTGTAGCGCGCTGGTGGAGTCGGGCCTGTGGTCAAGCGACGCGGTAGAGCGCCAGATGAGCCATCAGGAGCGCAAGC
GCGTTCGTGCGGCGTATATCCATAAGGCGCAGCATCTCGAAGAACGCCGGGAAATGATGCAGTGGTGGGCGGATTATCTG
GATGCGAACCGCTTCAGGCATGTGGTGCCGTATGGCTTCAAAAAATCGCCGGGAGGCGCACTCGACCATATGAGCTTCCA
GGAGCGTAATGACCGACAAGTGGAGGAGCTGAAAGCCCGGATACTGGCTGACTCGGAATGGCTGACGGCGTCTGAGTTAT
CGGCAAAAGCGGGTTTTCGTTCTGCCGATCCTGAAGCTGGCCCGAAAGGCTGGAAAGCCGCAGGTAAGATTTTCTCGCTG
AAGGTCGACGGGGAGGATTTGTACCCGGATTATGTTCTGGATGAGAAAATGCGACCTCTCAAGGTTGTCAGGCTTATTCT
TTCGCTGTTCAAAGAGCGTAAAACGCCGTGGGGGCTAGCTATCTGGTTTGGCTCGGCAAACCGCAGGTTGAGAGGTGGGA
AGCCGAAAGATCTGCTGATTTCAAAGTCAGAGTTGGTTCTGATGGTTGCCCAAGAGGAAATTGAGATGAGGGAGCATGGG
TGA
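GC content is conventionally reported as the percentage of G and C bases in the sequence. The following is a minimal sketch under that assumption. It ignores any character that is not A, C, G, or T, and `gc_content` is an illustrative name rather than the tool used to produce this record. The example input is just the first ten bases of the gene sequence above.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First ten bases of the gene sequence: 5 of 10 are G or C.
example = gc_content("ATGGCGCTTA")
print(example)
```

To check the whole gene, join the sequence lines into one string (dropping the `>1683_bases` header) before calling the function.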

Upstream 100 bases:

>100_bases
TGTGGTCTTTCCGTGTTGGTCATTTTTCATCCATTGACATCCACGCGATTTGGGGGCAAAAAAGGGGGCAAACCCGGTTC
GATAGAGGAGTGACCCCCAA

Downstream 100 bases:

>100_bases
TACAGATGTTTAATCATTCTTGATACGGAGAAAGTTTATGTAGGGTGGGTTGTCAATTGAATTCGCAAGCAATCAAGGGT
CTTAATGGCTTGAAAGAGTT

Product: integrase

Products: NA

Alternate protein names: Int(P4) [H]

Number of amino acids: Translated: 560; Mature: 559

Protein sequence:

>560_residues
MALTDLAIRHARPLGKAYRLSDCHGLYIQVNPSGSKLWYLKFRFGNKENRMALGPYPLISLALAREKQADIRRLILEGIN
PAEKRREEKRGGEPLYTFESVAREWVSSNVNWSAEHKKRVLRYFELYVFPTNGSWDITKMKVKDLLVPIKEVEKAGKLDV
ASRLQQRTACVMRYAVQNGIIDHNPASDLTGAVSTPKVRHHPALDLNLIPDFLERVDDFKGRKLTQLAVKLALLLFIRSS
ELRFARWDEIDLHNAMWTIPAEREPIPGVKYSARGAKMRSPHLVPLSHQAIELLREVRQHCRPGTELVFPGDHNYRKPMS
ENTINKALRVMGYDTQKDVCGHGFRTMACSALVESGLWSSDAVERQMSHQERKRVRAAYIHKAQHLEERREMMQWWADYL
DANRFRHVVPYGFKKSPGGALDHMSFQERNDRQVEELKARILADSEWLTASELSAKAGFRSADPEAGPKGWKAAGKIFSL
KVDGEDLYPDYVLDEKMRPLKVVRLILSLFKERKTPWGLAIWFGSANRRLRGGKPKDLLISKSELVLMVAQEEIEMREHG

Sequences:

>Translated_560_residues
MALTDLAIRHARPLGKAYRLSDCHGLYIQVNPSGSKLWYLKFRFGNKENRMALGPYPLISLALAREKQADIRRLILEGIN
PAEKRREEKRGGEPLYTFESVAREWVSSNVNWSAEHKKRVLRYFELYVFPTNGSWDITKMKVKDLLVPIKEVEKAGKLDV
ASRLQQRTACVMRYAVQNGIIDHNPASDLTGAVSTPKVRHHPALDLNLIPDFLERVDDFKGRKLTQLAVKLALLLFIRSS
ELRFARWDEIDLHNAMWTIPAEREPIPGVKYSARGAKMRSPHLVPLSHQAIELLREVRQHCRPGTELVFPGDHNYRKPMS
ENTINKALRVMGYDTQKDVCGHGFRTMACSALVESGLWSSDAVERQMSHQERKRVRAAYIHKAQHLEERREMMQWWADYL
DANRFRHVVPYGFKKSPGGALDHMSFQERNDRQVEELKARILADSEWLTASELSAKAGFRSADPEAGPKGWKAAGKIFSL
KVDGEDLYPDYVLDEKMRPLKVVRLILSLFKERKTPWGLAIWFGSANRRLRGGKPKDLLISKSELVLMVAQEEIEMREHG
>Mature_559_residues
ALTDLAIRHARPLGKAYRLSDCHGLYIQVNPSGSKLWYLKFRFGNKENRMALGPYPLISLALAREKQADIRRLILEGINP
AEKRREEKRGGEPLYTFESVAREWVSSNVNWSAEHKKRVLRYFELYVFPTNGSWDITKMKVKDLLVPIKEVEKAGKLDVA
SRLQQRTACVMRYAVQNGIIDHNPASDLTGAVSTPKVRHHPALDLNLIPDFLERVDDFKGRKLTQLAVKLALLLFIRSSE
LRFARWDEIDLHNAMWTIPAEREPIPGVKYSARGAKMRSPHLVPLSHQAIELLREVRQHCRPGTELVFPGDHNYRKPMSE
NTINKALRVMGYDTQKDVCGHGFRTMACSALVESGLWSSDAVERQMSHQERKRVRAAYIHKAQHLEERREMMQWWADYLD
ANRFRHVVPYGFKKSPGGALDHMSFQERNDRQVEELKARILADSEWLTASELSAKAGFRSADPEAGPKGWKAAGKIFSLK
VDGEDLYPDYVLDEKMRPLKVVRLILSLFKERKTPWGLAIWFGSANRRLRGGKPKDLLISKSELVLMVAQEEIEMREHG
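The translated and mature sequences above differ only by the leading M, which matches removal of the initiator methionine. The sketch below illustrates that relationship and the link between the 1683-base gene and the 560-residue translation. It assumes no processing other than methionine removal, and the function names are hypothetical.

```python
def residues_encoded(n_bases: int) -> int:
    """Number of amino acids encoded by a CDS of n_bases, excluding the stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1


def mature_from_translated(protein: str) -> str:
    """Drop the initiator methionine, if present (assumed to be the only processing)."""
    return protein[1:] if protein.startswith("M") else protein


translated_len = residues_encoded(1683)
# First ten residues of the translated sequence in this record.
mature_prefix = mature_from_translated("MALTDLAIRH")
print(translated_len, mature_prefix)
```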

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the 'phage' integrase family [H]

Homologues:

Organism=Escherichia coli, GI1788690, Length=400, Percent_Identity=33, Blast_Score=222, Evalue=4e-59
Organism=Escherichia coli, GI145693166, Length=406, Percent_Identity=35.71, Blast_Score=219, Evalue=3e-58
Organism=Escherichia coli, GI1788974, Length=405, Percent_Identity=32.10, Blast_Score=203, Evalue=2e-53
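The homologue entries above are comma-separated `key=value` fields plus a bare GI identifier. A possible way to read them into a dictionary is sketched below. The parser, its name, and the choice to store the GI number under a `"GI"` key are assumptions made for illustration, not part of the database's own tooling.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict; numeric fields are converted."""
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            # Bare identifier such as "GI1788690": keep only the digits part.
            record["GI"] = field[2:]
    for key in ("Length", "Blast_Score"):
        if key in record:
            record[key] = int(record[key])
    for key in ("Percent_Identity", "Evalue"):
        if key in record:
            record[key] = float(record[key])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1788690, Length=400, "
    "Percent_Identity=33, Blast_Score=222, Evalue=4e-59,"
)
print(hit)
```

The trailing comma is tolerated, so the function accepts the lines with or without it.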

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR011010
- InterPro:   IPR013762
- InterPro:   IPR002104
- InterPro:   IPR023109 [H]

Pfam domain/function: PF00589 Phage_integrase [H]

EC number: NA

Molecular weight: Translated: 64291; Mature: 64160

Theoretical pI: Translated: 10.11; Mature: 10.11

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.9 %Cys     (Translated Protein)
2.9 %Met     (Translated Protein)
3.8 %Cys+Met (Translated Protein)
0.9 %Cys     (Mature Protein)
2.7 %Met     (Mature Protein)
3.6 %Cys+Met (Mature Protein)
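The percentages above are presumably residue counts divided by sequence length, rounded to one decimal place. A minimal sketch under that assumption follows. `residue_percent` is an illustrative name, and the short input is a toy sequence rather than the protein in this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in protein occupied by any of the given residues."""
    if not protein:
        raise ValueError("empty sequence")
    count = sum(1 for aa in protein.upper() if aa in residues)
    return round(100.0 * count / len(protein), 1)


toy = "MCAAAAAAAA"  # 10 residues: one Met, one Cys
cys = residue_percent(toy, "C")
met = residue_percent(toy, "M")
cys_met = residue_percent(toy, "CM")
print(cys, met, cys_met)
```

Applying the same function to the translated and mature sequences separately gives the two sets of figures, since the mature form has one fewer methionine and one fewer residue overall.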

Secondary structure:

>Translated Secondary Structure
MALTDLAIRHARPLGKAYRLSDCHGLYIQVNPSGSKLWYLKFRFGNKENRMALGPYPLIS
CCCHHHHHHHHCCCCCCEECCCCCEEEEEECCCCCEEEEEEEEECCCCCCEEECCCHHHH
LALAREKQADIRRLILEGINPAEKRREEKRGGEPLYTFESVAREWVSSNVNWSAEHKKRV
HHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCHHHHHHH
LRYFELYVFPTNGSWDITKMKVKDLLVPIKEVEKAGKLDVASRLQQRTACVMRYAVQNGI
HHHHEEEEEECCCCCEEHHHHHHHHHHCHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCC
IDHNPASDLTGAVSTPKVRHHPALDLNLIPDFLERVDDFKGRKLTQLAVKLALLLFIRSS
CCCCCCHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCC
ELRFARWDEIDLHNAMWTIPAEREPIPGVKYSARGAKMRSPHLVPLSHQAIELLREVRQH
CCCEECCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHH
CRPGTELVFPGDHNYRKPMSENTINKALRVMGYDTQKDVCGHGFRTMACSALVESGLWSS
CCCCCEEEECCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHHHHHCCCCCH
DAVERQMSHQERKRVRAAYIHKAQHLEERREMMQWWADYLDANRFRHVVPYGFKKSPGGA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEECCCCCCCCCCCC
LDHMSFQERNDRQVEELKARILADSEWLTASELSAKAGFRSADPEAGPKGWKAAGKIFSL
CCCCCHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCCCCCCCCCHHCCCCEEEE
KVDGEDLYPDYVLDEKMRPLKVVRLILSLFKERKTPWGLAIWFGSANRRLRGGKPKDLLI
EECCCCCCCCHHHHCCCCHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCEEE
SKSELVLMVAQEEIEMREHG
ECCCEEEEEEHHHHHHHCCC
>Mature Secondary Structure 
ALTDLAIRHARPLGKAYRLSDCHGLYIQVNPSGSKLWYLKFRFGNKENRMALGPYPLIS
CCHHHHHHHHCCCCCCEECCCCCEEEEEECCCCCEEEEEEEEECCCCCCEEECCCHHHH
LALAREKQADIRRLILEGINPAEKRREEKRGGEPLYTFESVAREWVSSNVNWSAEHKKRV
HHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCHHHHHHH
LRYFELYVFPTNGSWDITKMKVKDLLVPIKEVEKAGKLDVASRLQQRTACVMRYAVQNGI
HHHHEEEEEECCCCCEEHHHHHHHHHHCHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCC
IDHNPASDLTGAVSTPKVRHHPALDLNLIPDFLERVDDFKGRKLTQLAVKLALLLFIRSS
CCCCCCHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCC
ELRFARWDEIDLHNAMWTIPAEREPIPGVKYSARGAKMRSPHLVPLSHQAIELLREVRQH
CCCEECCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHH
CRPGTELVFPGDHNYRKPMSENTINKALRVMGYDTQKDVCGHGFRTMACSALVESGLWSS
CCCCCEEEECCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHHHHHCCCCCH
DAVERQMSHQERKRVRAAYIHKAQHLEERREMMQWWADYLDANRFRHVVPYGFKKSPGGA
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEECCCCCCCCCCCC
LDHMSFQERNDRQVEELKARILADSEWLTASELSAKAGFRSADPEAGPKGWKAAGKIFSL
CCCCCHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCCCCCCCCCHHCCCCEEEE
KVDGEDLYPDYVLDEKMRPLKVVRLILSLFKERKTPWGLAIWFGSANRRLRGGKPKDLLI
EECCCCCCCCHHHHCCCCHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCEEE
SKSELVLMVAQEEIEMREHG
ECCCEEEEEEHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 7610040; 9278503 [H]