Definition: Gluconacetobacter diazotrophicus PAl 5 chromosome, complete genome.
Accession: NC_010125
Length: 3,944,163

The map label for this gene is cycK [H]

Identifier: 162149211

GI number: 162149211

Start: 3539048

End: 3541027

Strand: Direct

Name: cycK [H]

Synonym: GDI_3441

Alternate gene names: 162149211

Gene position: 3539048-3541027 (Clockwise)

Preceding gene: 162149210

Following gene: 162149212

Centisome position: 89.73
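
The coordinate fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the start coordinate expressed as a percentage of the chromosome length. That definition is an assumption, although it does reproduce the listed value; the function names are illustrative only.

```python
GENOME_LENGTH = 3_944_163          # chromosome length from the record header
START, END = 3_539_048, 3_541_027  # gene coordinates from this record


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of chromosome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                    # 1980
print(round(centisome(START, GENOME_LENGTH), 2))  # 89.73
```

The computed length agrees with the 1980-base gene sequence reported in this record.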

GC content: 71.01
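
GC content is conventionally the percentage of G and C bases in the nucleotide sequence. A minimal sketch of that calculation follows; the function name is hypothetical, and the example input is only the first 15 bases of the gene sequence in this record, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (case-insensitive)."""
    seq = "".join(seq.split()).upper()  # drop line breaks from wrapped FASTA text
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 15 bases of the gene sequence (9 of 15 are G or C)
print(gc_content("ATGACGCCGGAACTG"))  # 60.0
```

Passing the complete 1980-base sequence to the same function gives the whole-gene value.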

Gene sequence:

>1980_bases
ATGACGCCGGAACTGGGCAATTTCGCGCTGGCGCTGGCCTGCTGCCTGGCCGGCGGGCAGGCGATCCTGCCGCTGGTGGG
CGCGCGGCGGCGCGACCCGCGGCTGATGGCGCTGGCCCCCGCCCTGGCGGTGGGGCAGATGCTGGCGCTGGCCTTTTCCT
TCGCATGCCTGATCGATGCGGCGGTGCGTGACGATTTCTCGGTCCAGAACGTCGCGGCCAACAGCGCCGCCGCCAAGCCG
CTGCTGTACAAAATCACCGGCGTATGGGGCAATCACGAGGGATCGGTACTGCTGTGGGCGCTGATCCTGGGGATCTGCGG
CGGGGCGGTGGCACTGTTCGGCCGCAACCTGCCCTCTGCCCTGCGGGCGCGGGTCATCGCGGTCCTGGGCGGCGTTTCGG
CCGGCTTCCAGCTGTTCTGCCTGACGACGTCCAATCCGTTCGACCGCGTCTGGCCCGCGCCGATGGACGGCCAGGGCATG
AACCCGCTGCTGCAGGACCCGGGCCTGGCCTTCCATCCGCCCATTCTCTACACCGGCTATGTCGGCTTCGCCGTGCCCTT
CGCCTTCGCCGTCGCCGCCCTGATCGAGGGACGGGTGGACGCCGCCTGGGGCCGCTGGGTCCGCCCCTGGGCCGTCGCGG
CCTGGTGCTTCCTGACCTGCGGCATCGCGCTGGGGTCGTGGTGGTCGTACTACGTGCTGGGCTGGGGCGGCTACTGGTTC
TGGGACCCGGTGGAAAACGCGTCGCTGATCCCGTGGCTGACCGGCACCGCGCTGGTCCATTCCGCCATCGTGGTGGAAAA
GCGCGAGGCGCTGAAGATCTGGACCGTGCTGCTGGCCATCGGCACATTCTCGTTCTCGCTGTCCGGCACGTTCCTGGTCC
GCTCGGGCATTCTCAATTCCGTCCATGCCTTCGCCAACGACCCGGCGCGCGGCATCTTCATCCTGGGCCTGCTGGCGCTG
GTCATCGGCGGGTCGCTGCTGCTGTTCGCGATCCGCGCGCCGGCGCTGGTGGCGGGCGGCCTGTTCGCGCCGGTCTCGCG
CGAGGGGCTGCTGGTCGTGAACAATATCCTGCTGTGCTCGATCTGCGCGGTGGTGCTGACCGGGACCATGTATCCGCCCT
TCATGTCGCTGCTGTTCGGCAAGACGATTTCGGTCGGCAAGCCGTTCTTCGACGCCACGGCGGCGCCGCTGGCGATTCCG
CTTCTGGCCTTCATGGGGTTCGGCCCGATGATGCCGTGGAAGCGCGCCCAGTTCTGGCCCGTGCTGCGCCGGCTGTGGTG
GGCCGGGATCGTCACCGCCCTCGCCTTCTGCCTGATGGCCTGGCGCATCCGCGACGTGCTGCCGCTGCTGGCCGCGACCG
GCGCGGTCTGGGTGATCGCGGCCAGCGTCGCCGACATTGCCGAGCGCGTCCGCCTGTTCCGCATCCCGCCCGGCGCCAGC
CTGCAGCGCGCGCGCATGCTGCCGCGCGCGGCGCTGGGCGCGGCCCTGGCCCATGCCGGCGTCGGCATCAGCGTGCTGGG
GCTGGCCGCCATGTCGCAGGCCCAGCACCGCATCGTCGAGGTCCGGGTCGGCCAGACCGAGATGCTGGCCGGCGACGCCT
GGACCCTGACCGCGATCCGCGCGGCCCCCGGCCCGAACTACACGTCCCTGATCGCGACGATCGAGGTCCGGCATGACGGC
AGGCTGGTCACCGTGCTGCATCCGTCCAAGCGCACCTTCCCCAGCCAGAACCAGACGACGACCGAGGTCGCCATCCATAC
CAACCTGATGTCCGACCTGTACGGCGTGCTGGGCGACCGGCACGGCACCGACGCCGACCCGACCTACGTGCTGCGCCTGC
ACTACAATCCGCTGGCGCCGTGGATGTGGCTGGGCGGGCTGATCATGGCGCTGGGCGGGGCGCTGTCGCTGTCCGACCGG
CGGACGCGCGTCGGCGCGCCGCGCCGCGCCGCCGCCCCCGGCATGGTGGCCGCGCAATGA

Upstream 100 bases:

>100_bases
GCAGCGGCAAGTGGGACCCGCGCTACGGCAAGGCCCCCGACGCCGCAAGCTGGAACACCATGACCGTGGGCGATGCGCGC
CACGGCGAGGCCAACCGGTC

Downstream 100 bases:

>100_bases
CCGGTACCGTGCCGCATCCCCCGAACCTGGCGCGGCGGCGCCTGCTGATGGCGGCGCCGCTGGCCGCCGCCGGGGTGGCG
GGAGTGGCGTTCTGGCGCAT

Product: cytochrome c-type biogenesis protein cycK

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 659; Mature: 658
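
The residue counts follow from the gene length: 1980 bases make 660 codons, the last of which is a stop codon, leaving 659 translated residues; the mature count is one lower because the mature sequence in this record lacks the initiator methionine. The sketch below illustrates this with the standard codon assignments. The helper names are hypothetical, and the example inputs are short fragments taken from the start and end of the gene sequence.

```python
# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def residue_counts(n_bases: int) -> tuple:
    """(translated, mature) residue counts for a CDS that includes its stop codon.

    Assumes the mature protein differs only by loss of the initiator Met.
    """
    translated = n_bases // 3 - 1
    return translated, translated - 1


print(translate("ATGACGCCGGAACTG"))  # MTPEL (first five residues)
print(translate("GCGCAATGA"))        # AQ (last two residues, then stop)
print(residue_counts(1980))          # (659, 658)
```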

Protein sequence:

>659_residues
MTPELGNFALALACCLAGGQAILPLVGARRRDPRLMALAPALAVGQMLALAFSFACLIDAAVRDDFSVQNVAANSAAAKP
LLYKITGVWGNHEGSVLLWALILGICGGAVALFGRNLPSALRARVIAVLGGVSAGFQLFCLTTSNPFDRVWPAPMDGQGM
NPLLQDPGLAFHPPILYTGYVGFAVPFAFAVAALIEGRVDAAWGRWVRPWAVAAWCFLTCGIALGSWWSYYVLGWGGYWF
WDPVENASLIPWLTGTALVHSAIVVEKREALKIWTVLLAIGTFSFSLSGTFLVRSGILNSVHAFANDPARGIFILGLLAL
VIGGSLLLFAIRAPALVAGGLFAPVSREGLLVVNNILLCSICAVVLTGTMYPPFMSLLFGKTISVGKPFFDATAAPLAIP
LLAFMGFGPMMPWKRAQFWPVLRRLWWAGIVTALAFCLMAWRIRDVLPLLAATGAVWVIAASVADIAERVRLFRIPPGAS
LQRARMLPRAALGAALAHAGVGISVLGLAAMSQAQHRIVEVRVGQTEMLAGDAWTLTAIRAAPGPNYTSLIATIEVRHDG
RLVTVLHPSKRTFPSQNQTTTEVAIHTNLMSDLYGVLGDRHGTDADPTYVLRLHYNPLAPWMWLGGLIMALGGALSLSDR
RTRVGAPRRAAAPGMVAAQ

Sequences:

>Translated_659_residues
MTPELGNFALALACCLAGGQAILPLVGARRRDPRLMALAPALAVGQMLALAFSFACLIDAAVRDDFSVQNVAANSAAAKP
LLYKITGVWGNHEGSVLLWALILGICGGAVALFGRNLPSALRARVIAVLGGVSAGFQLFCLTTSNPFDRVWPAPMDGQGM
NPLLQDPGLAFHPPILYTGYVGFAVPFAFAVAALIEGRVDAAWGRWVRPWAVAAWCFLTCGIALGSWWSYYVLGWGGYWF
WDPVENASLIPWLTGTALVHSAIVVEKREALKIWTVLLAIGTFSFSLSGTFLVRSGILNSVHAFANDPARGIFILGLLAL
VIGGSLLLFAIRAPALVAGGLFAPVSREGLLVVNNILLCSICAVVLTGTMYPPFMSLLFGKTISVGKPFFDATAAPLAIP
LLAFMGFGPMMPWKRAQFWPVLRRLWWAGIVTALAFCLMAWRIRDVLPLLAATGAVWVIAASVADIAERVRLFRIPPGAS
LQRARMLPRAALGAALAHAGVGISVLGLAAMSQAQHRIVEVRVGQTEMLAGDAWTLTAIRAAPGPNYTSLIATIEVRHDG
RLVTVLHPSKRTFPSQNQTTTEVAIHTNLMSDLYGVLGDRHGTDADPTYVLRLHYNPLAPWMWLGGLIMALGGALSLSDR
RTRVGAPRRAAAPGMVAAQ
>Mature_658_residues
TPELGNFALALACCLAGGQAILPLVGARRRDPRLMALAPALAVGQMLALAFSFACLIDAAVRDDFSVQNVAANSAAAKPL
LYKITGVWGNHEGSVLLWALILGICGGAVALFGRNLPSALRARVIAVLGGVSAGFQLFCLTTSNPFDRVWPAPMDGQGMN
PLLQDPGLAFHPPILYTGYVGFAVPFAFAVAALIEGRVDAAWGRWVRPWAVAAWCFLTCGIALGSWWSYYVLGWGGYWFW
DPVENASLIPWLTGTALVHSAIVVEKREALKIWTVLLAIGTFSFSLSGTFLVRSGILNSVHAFANDPARGIFILGLLALV
IGGSLLLFAIRAPALVAGGLFAPVSREGLLVVNNILLCSICAVVLTGTMYPPFMSLLFGKTISVGKPFFDATAAPLAIPL
LAFMGFGPMMPWKRAQFWPVLRRLWWAGIVTALAFCLMAWRIRDVLPLLAATGAVWVIAASVADIAERVRLFRIPPGASL
QRARMLPRAALGAALAHAGVGISVLGLAAMSQAQHRIVEVRVGQTEMLAGDAWTLTAIRAAPGPNYTSLIATIEVRHDGR
LVTVLHPSKRTFPSQNQTTTEVAIHTNLMSDLYGVLGDRHGTDADPTYVLRLHYNPLAPWMWLGGLIMALGGALSLSDRR
TRVGAPRRAAAPGMVAAQ

Specific function: Required for the biogenesis of c-type cytochromes. Possible subunit of a heme lyase [H]

COG id: COG1138

COG function: function code O; Cytochrome c biogenesis factor

Gene ontology:

Cell location: Cell inner membrane; Multi-pass membrane protein (Potential) [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the ccmF/cycK/ccl1/nrfE/ccsA family [H]

Homologues:

Organism=Escherichia coli, GI1788524, Length=661, Percent_Identity=43.4190620272315, Blast_Score=480, Evalue=1e-136
Organism=Escherichia coli, GI1790511, Length=612, Percent_Identity=34.3137254901961, Blast_Score=255, Evalue=8e-69
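
Each homologue line is a comma-separated list of `key=value` fields, with the GI number given as a bare `GI…` token. A minimal parsing sketch is shown below; the function name and the choice of a plain dictionary are illustrative assumptions, not part of the source database.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dict with typed numeric fields."""
    record = {}
    # Tolerate an optional trailing comma at the end of the line.
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]  # bare token such as GI1788524
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


hit = parse_homologue(
    "Organism=Escherichia coli, GI1788524, Length=661, "
    "Percent_Identity=43.4190620272315, Blast_Score=480, Evalue=1e-136"
)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 2))
# Escherichia coli 1788524 43.42
```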

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR002541
- InterPro:   IPR003567
- InterPro:   IPR003568 [H]

Pfam domain/function: PF01578 Cytochrom_C_asm [H]

EC number: NA

Molecular weight: Translated: 70363; Mature: 70232
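
The 131 Da difference between the translated and mature weights is what one would expect from removing a single methionine residue (average residue mass about 131.19 Da). The sketch below shows one common way to estimate molecular weight, summing average residue masses and adding one water. It is an assumption that the source database used this method, and the function name is hypothetical.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water per chain for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Average molecular weight of an unmodified polypeptide chain."""
    protein = "".join(protein.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER


# Removing an N-terminal Met lowers the weight by one Met residue mass.
delta = molecular_weight("MTPEL") - molecular_weight("TPEL")
print(round(delta, 2))  # 131.19
print(70363 - 70232)    # 131, the difference listed in this record
```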

Theoretical pI: Translated: 10.03; Mature: 10.03

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.5 %Cys     (Translated Protein)
2.7 %Met     (Translated Protein)
4.2 %Cys+Met (Translated Protein)
1.5 %Cys     (Mature Protein)
2.6 %Met     (Mature Protein)
4.1 %Cys+Met (Mature Protein)
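
These percentages are residue counts divided by sequence length. The mature values differ from the translated ones only because the mature sequence lacks the initiator methionine. A minimal sketch of the calculation, with a hypothetical function name and a toy input rather than the full protein:

```python
def cys_met_content(protein: str) -> tuple:
    """Return (%Cys, %Met, %Cys+Met) for a one-letter protein sequence."""
    protein = "".join(protein.split()).upper()
    if not protein:
        raise ValueError("empty sequence")
    n = len(protein)
    cys = 100.0 * protein.count("C") / n
    met = 100.0 * protein.count("M") / n
    return cys, met, cys + met


# Toy example: one Cys and one Met in four residues
print(cys_met_content("MCAA"))  # (25.0, 25.0, 50.0)
# Dropping the leading Met changes the Met fraction but not the Cys count
print(cys_met_content("CAA")[1])  # 0.0
```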

Secondary structure:

>Translated Secondary Structure
MTPELGNFALALACCLAGGQAILPLVGARRRDPRLMALAPALAVGQMLALAFSFACLIDA
CCCCCHHHHHHHHHHHCCCCHHHHHHCCCCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHH
AVRDDFSVQNVAANSAAAKPLLYKITGVWGNHEGSVLLWALILGICGGAVALFGRNLPSA
HHHCCCCHHHHHCCCHHCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHH
LRARVIAVLGGVSAGFQLFCLTTSNPFDRVWPAPMDGQGMNPLLQDPGLAFHPPILYTGY
HHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCHHHHCH
VGFAVPFAFAVAALIEGRVDAAWGRWVRPWAVAAWCFLTCGIALGSWWSYYVLGWGGYWF
HHHHHHHHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHHHHHHHCHHHEEEEECCCEEE
WDPVENASLIPWLTGTALVHSAIVVEKREALKIWTVLLAIGTFSFSLSGTFLVRSGILNS
ECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCHHHHHHHHHHH
VHAFANDPARGIFILGLLALVIGGSLLLFAIRAPALVAGGLFAPVSREGLLVVNNILLCS
HHHHCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHHHHH
ICAVVLTGTMYPPFMSLLFGKTISVGKPFFDATAAPLAIPLLAFMGFGPMMPWKRAQFWP
HHHHHHHCCCCHHHHHHHHCCCCCCCCCCHHCCHHHHHHHHHHHHCCCCCCCCCHHHHHH
VLRRLWWAGIVTALAFCLMAWRIRDVLPLLAATGAVWVIAASVADIAERVRLFRIPPGAS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHEEEECCCCCC
LQRARMLPRAALGAALAHAGVGISVLGLAAMSQAQHRIVEVRVGQTEMLAGDAWTLTAIR
HHHHHHCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHEEEEEEECCHHEECCCCEEEEEEE
AAPGPNYTSLIATIEVRHDGRLVTVLHPSKRTFPSQNQTTTEVAIHTNLMSDLYGVLGDR
CCCCCCCEEEEEEEEEECCCEEEEEECCCCCCCCCCCCCEEEEEEHHHHHHHHHHHHCCC
HGTDADPTYVLRLHYNPLAPWMWLGGLIMALGGALSLSDRRTRVGAPRRAAAPGMVAAQ
CCCCCCCEEEEEEECCCCCHHHHHHHHHHHHCCCCCCCCCHHHCCCCCCCCCCCCCCCC
>Mature Secondary Structure 
TPELGNFALALACCLAGGQAILPLVGARRRDPRLMALAPALAVGQMLALAFSFACLIDA
CCCCHHHHHHHHHHHCCCCHHHHHHCCCCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHH
AVRDDFSVQNVAANSAAAKPLLYKITGVWGNHEGSVLLWALILGICGGAVALFGRNLPSA
HHHCCCCHHHHHCCCHHCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHH
LRARVIAVLGGVSAGFQLFCLTTSNPFDRVWPAPMDGQGMNPLLQDPGLAFHPPILYTGY
HHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCHHHHCH
VGFAVPFAFAVAALIEGRVDAAWGRWVRPWAVAAWCFLTCGIALGSWWSYYVLGWGGYWF
HHHHHHHHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHHHHHHHCHHHEEEEECCCEEE
WDPVENASLIPWLTGTALVHSAIVVEKREALKIWTVLLAIGTFSFSLSGTFLVRSGILNS
ECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCHHHHHHHHHHH
VHAFANDPARGIFILGLLALVIGGSLLLFAIRAPALVAGGLFAPVSREGLLVVNNILLCS
HHHHCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCEEHHHHHHHHH
ICAVVLTGTMYPPFMSLLFGKTISVGKPFFDATAAPLAIPLLAFMGFGPMMPWKRAQFWP
HHHHHHHCCCCHHHHHHHHCCCCCCCCCCHHCCHHHHHHHHHHHHCCCCCCCCCHHHHHH
VLRRLWWAGIVTALAFCLMAWRIRDVLPLLAATGAVWVIAASVADIAERVRLFRIPPGAS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHEEEECCCCCC
LQRARMLPRAALGAALAHAGVGISVLGLAAMSQAQHRIVEVRVGQTEMLAGDAWTLTAIR
HHHHHHCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHEEEEEEECCHHEECCCCEEEEEEE
AAPGPNYTSLIATIEVRHDGRLVTVLHPSKRTFPSQNQTTTEVAIHTNLMSDLYGVLGDR
CCCCCCCEEEEEEEEEECCCEEEEEECCCCCCCCCCCCCEEEEEEHHHHHHHHHHHHCCC
HGTDADPTYVLRLHYNPLAPWMWLGGLIMALGGALSLSDRRTRVGAPRRAAAPGMVAAQ
CCCCCCCEEEEEEECCCCCHHHHHHHHHHHHCCCCCCCCCHHHCCCCCCCCCCCCCCCC
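
Each prediction line pairs one state letter with the residue above it. Assuming the usual three-state convention (H = helix, E = strand, C = coil), which this record does not spell out, the composition of a prediction string can be summarized as sketched below. The function name is hypothetical.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of H, E and C states in a secondary-structure string."""
    states = "".join(states.split()).upper()
    if not states:
        raise ValueError("empty prediction string")
    counts = Counter(states)
    total = len(states)
    return {s: 100.0 * counts.get(s, 0) / total for s in "HEC"}


# Toy example: two helix, one strand, one coil position
print(ss_composition("HHEC"))  # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```

Concatenating the prediction lines of the translated or mature block and passing the result to the same function gives the whole-protein composition.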

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 6.0

TargetDB status: NA

Availability: NA

References: 7715601; 12597275 [H]