| Definition | Sinorhizobium medicae WSM419 chromosome, complete genome. |
|---|---|
| Accession | NC_009636 |
| Length | 3,781,904 |
The map label for this gene is cycK [H]
Identifier: 150395859
GI number: 150395859
Start: 679959
End: 681941
Strand: Direct
Name: cycK [H]
Synonym: Smed_0635
Alternate gene names: 150395859
Gene position: 679959-681941 (Clockwise)
Preceding gene: 150395858
Following gene: 150395860
Centisome position: 17.98
GC content: 64.09
Gene sequence:
>1983_bases ATGATCATCGAACTCGGGCATTACGCGCTGGTCCTGGCGCTCGCGACCGCGATCATCCAGGGCCTCCTGCCCGTCGTCGG GGTACGGCGCGGCGATCCCGCCTTGATGGCGCTTGCTGCAAACGCGGCGCTCGTCTGCTTTCTGCTGGTCGCATTCTCCT TCGCCGTGCTGACATTCGCTTATGTGACGTCGGATTTCTCGGTCAAGAACGTATGGGAGAACTCGCATTCCCTTAAGCCG CTGATCTACAAGGTCACCGGGGTCTGGGGAAATCACGAAGGCTCCATGCTCCTGTGGCTGCTGATCCTCGTCTTCTTTTC CGCGATGGTCGCTCTCTTCGGCCGAAACCTGCCGGAGACGTTGAAGGCGAATGTGCTTGCGGTGCAGGCCTGGATCGCGA CGGCATTCACACTCTTCGTCCTGCTGACCTCCAATCCCTTTGCGCGGCTGGTGCCGGCGCCTGGAGAGGGCAGGGACTTG AATCCGGTGCTTCAGGATATCGGGCTGGCGATCCACCCGCCCTTGCTCTATCTCGGCTATGTCGGCTTTTCAGTCTGCTT TTCTTTCGCGGTGGCAGCTCTCATCGAGGGGCGGATCGACGCGGCCTGGGCCCGTTGGGTGCGGCCTTGGACGCTCGCCG CCTGGACCTTTCTCACGGCCGGCATCGCAATGGGCTCGTACTGGGCCTATTACGAGCTTGGCTGGGGCGGTTGGTGGTTC TGGGACCCGGTCGAGAATGCCTCCTTCATGCCCTGGCTTGCCGGAACGGCCCTTCTGCACTCGGCGCTGGTCATGGAAAA GCGCGAAGCGCTGAAGATCTGGACCGTGCTGCTGGCGATCACGACCTTCTCGCTGTCGCTGCTCGGGACCTTCCTGGTAC GCTCCGGCGTGCTGACATCGGTCCACGCTTTCGCCACCGACCCGACCCGCGGCGTCTTCATTCTCGCCATTCTGGTGGTC TTCATCGGCGGCGCCTTTTCGCTCTTTGCATTTCGTGCCTCCCACCTCAAAGCCGGGGGGATATTCGCGCCGATCTCGCG CGAGGGCGCCCTTGTCCTGAACAACCTGATCCTGACGACGGCAACCGCGACGGTGCTGACCGGCACTCTCTATCCGCTGG TGCTCGAAGCGCTGACCGGCGACAAGATCTCGGTCGGCGCGCCGTTTTTCAACATGACCTTCGGCCTGCTGATGCTGCCG CTCGTCGCGGTCGTTCCCTTCGGTCCGCTGCTTGCCTGGAAGCGCGGCGACCTCGCCGGCGCGGCGCAGCGGCTGTTTAC GGCTGCTGCCGTAGGCCTGCTTGCCGCCGCAGCCTGCTATTACGCGGTAAATGGCGGCCCGGTGCTGGCTCCACTCGGCC TCGGCCTCGGCGTTTACCTCATCATCGGTGCGCTCACCGACCTTGTCCTGCGCTCCGGACTCGGCAAGGTAAAGGCCGGC GTCGCCTGGAAGCGTTTCTCAGGCCTGCCACGTTCATCTATCGGCACGGCGCTGGCCCATATCGGTCTCGGCATTACCCT GATAGGGATCGTCGCGGTGACAGCCTTCGAGACGGAAACGGTCGTCGAGATGAAACCGGGGGCGGTGGTCGATGTCGGGC GCTACAGCCTGCGTTTCGACGGCATGCGCGAAGGGCGCGGGCCGAACTACACCGAGAATGCCGGCCATTTCACCATCAGC CGCGGCGGCGTCGCCGTCAGTGAGGTCTGGTCGTCGAAGCGGCTCTATTCGGCGCGCCGCATGCCGACGACCGAAGCCGG GATCCGGACCTTCGGCGTGAGCCAGCTCTATGTTTCCCTGGGCGATGACATGGCCGATGGAGGCATCGTCGTGCGCATCT GGTGGAAACCGCTGATCCTGTGCATCTGGGGCGGTGCGCTCGTGATGATGGCCGGCGGAATAGTCTCGCTCACCGACAGG 
CGTCTTCGCGTGGGCGCACCGGCGCGTGCCAGAAGGCCGCTTGCGGTGGGTGCTGCGGAATGA
Upstream 100 bases:
>100_bases ACACCGTGCTCGCCAAGCATGATGAGACCTACATGCCGAAGGATGTCGCAGATCGCCTCAAGGCCCAGGGGGTCACGCTT GGCGGGGAGGAAAACATCCG
Downstream 100 bases:
>100_bases TCCGGCTGCTCCTCGCACTTCTCCTCTTGCTGGCATCCGCGATGCCGGGTCTCGCCGTGAATCCTGACGAGGTGCTCGCA GATCCGGCGCTTGAAACCCG
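The coordinate fields above (Start, End, genome Length, Centisome position) can be cross-checked against the stated sequence length with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the centisome position is the start coordinate expressed as a percentage of the genome length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the record above.
GENOME_LENGTH = 3_781_904  # "Length" of NC_009636
GENE_START = 679_959       # "Start"
GENE_END = 681_941         # "End"


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return 100.0 * start / genome_length


length_bp = gene_length(GENE_START, GENE_END)
codons, remainder = divmod(length_bp, 3)

print(length_bp)                                       # gene length in bases
print(codons, remainder)                               # whole codons, leftover bases
print(round(centisome(GENE_START, GENOME_LENGTH), 2))  # centisome position
```

Under these assumptions the gene spans 1983 bases, which matches the ">1983_bases" header, and divides into 661 codons: 660 amino acids plus the terminal TGA stop codon.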
Product: cytochrome c-type biogenesis protein CcmF
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 660; Mature: 660
Protein sequence:
>660_residues MIIELGHYALVLALATAIIQGLLPVVGVRRGDPALMALAANAALVCFLLVAFSFAVLTFAYVTSDFSVKNVWENSHSLKP LIYKVTGVWGNHEGSMLLWLLILVFFSAMVALFGRNLPETLKANVLAVQAWIATAFTLFVLLTSNPFARLVPAPGEGRDL NPVLQDIGLAIHPPLLYLGYVGFSVCFSFAVAALIEGRIDAAWARWVRPWTLAAWTFLTAGIAMGSYWAYYELGWGGWWF WDPVENASFMPWLAGTALLHSALVMEKREALKIWTVLLAITTFSLSLLGTFLVRSGVLTSVHAFATDPTRGVFILAILVV FIGGAFSLFAFRASHLKAGGIFAPISREGALVLNNLILTTATATVLTGTLYPLVLEALTGDKISVGAPFFNMTFGLLMLP LVAVVPFGPLLAWKRGDLAGAAQRLFTAAAVGLLAAAACYYAVNGGPVLAPLGLGLGVYLIIGALTDLVLRSGLGKVKAG VAWKRFSGLPRSSIGTALAHIGLGITLIGIVAVTAFETETVVEMKPGAVVDVGRYSLRFDGMREGRGPNYTENAGHFTIS RGGVAVSEVWSSKRLYSARRMPTTEAGIRTFGVSQLYVSLGDDMADGGIVVRIWWKPLILCIWGGALVMMAGGIVSLTDR RLRVGAPARARRPLAVGAAE
Sequences:
>Translated_660_residues and >Mature_660_residues: both are identical to the 660-residue protein sequence listed above.
Specific function: Required for the biogenesis of c-type cytochromes. Possible subunit of a heme lyase [H]
COG id: COG1138
COG function: function code O; Cytochrome c biogenesis factor
Gene ontology:
Cell location: Cell inner membrane; Multi-pass membrane protein (Potential) [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the ccmF/cycK/ccl1/nrfE/ccsA family [H]
Homologues:
Organism=Escherichia coli, GI1788524, Length=659, Percent_Identity=44.6130500758725, Blast_Score=509, Evalue=1e-145
Organism=Escherichia coli, GI1790511, Length=384, Percent_Identity=47.1354166666667, Blast_Score=278, Evalue=8e-76
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR002541
- InterPro: IPR003567
- InterPro: IPR003568 [H]
Pfam domain/function: PF01578 Cytochrom_C_asm [H]
EC number: NA
Molecular weight: Translated: 70530; Mature: 70530
Theoretical pI: Translated: 9.92; Mature: 9.92
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.6 %Cys (Translated Protein)
2.3 %Met (Translated Protein)
2.9 %Cys+Met (Translated Protein)
0.6 %Cys (Mature Protein)
2.3 %Met (Mature Protein)
2.9 %Cys+Met (Mature Protein)
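The Cys/Met percentages above are residue counts divided by the protein length. The following is a minimal sketch of that calculation; the function name `residue_percent` is hypothetical, and rounding to one decimal place is an assumption chosen to match the precision shown in the record. To keep the example short, it uses a stand-in sequence with the same length and the same Cys and Met counts as the 660-residue protein (4 Cys and 15 Met, as counted from the protein sequence listed in this record) rather than the real sequence.

```python
def residue_percent(sequence: str, residues: str) -> float:
    """Percentage of positions in `sequence` occupied by any of `residues`.

    Whitespace is ignored so that sequences wrapped over several lines
    can be passed in as-is. The result is rounded to one decimal place.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("sequence is empty")
    # set() guards against double counting if a residue letter is repeated.
    hits = sum(seq.count(r) for r in set(residues.upper()))
    return round(100.0 * hits / len(seq), 1)


# Stand-in for the real protein: 660 residues, 4 Cys, 15 Met (illustrative).
stand_in = "C" * 4 + "M" * 15 + "A" * 641

print(residue_percent(stand_in, "C"))   # %Cys
print(residue_percent(stand_in, "M"))   # %Met
print(residue_percent(stand_in, "CM"))  # %Cys+Met
```

With these counts the function gives 0.6, 2.3 and 2.9, in agreement with the values listed above. Passing the actual protein sequence instead of the stand-in should give the same figures.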
Secondary structure:
>Translated Secondary Structure
MIIELGHYALVLALATAIIQGLLPVVGVRRGDPALMALAANAALVCFLLVAFSFAVLTFA
CEEECHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
YVTSDFSVKNVWENSHSLKPLIYKVTGVWGNHEGSMLLWLLILVFFSAMVALFGRNLPET
HHHCCCCHHHHHCCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCHH
LKANVLAVQAWIATAFTLFVLLTSNPFARLVPAPGEGRDLNPVLQDIGLAIHPPLLYLGY
HHHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCCHHHHHCCCCCCCHHHHHHH
VGFSVCFSFAVAALIEGRIDAAWARWVRPWTLAAWTFLTAGIAMGSYWAYYELGWGGWWF
HHHHHHHHHHHHHHHHCCHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCEEE
WDPVENASFMPWLAGTALLHSALVMEKREALKIWTVLLAITTFSLSLLGTFLVRSGVLTS
ECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHH
VHAFATDPTRGVFILAILVVFIGGAFSLFAFRASHLKAGGIFAPISREGALVLNNLILTT
HHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCEECCCCCCCHHHHHHHHHHH
ATATVLTGTLYPLVLEALTGDKISVGAPFFNMTFGLLMLPLVAVVPFGPLLAWKRGDLAG
HHHHHHHHHHHHHHHHHHCCCCEEECCCHHHHHHHHHHHHHHHHHCCCHHHHCCCCCHHH
AAQRLFTAAAVGLLAAAACYYAVNGGPVLAPLGLGLGVYLIIGALTDLVLRSGLGKVKAG
HHHHHHHHHHHHHHHHHHHHHEECCCCEEEHHHHHHHHHHHHHHHHHHHHHHCCCHHHHC
VAWKRFSGLPRSSIGTALAHIGLGITLIGIVAVTAFETETVVEMKPGAVVDVGRYSLRFD
CHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCEEECCCEEEEEC
GMREGRGPNYTENAGHFTISRGGVAVSEVWSSKRLYSARRMPTTEAGIRTFGVSQLYVSL
CCCCCCCCCCCCCCCEEEEECCCEEHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH
GDDMADGGIVVRIWWKPLILCIWGGALVMMAGGIVSLTDRRLRVGAPARARRPLAVGAAE
CCCCCCCCEEEEEHHHHHHHHHHHHHHHHHHCCCEEECCCCEECCCCCCCCCCCCCCCCC
>Mature Secondary Structure
Identical to the translated secondary structure above.
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 7715602; 11481430 [H]