Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

Click here to switch to the map view.

The map label for this gene is ykgF [H]

Identifier: 157159813

GI number: 157159813

Start: 372935

End: 374362

Strand: Direct

Name: ykgF [H]

Synonym: EcHS_A0360

Alternate gene names: 157159813

Gene position: 372935-374362 (Clockwise)

Preceding gene: 157159812

Following gene: 157159814

Centisome position: 8.03

GC content: 50.35

Gene sequence:

>1428_bases
ATGTCGATCAAAACCAGTAATACAGATTTTAAGACACGCATCCGTCAGCAAATTGAAGATCCGATCATGCGCAAAGCGGT
GGCAAACGCGCAGCAGCGTATTGGGGCAAATCGGCAAAAAATGGTCGATGAATTGGGGCACTGGGAGGAGTGGCGCGATC
GGGCCGCCCAGATACGTGATCATGTTCTGAGTAATCTCGACGCTTATCTGTACCAGCTCTCAGAAAAAGTGACGCAAAAC
GGCGGTCACGTCTGGTTTGCAAAAACCAAAGAAGACGCTACCCGCTACATTTTACAGGTTGCCCAACGCAAAAATGCCCG
GAAGGTGGTGAAATCTAAATCGATGGTGACCGAAGAGATTGGTGTCAATCATGTGTTGCAGGATGCTGGCATTCAGGTGA
TTGAAACCGATCTGGGTGAATATATTCTCCAGCTGGATCAAGATCCGCCATCTCATGTTGTGGTCCCGGCAATTCATAAA
GATCGCCATCAGATCCGTCGAGTGCTACACGAACGTCTGGGCTATGAGGGGCCGGAAACGCCTGAAGCGATGACCTTATT
CATCCGGCAAAAAATCCGCGAAGATTTCCTCAGTGCTGAAATAGGTATTACCGGCTGTAATTTCGCGGTGGCAGAGACAG
GTTCGGTATGCCTGGTGACCAATGAAGGTAATGCGCGAATGTGTACCACGCTGCCTAAAACGCATATTGCAGTGATGGGA
ATGGAGCGTATTGCCCCCACGTTTGCCGAGGTAGATGTATTGATCACCATGCTGGCGCGCAGTGCCGTTGGTGCACGTTT
GACGGGATACAACACCTGGCTGACAGGACCGCGCGAAGCGGGGCACGTTGATGGTCCTGAAGAGTTTCATCTGGTTATTG
TCGATAACGGGCGTTCTGAGGTGCTGGCCTCTGAATTTCGGGATGTGCTGCGCTGTATTCGCTGCGGGGCTTGTATGAAT
ACTTGTCCGGCATATCGCCATATTGGCGGTCATGGATATGGCTCTATTTATCCAGGGCCAATTGGTGCGGTGATTTCTCC
GCTACTTGGCGGCTATAAAGATTTTAAAGATTTACCCTACGCCTGCTCTTTATGCACTGCTTGTGACAGCGTGTGTCCGG
TGCGTATTCCGCTGTCAAAACTGATTTTGCGTCATCGTCGGGTGATGGCTGAAAAAGGGATCACCGCAAAAGCAGAGCAA
CGGGCGATAAAAATGTTCGCTTATGCCAATAGTCATCCAGGATTGTGGAAAGTCGGGATGATGGCCGGTGCTCATGCGGC
AAGCTGGTTTATCAATGGCGGCAAAACACCACTCAAATTTGGCGCGATTAGCGACTGGATGGAAGCACGCGATCTTCCTG
AAGCTGACGGAGAGAGTTTCCGTAGTTGGTTTAAGAAACATCAGGCGCAGGAGAAAAAGAATGGATAA

Upstream 100 bases:

>100_bases
AGTTGCCTGCTGAACATCAGTGGGCGATTACAACGGGAAGGGCAGAAAGTCAAAGTGATGCATATTGCTGAAGTGTTGAT
GAGCCGCTGAGGATATAAAG

Downstream 100 bases:

>100_bases
TCGGAGCGAATTTTTGAATAACGTTGCTCAGGCACTGGGTCGCCCGCTGCGACTTGAACCGCAAGCAGAAGATGCGCCGC
TTAACAACTATGCTAACGAG

Product: iron-sulfur cluster binding protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 475; Mature: 474

Protein sequence:

>475_residues
MSIKTSNTDFKTRIRQQIEDPIMRKAVANAQQRIGANRQKMVDELGHWEEWRDRAAQIRDHVLSNLDAYLYQLSEKVTQN
GGHVWFAKTKEDATRYILQVAQRKNARKVVKSKSMVTEEIGVNHVLQDAGIQVIETDLGEYILQLDQDPPSHVVVPAIHK
DRHQIRRVLHERLGYEGPETPEAMTLFIRQKIREDFLSAEIGITGCNFAVAETGSVCLVTNEGNARMCTTLPKTHIAVMG
MERIAPTFAEVDVLITMLARSAVGARLTGYNTWLTGPREAGHVDGPEEFHLVIVDNGRSEVLASEFRDVLRCIRCGACMN
TCPAYRHIGGHGYGSIYPGPIGAVISPLLGGYKDFKDLPYACSLCTACDSVCPVRIPLSKLILRHRRVMAEKGITAKAEQ
RAIKMFAYANSHPGLWKVGMMAGAHAASWFINGGKTPLKFGAISDWMEARDLPEADGESFRSWFKKHQAQEKKNG

Sequences:

>Translated_475_residues
MSIKTSNTDFKTRIRQQIEDPIMRKAVANAQQRIGANRQKMVDELGHWEEWRDRAAQIRDHVLSNLDAYLYQLSEKVTQN
GGHVWFAKTKEDATRYILQVAQRKNARKVVKSKSMVTEEIGVNHVLQDAGIQVIETDLGEYILQLDQDPPSHVVVPAIHK
DRHQIRRVLHERLGYEGPETPEAMTLFIRQKIREDFLSAEIGITGCNFAVAETGSVCLVTNEGNARMCTTLPKTHIAVMG
MERIAPTFAEVDVLITMLARSAVGARLTGYNTWLTGPREAGHVDGPEEFHLVIVDNGRSEVLASEFRDVLRCIRCGACMN
TCPAYRHIGGHGYGSIYPGPIGAVISPLLGGYKDFKDLPYACSLCTACDSVCPVRIPLSKLILRHRRVMAEKGITAKAEQ
RAIKMFAYANSHPGLWKVGMMAGAHAASWFINGGKTPLKFGAISDWMEARDLPEADGESFRSWFKKHQAQEKKNG
>Mature_474_residues
SIKTSNTDFKTRIRQQIEDPIMRKAVANAQQRIGANRQKMVDELGHWEEWRDRAAQIRDHVLSNLDAYLYQLSEKVTQNG
GHVWFAKTKEDATRYILQVAQRKNARKVVKSKSMVTEEIGVNHVLQDAGIQVIETDLGEYILQLDQDPPSHVVVPAIHKD
RHQIRRVLHERLGYEGPETPEAMTLFIRQKIREDFLSAEIGITGCNFAVAETGSVCLVTNEGNARMCTTLPKTHIAVMGM
ERIAPTFAEVDVLITMLARSAVGARLTGYNTWLTGPREAGHVDGPEEFHLVIVDNGRSEVLASEFRDVLRCIRCGACMNT
CPAYRHIGGHGYGSIYPGPIGAVISPLLGGYKDFKDLPYACSLCTACDSVCPVRIPLSKLILRHRRVMAEKGITAKAEQR
AIKMFAYANSHPGLWKVGMMAGAHAASWFINGGKTPLKFGAISDWMEARDLPEADGESFRSWFKKHQAQEKKNG

Specific function: Unknown

COG id: COG1139

COG function: function code C; Uncharacterized conserved protein containing a ferredoxin-like domain

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 2 4Fe-4S ferredoxin-type domains [H]

Homologues:

Organism=Escherichia coli, GI1786498, Length=475, Percent_Identity=99.3684210526316, Blast_Score=989, Evalue=0.0,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR017896
- InterPro:   IPR017900
- InterPro:   IPR003741
- InterPro:   IPR004452
- InterPro:   IPR002698
- InterPro:   IPR012285
- InterPro:   IPR009051 [H]

Pfam domain/function: PF02589 DUF162 [H]

EC number: NA

Molecular weight: Translated: 53021; Mature: 52889

Theoretical pI: Translated: 8.60; Mature: 8.60

Prosite motif: PS00198 4FE4S_FERREDOXIN

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.3 %Cys     (Translated Protein)
3.2 %Met     (Translated Protein)
5.5 %Cys+Met (Translated Protein)
2.3 %Cys     (Mature Protein)
3.0 %Met     (Mature Protein)
5.3 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MSIKTSNTDFKTRIRQQIEDPIMRKAVANAQQRIGANRQKMVDELGHWEEWRDRAAQIRD
CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCHHHHHHHHHHHHH
HVLSNLDAYLYQLSEKVTQNGGHVWFAKTKEDATRYILQVAQRKNARKVVKSKSMVTEEI
HHHHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
GVNHVLQDAGIQVIETDLGEYILQLDQDPPSHVVVPAIHKDRHQIRRVLHERLGYEGPET
HHHHHHHHCCCEEEHHHHHHHHHHCCCCCCCCEEEECHHHHHHHHHHHHHHHHCCCCCCC
PEAMTLFIRQKIREDFLSAEIGITGCNFAVAETGSVCLVTNEGNARMCTTLPKTHIAVMG
HHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCEEEEECCCCCEEEECCCCHHHHHHH
MERIAPTFAEVDVLITMLARSAVGARLTGYNTWLTGPREAGHVDGPEEFHLVIVDNGRSE
HHHHHHHHHHHHHHHHHHHHHHHCCEECCCCCCCCCCCCCCCCCCCCCEEEEEEECCCHH
VLASEFRDVLRCIRCGACMNTCPAYRHIGGHGYGSIYPGPIGAVISPLLGGYKDFKDLPY
HHHHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCCCCHHHHHHHHHHCCHHHHHCCHH
ACSLCTACDSVCPVRIPLSKLILRHRRVMAEKGITAKAEQRAIKMFAYANSHPGLWKVGM
HHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHEECCCCCEEEHHH
MAGAHAASWFINGGKTPLKFGAISDWMEARDLPEADGESFRSWFKKHQAQEKKNG
HHCCHHHHHEECCCCCCEECCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure 
SIKTSNTDFKTRIRQQIEDPIMRKAVANAQQRIGANRQKMVDELGHWEEWRDRAAQIRD
CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCHHHHHHHHHHHHH
HVLSNLDAYLYQLSEKVTQNGGHVWFAKTKEDATRYILQVAQRKNARKVVKSKSMVTEEI
HHHHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
GVNHVLQDAGIQVIETDLGEYILQLDQDPPSHVVVPAIHKDRHQIRRVLHERLGYEGPET
HHHHHHHHCCCEEEHHHHHHHHHHCCCCCCCCEEEECHHHHHHHHHHHHHHHHCCCCCCC
PEAMTLFIRQKIREDFLSAEIGITGCNFAVAETGSVCLVTNEGNARMCTTLPKTHIAVMG
HHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCEEEEECCCCCEEEECCCCHHHHHHH
MERIAPTFAEVDVLITMLARSAVGARLTGYNTWLTGPREAGHVDGPEEFHLVIVDNGRSE
HHHHHHHHHHHHHHHHHHHHHHHCCEECCCCCCCCCCCCCCCCCCCCCEEEEEEECCCHH
VLASEFRDVLRCIRCGACMNTCPAYRHIGGHGYGSIYPGPIGAVISPLLGGYKDFKDLPY
HHHHHHHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCCCCHHHHHHHHHHCCHHHHHCCHH
ACSLCTACDSVCPVRIPLSKLILRHRRVMAEKGITAKAEQRAIKMFAYANSHPGLWKVGM
HHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHEECCCCCEEEHHH
MAGAHAASWFINGGKTPLKFGAISDWMEARDLPEADGESFRSWFKKHQAQEKKNG
HHCCHHHHHEECCCCCCEECCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: Fe [C]

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 9278503 [H]