Definition | Escherichia coli HS, complete genome. |
---|---|
Accession | NC_009800 |
Length | 4,643,538 |
The map label for this gene is yggW [H]
Identifier: 157162416
GI number: 157162416
Start: 3126185
End: 3127321
Strand: Direct
Name: yggW [H]
Synonym: EcHS_A3115
Alternate gene names: 157162416
Gene position: 3126185-3127321 (Clockwise)
Preceding gene: 157162415
Following gene: 157162423
Centisome position: 67.32
GC content: 53.21
Gene sequence:
>1137_bases ATGGTTAAATTACCGCCGCTGAGTCTCTACATTCACATCCCGTGGTGCGTGCAGAAATGCCCGTACTGCGATTTCAACTC TCACGCGTTGAAAGGAGAAGTGCCGCACGACGATTACGTTCAGCATCTGCTTTGCGATCTGGACAATGATGTGGCTTACG CTCAGGGCCGTGAAGTAAAGACAATCTTTATTGGCGGTGGTACGCCGAGCCTGCTTTCCGGCCCGGCGATGCAAACGCTG CTGGACGGCGTGCGTGCGCGTTTGCCGCTGGCAGCGGATGCAGAAATTACTATGGAAGCGAACCCTGGTACGGTAGAAGC CGATCGCTTTGTCGATTATCAGCGTGCTGGTGTGAACCGCATCTCTATTGGTGTGCAGAGTTTTAGCGAAGAAAAGCTGA AACGACTTGGGCGCATTCATGGCCCGCAAGAAGCGAAACGAGCGGCAAAGCTGGCGAGCGGTTTAGGGTTACGTAGCTTT AACCTTGATTTGATGCATGGACTACCGGATCAATCACTGGAAGAGGCGCTTGGCGATCTACGCCAGGCCATTGAACTGAA TCCGCCGCATCTTTCCTGGTATCAACTGACCATCGAACCCAATACGCTGTTTGGTTCGCGACCACCGGTGCTGCCGGACG ACGACGCGCTGTGGGATATATTCGAACAGGGGCATCAGTTATTAACCGCAGCGGGTTATCAGCAATATGAAACTTCCGCT TACGCCAAACCCAGTTATCAGTGCCAGCACAATCTCAACTACTGGCGCTTTGGCGACTACATCGGTATTGGCTGCGGCGC GCACGGCAAAGTGACCTTCCCGGATGGGCGCATTCTTCGTACCACTAAAACGCGTCATCCGCGTGGTTTTATGCAAGGAA GGTATCTGGAAAGCCAGCGTGATGTCGAAGCCGCAGATAAGCCGTTTGAGTTCTTTATGAACCGCTTTCGCTTGCTGGAG GCCGCGCCACGCGCAGAGTTTAGTGCGTATACCGGGCTTTGCGAAGATGTGATTCGCCCACAGTTAGACGAGGCGATTGC CCAGGGTTATCTCACCGAATGTGCGGATTACTGGCAGATAACGGAACATGGGAAGCTGTTTTTAAATTCGCTGCTGGAGC TTTTTCTGGCTGAGTAA
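The coordinate fields above are related by simple arithmetic, which the following minimal sketch reproduces. It assumes that the gene length is `End - Start + 1`, that the centisome position is the start coordinate expressed as a percentage of the genome length, and that the open reading frame ends with a single stop codon; the record itself does not state these conventions, and the function names are illustrative only.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene with inclusive 1-based coordinates (assumed)."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start coordinate as a percentage of genome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


def protein_length(n_bases: int) -> int:
    """Residues encoded by an ORF, assuming one terminal stop codon."""
    return n_bases // 3 - 1


def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Spaces, line breaks, and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return round(100.0 * gc / len(bases), 2)


# Values taken from this record.
print(gene_length(3126185, 3127321))   # 1137
print(centisome(3126185, 4643538))     # 67.32
print(protein_length(1137))            # 378
```

Passing the gene sequence listed above to `gc_content` should allow the reported GC content to be checked in the same way.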
Upstream 100 bases:
>100_bases CCTTCCGAAGGGAAAACCGCTGCCGAACTGACCCGCGAAGAAAAGAGCGCCATTTCCCACCGTGGTCAGGCATTGAAACT GCTGTTGGACGCTTTACGTA
Downstream 100 bases:
>100_bases ACTTGTATTGCCGGATGCGGCGTGAACGCCTTATCCAGCCGACATGTGGCAGCGGTTGTAGGTCTGATAAGACGCGCAAG CGTCGCATCAGACGTTGATT
Product: coproporphyrinogen III oxidase
Products: Protoporphyrinogen IX; CO2; H2O
Alternate protein names: NA
Number of amino acids: Translated: 378; Mature: 378
Protein sequence:
>378_residues MVKLPPLSLYIHIPWCVQKCPYCDFNSHALKGEVPHDDYVQHLLCDLDNDVAYAQGREVKTIFIGGGTPSLLSGPAMQTL LDGVRARLPLAADAEITMEANPGTVEADRFVDYQRAGVNRISIGVQSFSEEKLKRLGRIHGPQEAKRAAKLASGLGLRSF NLDLMHGLPDQSLEEALGDLRQAIELNPPHLSWYQLTIEPNTLFGSRPPVLPDDDALWDIFEQGHQLLTAAGYQQYETSA YAKPSYQCQHNLNYWRFGDYIGIGCGAHGKVTFPDGRILRTTKTRHPRGFMQGRYLESQRDVEAADKPFEFFMNRFRLLE AAPRAEFSAYTGLCEDVIRPQLDEAIAQGYLTECADYWQITEHGKLFLNSLLELFLAE
Sequences:
>Translated_378_residues MVKLPPLSLYIHIPWCVQKCPYCDFNSHALKGEVPHDDYVQHLLCDLDNDVAYAQGREVKTIFIGGGTPSLLSGPAMQTL LDGVRARLPLAADAEITMEANPGTVEADRFVDYQRAGVNRISIGVQSFSEEKLKRLGRIHGPQEAKRAAKLASGLGLRSF NLDLMHGLPDQSLEEALGDLRQAIELNPPHLSWYQLTIEPNTLFGSRPPVLPDDDALWDIFEQGHQLLTAAGYQQYETSA YAKPSYQCQHNLNYWRFGDYIGIGCGAHGKVTFPDGRILRTTKTRHPRGFMQGRYLESQRDVEAADKPFEFFMNRFRLLE AAPRAEFSAYTGLCEDVIRPQLDEAIAQGYLTECADYWQITEHGKLFLNSLLELFLAE
>Mature_378_residues MVKLPPLSLYIHIPWCVQKCPYCDFNSHALKGEVPHDDYVQHLLCDLDNDVAYAQGREVKTIFIGGGTPSLLSGPAMQTL LDGVRARLPLAADAEITMEANPGTVEADRFVDYQRAGVNRISIGVQSFSEEKLKRLGRIHGPQEAKRAAKLASGLGLRSF NLDLMHGLPDQSLEEALGDLRQAIELNPPHLSWYQLTIEPNTLFGSRPPVLPDDDALWDIFEQGHQLLTAAGYQQYETSA YAKPSYQCQHNLNYWRFGDYIGIGCGAHGKVTFPDGRILRTTKTRHPRGFMQGRYLESQRDVEAADKPFEFFMNRFRLLE AAPRAEFSAYTGLCEDVIRPQLDEAIAQGYLTECADYWQITEHGKLFLNSLLELFLAE
Specific function: Unknown
COG id: COG0635
COG function: function code H; Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the anaerobic coproporphyrinogen-III oxidase family [H]
Homologues:
Organism=Homo sapiens, GI8922911, Length=284, Percent_Identity=35.5633802816901, Blast_Score=171, Evalue=9e-43
Organism=Escherichia coli, GI1789325, Length=378, Percent_Identity=98.6772486772487, Blast_Score=775, Evalue=0.0
Organism=Escherichia coli, GI87082341, Length=244, Percent_Identity=32.7868852459016, Blast_Score=128, Evalue=7e-31
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR006638
- InterPro: IPR004559
- InterPro: IPR010723
- InterPro: IPR007197 [H]
Pfam domain/function: PF06969 HemN_C; PF04055 Radical_SAM [H]
EC number: 1.3.3.3
Molecular weight: Translated: 42520; Mature: 42520
Theoretical pI: Translated: 5.41; Mature: 5.41
Prosite motif: NA
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
2.1 %Cys (Translated Protein)
1.6 %Met (Translated Protein)
3.7 %Cys+Met (Translated Protein)
2.1 %Cys (Mature Protein)
1.6 %Met (Mature Protein)
3.7 %Cys+Met (Mature Protein)
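The Cys/Met percentages can be recomputed from the protein sequence with a short helper. This is a sketch under the assumption that each value is the residue count divided by the total sequence length, rounded to one decimal place; `residue_percent` is a hypothetical name, not part of any tool referenced by this record.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any letter in `residues`.

    Whitespace is ignored so that sequences copied with spaces or line
    breaks can be passed in directly.
    """
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty protein sequence")
    wanted = set(residues.upper())
    hits = sum(1 for aa in seq if aa in wanted)
    return round(100.0 * hits / len(seq), 1)


# Toy example: 1 Cys and 1 Met in a 10-residue peptide.
toy = "MCAAAAAAAA"
print(residue_percent(toy, "C"))    # 10.0
print(residue_percent(toy, "M"))    # 10.0
print(residue_percent(toy, "CM"))   # 20.0
```

Calling `residue_percent(sequence, "C")`, `residue_percent(sequence, "M")`, and `residue_percent(sequence, "CM")` on the 378-residue sequence from this record is one way to check the values listed above.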
Secondary structure:
>Translated Secondary Structure
MVKLPPLSLYIHIPWCVQKCPYCDFNSHALKGEVPHDDYVQHLLCDLDNDVAYAQGREVK
CCCCCCEEEEEECCHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEECCCCCEEE
TIFIGGGTPSLLSGPAMQTLLDGVRARLPLAADAEITMEANPGTVEADRFVDYQRAGVNR
EEEECCCCCHHHCCHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCHHHHHHHHHCCCCE
ISIGVQSFSEEKLKRLGRIHGPQEAKRAAKLASGLGLRSFNLDLMHGLPDQSLEEALGDL
EEHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEECHHHHCCCCCHHHHHHHHHH
RQAIELNPPHLSWYQLTIEPNTLFGSRPPVLPDDDALWDIFEQGHQLLTAAGYQQYETSA
HHHHCCCCCCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCC
YAKPSYQCQHNLNYWRFGDYIGIGCGAHGKVTFPDGRILRTTKTRHPRGFMQGRYLESQR
CCCCCCCHHCCCCEEEECCEEEECCCCCCEEECCCCCEEEECCCCCCCCHHCCCCCCCCC
DVEAADKPFEFFMNRFRLLEAAPRAEFSAYTGLCEDVIRPQLDEAIAQGYLTECADYWQI
CCHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
TEHGKLFLNSLLELFLAE
HHHHHHHHHHHHHHHHCC
>Mature Secondary Structure
MVKLPPLSLYIHIPWCVQKCPYCDFNSHALKGEVPHDDYVQHLLCDLDNDVAYAQGREVK
CCCCCCEEEEEECCHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEECCCCCEEE
TIFIGGGTPSLLSGPAMQTLLDGVRARLPLAADAEITMEANPGTVEADRFVDYQRAGVNR
EEEECCCCCHHHCCHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCHHHHHHHHHCCCCE
ISIGVQSFSEEKLKRLGRIHGPQEAKRAAKLASGLGLRSFNLDLMHGLPDQSLEEALGDL
EEHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEECHHHHCCCCCHHHHHHHHHH
RQAIELNPPHLSWYQLTIEPNTLFGSRPPVLPDDDALWDIFEQGHQLLTAAGYQQYETSA
HHHHCCCCCCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHCC
YAKPSYQCQHNLNYWRFGDYIGIGCGAHGKVTFPDGRILRTTKTRHPRGFMQGRYLESQR
CCCCCCCHHCCCCEEEECCEEEECCCCCCEEECCCCCEEEECCCCCCCCHHCCCCCCCCC
DVEAADKPFEFFMNRFRLLEAAPRAEFSAYTGLCEDVIRPQLDEAIAQGYLTECADYWQI
CCHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
TEHGKLFLNSLLELFLAE
HHHHHHHHHHHHHHHHCC
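The secondary structure strings above can be summarised as per-state percentages. The sketch below assumes the common three-state convention (H = helix, E = strand, C = coil), which this record does not define explicitly; `ss_composition` is an illustrative name.

```python
from collections import Counter


def ss_composition(ss: str) -> dict:
    """Percentage of H, E and C states in a secondary structure string.

    Whitespace is ignored; any character other than H, E or C raises an
    error so that amino acid lines are not mixed in by accident.
    """
    states = "".join(ss.split()).upper()
    if not states:
        raise ValueError("empty secondary structure string")
    unexpected = set(states) - set("HEC")
    if unexpected:
        raise ValueError(f"unexpected state characters: {sorted(unexpected)}")
    counts = Counter(states)
    return {s: round(100.0 * counts[s] / len(states), 1) for s in "HEC"}


# Toy example with two helix, one strand and one coil position.
print(ss_composition("HHEC"))   # {'H': 50.0, 'E': 25.0, 'C': 25.0}
```

Only the state lines (those made up of C, E and H) should be passed in, concatenated in order; the alternating residue lines need to be filtered out first.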
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: Coproporphyrinogen III; O2; H+
Specific reaction: coproporphyrinogen-III + O2 + 2 H+ = protoporphyrinogen-IX + 2 CO2 + 2 H2O
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 9278503 [H]