Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

The map label for this gene is ydeP [H]

Identifier: 157160976

GI number: 157160976

Start: 1601030

End: 1603309

Strand: Reverse

Name: ydeP [H]

Synonym: EcHS_A1585

Alternate gene names: 157160976

Gene position: 1603309-1601030 (Counterclockwise)

Preceding gene: 157160977

Following gene: 157160975

Centisome position: 34.53
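
The coordinate fields above agree with one another, as the following sketch suggests. It assumes 1-based inclusive coordinates, and it assumes that the centisome position is the first base of the gene (the End coordinate, since the gene is on the reverse strand) expressed as a percentage of genome length. Both assumptions are inferred from the numbers in this record, not stated by it, and the function names are illustrative.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_643_538
START, END = 1_601_030, 1_603_309


def gene_length(start, end):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def centisome(position, genome_length):
    """Position as a percentage of genome length, rounded to two decimals."""
    return round(100 * position / genome_length, 2)


print(gene_length(START, END))         # 2280
print(centisome(END, GENOME_LENGTH))   # 34.53
```

Using the Start coordinate instead gives 34.48, which is why the sketch takes the End coordinate as the reference point for a reverse-strand gene.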

GC content: 50.22
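
GC content is conventionally the percentage of G and C bases in a nucleotide sequence. A minimal sketch of that calculation follows; the function name is hypothetical, and the demonstration uses only the first twelve bases of the gene sequence, so its result is not the 50.22 reported for the full gene.

```python
def gc_content(sequence):
    """Percentage of G and C bases in a nucleotide sequence, two decimals."""
    sequence = sequence.upper()
    if not sequence:
        raise ValueError("empty sequence")
    gc = sequence.count("G") + sequence.count("C")
    return round(100 * gc / len(sequence), 2)


fragment = "ATGAAGAAAAAA"  # first twelve bases of the gene sequence
print(gc_content(fragment))  # 16.67
```

Applying the same function to the full 2280-base sequence, with line breaks removed, should give a value comparable to the GC content field.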

Gene sequence:

>2280_bases
ATGAAGAAAAAAATTGAATCCTACCAGGGTGCTGCAGGTGGTTGGGGTGCTGTTAAATCCGTAGCGAATGCAGTACGTAA
GCAGATGGATATACGCCAGGATGTTATTGCCATGTTTGACATGAATAAGCCAGAGGGCTTTGACTGTCCGGGTTGTGCAT
GGCCAGATCCTAAGCACAGTGCGTCATTCGACATTTGTGAAAACGGCGCAAAAGCAATCGCCTGGGAAGTCACGGATAAG
CAGGTAAACGCCTCTTTCTTTGCTGAGAATACGGTTCAATCATTACTTACCTGGGGAGACCACGAGCTTGAGGCTGCGGG
GCGACTCACTCAGCCTTTGAAATATGATGCCGTCAGCGACTGTTACAAGCCATTAAGCTGGCAACAAGCTTTCGACGAAA
TTGGCGCACGCCTTCAAAGCTATAGTGATCCCAATCAGGTTGAATTCTATACTTCGGGCCGCACTTCCAATGAAGCTGCC
TTTCTTTATCAGCTTTTTGCCCGTGAATACGGGAGCAATAACTTTCCCGACTGCTCCAACATGTGCCATGAACCGACAAG
CGTGGGTTTGGCAGCGAGTATCGGTGTAGGTAAAGGGACCGTGTTGCTGGAAGACTTTGAGAAGTGCGATTTAGTCATTT
GCATTGGGCATAACCCTGGTACAAACCACCCTCGCATGCTGACTTCGTTGCGCGCTTTAGTGAAACGGGGAGCGAAAATG
ATCGCCATCAATCCTCTACAGGAACGTGGCCTGGAGCGATTTACCGCACCGCAAAACCCGTTTGAAATGCTGACGAACTC
TGAGACTCAGTTGGCCAGTGCCTACTATAACGTGCGCATTGGTGGCGATATGGCGTTGCTCAAGGGGATGATGCGCCTGT
TAATTGAGCGCGATGATGCTGCAAGCGCCGCAGGTCGGCCTTCATTGCTTGATGACGAGTTTATTCAAACGCATACCGTC
GGCTTTGACGAGCTACGCCGTGACGTTCTCAATTCCGAGTGGAAAGATATCGAACGTATTTCTGGACTAAGTCAGACACA
AATCGCCGAACTGGCTGACGCATATGCCGCTGCCGAACGCACCATTATCTGTTACGGAATGGGGATCACTCAGCACGAAC
ATGGTACCCAGAACGTACAGCAACTGGTCAATCTGCTGTTGATGAAAGGTAACATTGGCAAGCCTGGTGCGGGTATCTGC
CCACTACGTGGACACTCTAATGTACAGGGCGACCGAACCGTCGGTATCACCGAGAAACCGTCTGCAGAGTTTCTGGCTCG
TCTGGGTGAGCGCTATGGCTTCACCCCACCTCATGCACCTGGACATGCTGCAATTGCCAGCATGCAAGCAATATGTACGG
GGCAGGCTCGAGCATTGATCTGCATGGGGGGCAATTTTGCGCTGGCAATGCCAGATCGGGAAGCGAGCGCTGTACCGTTA
ACGCAATTAGATTTGGCGGTACACGTAGCCACTAAGCTTAACCGCTCTCATCTGTTGACCGCACGGCATAGCTATATTCT
GCCGGTCCTGGGACGTAGCGAGATTGACATGCAAAAAAACGGTGCGCAGGCGGTAACCGTTGAGGATTCAATGTCGATGA
TTCATGCCTCGCGTGGCGTGTTAAAACCCGCCGGTGTAATGCTGAAATCAGAGTGTGCAGTGGTCGCGGGAATCGCGCAG
GCAGCACTACCCCAGAGCGTGGTAGCCTGGGAGTATCTGGTGGAAGATTATGATCGCATTCGCAATGACATTGAAGCTGT
GCTGCCAGAGTTCGCCGACTATAACCAGCGCATCCGTCATCCCGGTGGTTTTCACCTGATAAATGCAGCTGCTGAAAGGC
GCTGGATGACGCCGTCAGGTAAGGCTAATTTCATTACCAGCAAAGGGCTGTTAGAAGATCCATCTTCAGCGTTTAACAGT
AAGCTGGTCATGGCGACAGTACGCAGCCACGATCAGTACAACACGACGATTTATGGTATGGATGATCGCTATCGAGGGGT
ATTCGGTCAACGAGATGTGGTCTTTATGAGTGCTAAACAAGCTAAAATTTGCCGTGTAAAAAACGGCGAAAGAGTTAATC
TTATTGCGCTTACGCCAGACGGTAAGCGCAGCTCACGCCACATGGATAGATTAAAAGTGGTCATTTACCCTATGGCTGAC
CGCTCACTGGTGACCTATTTTCCAGAATCGAATCACATGCTAACACTTGATAACCACGATCCATTAAGTGGCATTCCTGG
CTATAAAAGTATTCCGGTTGAATTAGAACCATCAAATTAA

Upstream 100 bases:

>100_bases
CCTACTCTTATATATCCATGTTGGCGATAATCCCTTTGTATGTACTGTGCATCATCGCTATTACAAATCCTAATAATTCA
TTTCCACACAGGATAAGTAG

Downstream 100 bases:

>100_bases
TGCCTCTTCTCATTTCTTCTGCTGTCATCCGCACAGCAGAAGAATTCCTCATTGACTATTATTTCGCAATTTGCTCACAT
GGATTAAATTAAACTACATA

Product: putative oxidoreductase

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 759; Mature: 759
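
The residue count follows from the gene length: 2280 bases form 760 codons, and the last codon (TAA) is a stop codon that is not translated, leaving 759 residues. A small sketch of that arithmetic, with a hypothetical function name:

```python
def protein_length(n_bases, has_stop_codon=True):
    """Number of residues encoded by a coding sequence of n_bases."""
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = n_bases // 3
    # The terminal stop codon does not contribute a residue.
    return codons - 1 if has_stop_codon else codons


print(protein_length(2280))  # 759
```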

Protein sequence:

>759_residues
MKKKIESYQGAAGGWGAVKSVANAVRKQMDIRQDVIAMFDMNKPEGFDCPGCAWPDPKHSASFDICENGAKAIAWEVTDK
QVNASFFAENTVQSLLTWGDHELEAAGRLTQPLKYDAVSDCYKPLSWQQAFDEIGARLQSYSDPNQVEFYTSGRTSNEAA
FLYQLFAREYGSNNFPDCSNMCHEPTSVGLAASIGVGKGTVLLEDFEKCDLVICIGHNPGTNHPRMLTSLRALVKRGAKM
IAINPLQERGLERFTAPQNPFEMLTNSETQLASAYYNVRIGGDMALLKGMMRLLIERDDAASAAGRPSLLDDEFIQTHTV
GFDELRRDVLNSEWKDIERISGLSQTQIAELADAYAAAERTIICYGMGITQHEHGTQNVQQLVNLLLMKGNIGKPGAGIC
PLRGHSNVQGDRTVGITEKPSAEFLARLGERYGFTPPHAPGHAAIASMQAICTGQARALICMGGNFALAMPDREASAVPL
TQLDLAVHVATKLNRSHLLTARHSYILPVLGRSEIDMQKNGAQAVTVEDSMSMIHASRGVLKPAGVMLKSECAVVAGIAQ
AALPQSVVAWEYLVEDYDRIRNDIEAVLPEFADYNQRIRHPGGFHLINAAAERRWMTPSGKANFITSKGLLEDPSSAFNS
KLVMATVRSHDQYNTTIYGMDDRYRGVFGQRDVVFMSAKQAKICRVKNGERVNLIALTPDGKRSSRHMDRLKVVIYPMAD
RSLVTYFPESNHMLTLDNHDPLSGIPGYKSIPVELEPSN

Sequences:

>Translated_759_residues
MKKKIESYQGAAGGWGAVKSVANAVRKQMDIRQDVIAMFDMNKPEGFDCPGCAWPDPKHSASFDICENGAKAIAWEVTDK
QVNASFFAENTVQSLLTWGDHELEAAGRLTQPLKYDAVSDCYKPLSWQQAFDEIGARLQSYSDPNQVEFYTSGRTSNEAA
FLYQLFAREYGSNNFPDCSNMCHEPTSVGLAASIGVGKGTVLLEDFEKCDLVICIGHNPGTNHPRMLTSLRALVKRGAKM
IAINPLQERGLERFTAPQNPFEMLTNSETQLASAYYNVRIGGDMALLKGMMRLLIERDDAASAAGRPSLLDDEFIQTHTV
GFDELRRDVLNSEWKDIERISGLSQTQIAELADAYAAAERTIICYGMGITQHEHGTQNVQQLVNLLLMKGNIGKPGAGIC
PLRGHSNVQGDRTVGITEKPSAEFLARLGERYGFTPPHAPGHAAIASMQAICTGQARALICMGGNFALAMPDREASAVPL
TQLDLAVHVATKLNRSHLLTARHSYILPVLGRSEIDMQKNGAQAVTVEDSMSMIHASRGVLKPAGVMLKSECAVVAGIAQ
AALPQSVVAWEYLVEDYDRIRNDIEAVLPEFADYNQRIRHPGGFHLINAAAERRWMTPSGKANFITSKGLLEDPSSAFNS
KLVMATVRSHDQYNTTIYGMDDRYRGVFGQRDVVFMSAKQAKICRVKNGERVNLIALTPDGKRSSRHMDRLKVVIYPMAD
RSLVTYFPESNHMLTLDNHDPLSGIPGYKSIPVELEPSN
>Mature_759_residues
MKKKIESYQGAAGGWGAVKSVANAVRKQMDIRQDVIAMFDMNKPEGFDCPGCAWPDPKHSASFDICENGAKAIAWEVTDK
QVNASFFAENTVQSLLTWGDHELEAAGRLTQPLKYDAVSDCYKPLSWQQAFDEIGARLQSYSDPNQVEFYTSGRTSNEAA
FLYQLFAREYGSNNFPDCSNMCHEPTSVGLAASIGVGKGTVLLEDFEKCDLVICIGHNPGTNHPRMLTSLRALVKRGAKM
IAINPLQERGLERFTAPQNPFEMLTNSETQLASAYYNVRIGGDMALLKGMMRLLIERDDAASAAGRPSLLDDEFIQTHTV
GFDELRRDVLNSEWKDIERISGLSQTQIAELADAYAAAERTIICYGMGITQHEHGTQNVQQLVNLLLMKGNIGKPGAGIC
PLRGHSNVQGDRTVGITEKPSAEFLARLGERYGFTPPHAPGHAAIASMQAICTGQARALICMGGNFALAMPDREASAVPL
TQLDLAVHVATKLNRSHLLTARHSYILPVLGRSEIDMQKNGAQAVTVEDSMSMIHASRGVLKPAGVMLKSECAVVAGIAQ
AALPQSVVAWEYLVEDYDRIRNDIEAVLPEFADYNQRIRHPGGFHLINAAAERRWMTPSGKANFITSKGLLEDPSSAFNS
KLVMATVRSHDQYNTTIYGMDDRYRGVFGQRDVVFMSAKQAKICRVKNGERVNLIALTPDGKRSSRHMDRLKVVIYPMAD
RSLVTYFPESNHMLTLDNHDPLSGIPGYKSIPVELEPSN

Specific function: Probably involved in acid resistance [H]

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metabolic importance: Non Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the prokaryotic molybdopterin-containing oxidoreductase family [H]

Homologues:

Organism=Escherichia coli, GI1787778, Length=759, Percent_Identity=99.8682476943347, Blast_Score=1587, Evalue=0.0,
Organism=Escherichia coli, GI3868721, Length=308, Percent_Identity=30.8441558441558, Blast_Score=155, Evalue=8e-39,
Organism=Escherichia coli, GI3868720, Length=376, Percent_Identity=25.7978723404255, Blast_Score=107, Evalue=4e-24,
Organism=Escherichia coli, GI3868719, Length=391, Percent_Identity=25.3196930946292, Blast_Score=104, Evalue=3e-23,
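
Each homologue line is a comma-separated list of key=value fields, with the GI number given as a bare token. The sketch below parses one such line and converts the percent identity back into a residue count. The function names are hypothetical, and treating Length as the length over which identity was computed is an assumption.

```python
def parse_homologue(line):
    """Parse one 'Organism=..., GI..., Length=..., ...' line into a dict."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # the trailing comma leaves an empty field
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = value
        elif field.startswith("GI"):
            record["GI"] = field[2:]
    return record


def identical_residues(record):
    """Identical positions implied by Percent_Identity and Length (assumed)."""
    return round(float(record["Percent_Identity"]) / 100 * int(record["Length"]))


line = ("Organism=Escherichia coli, GI1787778, Length=759, "
        "Percent_Identity=99.8682476943347, Blast_Score=1587, Evalue=0.0,")
hit = parse_homologue(line)
print(hit["GI"], identical_residues(hit))  # 1787778 758
```

Under that assumption, the first hit differs from this protein at a single position out of 759.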

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR009010
- InterPro:   IPR006657
- InterPro:   IPR006656
- InterPro:   IPR010046 [H]

Pfam domain/function: PF00384 Molybdopterin; PF01568 Molydop_binding [H]

EC number: NA

Molecular weight: Translated: 83477; Mature: 83477

Theoretical pI: Translated: 6.70; Mature: 6.70

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.8 %Cys     (Translated Protein)
3.6 %Met     (Translated Protein)
5.4 %Cys+Met (Translated Protein)
1.8 %Cys     (Mature Protein)
3.6 %Met     (Mature Protein)
5.4 %Cys+Met (Mature Protein)
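
These percentages are residue counts divided by protein length. A minimal sketch of the calculation, with a hypothetical function name, shown on the first ten residues of the protein and not on the full 759-residue sequence:

```python
def residue_percent(protein, residues):
    """Percentage of the protein made up of the given one-letter residues."""
    if not protein:
        raise ValueError("empty protein sequence")
    hits = sum(protein.count(r) for r in residues)
    return round(100 * hits / len(protein), 1)


fragment = "MKKKIESYQG"  # first ten residues of the protein sequence
print(residue_percent(fragment, "C"))   # 0.0
print(residue_percent(fragment, "M"))   # 10.0
print(residue_percent(fragment, "CM"))  # 10.0
```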

Secondary structure:

>Translated Secondary Structure
MKKKIESYQGAAGGWGAVKSVANAVRKQMDIRQDVIAMFDMNKPEGFDCPGCAWPDPKHS
CCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC
ASFDICENGAKAIAWEVTDKQVNASFFAENTVQSLLTWGDHELEAAGRLTQPLKYDAVSD
CCCHHHCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCCHHHHHH
CYKPLSWQQAFDEIGARLQSYSDPNQVEFYTSGRTSNEAAFLYQLFAREYGSNNFPDCSN
HHCCCCHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCHHH
MCHEPTSVGLAASIGVGKGTVLLEDFEKCDLVICIGHNPGTNHPRMLTSLRALVKRGAKM
HCCCCCCCCEEEECCCCCCEEEEECCCCCCEEEEECCCCCCCCHHHHHHHHHHHHCCCEE
IAINPLQERGLERFTAPQNPFEMLTNSETQLASAYYNVRIGGDMALLKGMMRLLIERDDA
EEECCHHHCCHHHHCCCCCHHHHHCCCCHHHEEEEEEEEECCCHHHHHHHHHHHHCCCCC
ASAAGRPSLLDDEFIQTHTVGFDELRRDVLNSEWKDIERISGLSQTQIAELADAYAAAER
CCCCCCCCCCCCHHHHHHCCCHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHHHHHCC
TIICYGMGITQHEHGTQNVQQLVNLLLMKGNIGKPGAGICPLRGHSNVQGDRTVGITEKP
EEEEEECCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCC
SAEFLARLGERYGFTPPHAPGHAAIASMQAICTGQARALICMGGNFALAMPDREASAVPL
HHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCEEEEECCCEEEECCCCCCCCCCH
TQLDLAVHVATKLNRSHLLTARHSYILPVLGRSEIDMQKNGAQAVTVEDSMSMIHASRGV
HHHHHHHHHHHHCCHHHEEEECCCEEEEECCCCCCCCCCCCCEEEEEHHHHHHHHHHCCC
LKPAGVMLKSECAVVAGIAQAALPQSVVAWEYLVEDYDRIRNDIEAVLPEFADYNQRIRH
CCCCCCEECCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHCC
PGGFHLINAAAERRWMTPSGKANFITSKGLLEDPSSAFNSKLVMATVRSHDQYNTTIYGM
CCCEEEEEHHHHCCCCCCCCCCCEEECCCCCCCCHHHHCCEEEEEEECCCCCCCCEEEEC
DDRYRGVFGQRDVVFMSAKQAKICRVKNGERVNLIALTPDGKRSSRHMDRLKVVIYPMAD
CCHHCCCCCCCCEEEEECCCCEEEEECCCCEEEEEEECCCCCCHHHHHCEEEEEEEECCC
RSLVTYFPESNHMLTLDNHDPLSGIPGYKSIPVELEPSN
CCEEEECCCCCCEEEECCCCCCCCCCCCCCCCEEECCCC
>Mature Secondary Structure
MKKKIESYQGAAGGWGAVKSVANAVRKQMDIRQDVIAMFDMNKPEGFDCPGCAWPDPKHS
CCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC
ASFDICENGAKAIAWEVTDKQVNASFFAENTVQSLLTWGDHELEAAGRLTQPLKYDAVSD
CCCHHHCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCCHHHHHH
CYKPLSWQQAFDEIGARLQSYSDPNQVEFYTSGRTSNEAAFLYQLFAREYGSNNFPDCSN
HHCCCCHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCHHH
MCHEPTSVGLAASIGVGKGTVLLEDFEKCDLVICIGHNPGTNHPRMLTSLRALVKRGAKM
HCCCCCCCCEEEECCCCCCEEEEECCCCCCEEEEECCCCCCCCHHHHHHHHHHHHCCCEE
IAINPLQERGLERFTAPQNPFEMLTNSETQLASAYYNVRIGGDMALLKGMMRLLIERDDA
EEECCHHHCCHHHHCCCCCHHHHHCCCCHHHEEEEEEEEECCCHHHHHHHHHHHHCCCCC
ASAAGRPSLLDDEFIQTHTVGFDELRRDVLNSEWKDIERISGLSQTQIAELADAYAAAER
CCCCCCCCCCCCHHHHHHCCCHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHHHHHCC
TIICYGMGITQHEHGTQNVQQLVNLLLMKGNIGKPGAGICPLRGHSNVQGDRTVGITEKP
EEEEEECCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCC
SAEFLARLGERYGFTPPHAPGHAAIASMQAICTGQARALICMGGNFALAMPDREASAVPL
HHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCEEEEECCCEEEECCCCCCCCCCH
TQLDLAVHVATKLNRSHLLTARHSYILPVLGRSEIDMQKNGAQAVTVEDSMSMIHASRGV
HHHHHHHHHHHHCCHHHEEEECCCEEEEECCCCCCCCCCCCCEEEEEHHHHHHHHHHCCC
LKPAGVMLKSECAVVAGIAQAALPQSVVAWEYLVEDYDRIRNDIEAVLPEFADYNQRIRH
CCCCCCEECCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHCC
PGGFHLINAAAERRWMTPSGKANFITSKGLLEDPSSAFNSKLVMATVRSHDQYNTTIYGM
CCCEEEEEHHHHCCCCCCCCCCCEEECCCCCCCCHHHHCCEEEEEEECCCCCCCCEEEEC
DDRYRGVFGQRDVVFMSAKQAKICRVKNGERVNLIALTPDGKRSSRHMDRLKVVIYPMAD
CCHHCCCCCCCCEEEEECCCCEEEEECCCCEEEEEEECCCCCCHHHHHCEEEEEEEECCC
RSLVTYFPESNHMLTLDNHDPLSGIPGYKSIPVELEPSN
CCEEEECCCCCCEEEECCCCCCCCCCCCCCCCEEECCCC
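
The prediction lines pair each residue with a one-letter state. Assuming the usual three-state convention (H for helix, E for strand, C for coil), which this record does not spell out, the share of each state can be tallied as in the sketch below. The function name is hypothetical, and the example uses only the first ten states of the prediction.

```python
from collections import Counter


def state_fractions(states):
    """Percentage of positions in each of the states H, E and C."""
    if not states:
        raise ValueError("empty state string")
    counts = Counter(states)
    return {s: round(100 * counts[s] / len(states), 1) for s in "HEC"}


first_ten = "CCCCHHHHCC"  # first ten states of the prediction above
print(state_fractions(first_ten))  # {'H': 40.0, 'E': 0.0, 'C': 60.0}
```

Concatenating all state lines of one block before calling the function would give the overall composition, which could be compared with the Alpha Beta structure class reported below.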

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: 11206551; 11258796 [H]