Definition Escherichia coli HS, complete genome.
Accession NC_009800
Length 4,643,538

Click here to switch to the map view.

The map label for this gene is yceA [C]

Identifier: 157160582

GI number: 157160582

Start: 1180483

End: 1181535

Strand: Direct

Name: yceA [C]

Synonym: EcHS_A1178

Alternate gene names: 157160582

Gene position: 1180483-1181535 (Clockwise)

Preceding gene: 157160580

Following gene: 157160593

Centisome position: 25.42

GC content: 49.95

Gene sequence:

>1053_bases
ATGCCAGTGTTACACAACCGCATTTCCAACGACGCGCTAAAAGCCAAAATGTTGGCTGAGAGCGAACCGCGAACCACCAT
TTCGTTTTACAAGTATTTCCACATCGCCGATCCTAAGGCGACCCGTGACGCTTTATATCAGCTGTTTACCGCGCTGAATG
TTTTTGGGCGAGTGTATCTGGCGCATGAGGGCATTAACGCGCAAATCAGCGTACCTGCGAGCAATGTTGAAACATTTCGC
GCGCAGCTCTATGCCTTCGACTCGGCTTTAGATGGCTTACGCCTGAATATCGCGTTGGATGATGACGGGAAATCCTTCTG
GGTACTGCGCATGAAGGTCCGCGATCGTATCGTTGCCGACGGTATTGACGATCCTCACTTTGATGCCAGCAATGTTGGTG
AGTATCTGCAAGCGGCGGAAGTGAACGCCATGCTTGACGATCCCGATGCACTGTTTATCGACATGCGTAACCACTATGAG
TATGAAGTGGGGCACTTTGAAAACGCGCTCGAAATTCCGGCAGATACCTTCCGTGAGCAGCTGCCAAAAGCAGTCGAGAT
GATGCAGGCACATAAAGATAAAAAAATCGTCATGTACTGCACCGGCGGCATTCGTTGTGAAAAGGCCAGTGCCTGGATGA
AACATAACGGATTCAACAAAGTCTGGCATATCGAGGGCGGAATTATTGAATACGCCCGTAAGGCGCGCGAGCAGGGCTTG
CCGGTGCGTTTTATTGGCAAAAATTTTGTTTTTGACGAGCGGATGGGCGAACGTATATCGGATGAGATTATCGCGCATTG
CCACCAGTGCGGCGCGCCGTGCGACAGCCATACCAACTGTAAAAATGATGGCTGCCACCTGCTGTTTATTCAGTGTCCAG
TATGCGCGGAAAAATACAAAGGTTGTTGTAGTGAGATTTGCTGCGAAGAAAGCGCGTTACCGCCAGAAGAACAGCGACGC
CGTCGGGCAGGACGTGAAAATGGCAATAAGATCTTTAATAAGTCTCGTGGACGTCTGAATACAACACTGGGCATTCCTGA
TCCAACAGAATAA

Upstream 100 bases:

>100_bases
AATGCGCAAATGTAGCGTAAAATGTGTGGATGTTAATTATCGATAATTGCTATATCATGCCGCGGATTTTTACTTTCCCA
TCTCGCAGGAACCGTACACC

Downstream 100 bases:

>100_bases
ATATCATTGCCGGATGCGTGCCATCCGGCAACATTTCACGCTTACTTCTGCTGTACGCCTTCCACTGAAATAATCAGATC
CACATCCTGAGAAGCTGGAC

Product: hypothetical protein

Products: NA

Alternate protein names: NA

Number of amino acids: Translated: 350; Mature: 349

Protein sequence:

>350_residues
MPVLHNRISNDALKAKMLAESEPRTTISFYKYFHIADPKATRDALYQLFTALNVFGRVYLAHEGINAQISVPASNVETFR
AQLYAFDSALDGLRLNIALDDDGKSFWVLRMKVRDRIVADGIDDPHFDASNVGEYLQAAEVNAMLDDPDALFIDMRNHYE
YEVGHFENALEIPADTFREQLPKAVEMMQAHKDKKIVMYCTGGIRCEKASAWMKHNGFNKVWHIEGGIIEYARKAREQGL
PVRFIGKNFVFDERMGERISDEIIAHCHQCGAPCDSHTNCKNDGCHLLFIQCPVCAEKYKGCCSEICCEESALPPEEQRR
RRAGRENGNKIFNKSRGRLNTTLGIPDPTE

Sequences:

>Translated_350_residues
MPVLHNRISNDALKAKMLAESEPRTTISFYKYFHIADPKATRDALYQLFTALNVFGRVYLAHEGINAQISVPASNVETFR
AQLYAFDSALDGLRLNIALDDDGKSFWVLRMKVRDRIVADGIDDPHFDASNVGEYLQAAEVNAMLDDPDALFIDMRNHYE
YEVGHFENALEIPADTFREQLPKAVEMMQAHKDKKIVMYCTGGIRCEKASAWMKHNGFNKVWHIEGGIIEYARKAREQGL
PVRFIGKNFVFDERMGERISDEIIAHCHQCGAPCDSHTNCKNDGCHLLFIQCPVCAEKYKGCCSEICCEESALPPEEQRR
RRAGRENGNKIFNKSRGRLNTTLGIPDPTE
>Mature_349_residues
PVLHNRISNDALKAKMLAESEPRTTISFYKYFHIADPKATRDALYQLFTALNVFGRVYLAHEGINAQISVPASNVETFRA
QLYAFDSALDGLRLNIALDDDGKSFWVLRMKVRDRIVADGIDDPHFDASNVGEYLQAAEVNAMLDDPDALFIDMRNHYEY
EVGHFENALEIPADTFREQLPKAVEMMQAHKDKKIVMYCTGGIRCEKASAWMKHNGFNKVWHIEGGIIEYARKAREQGLP
VRFIGKNFVFDERMGERISDEIIAHCHQCGAPCDSHTNCKNDGCHLLFIQCPVCAEKYKGCCSEICCEESALPPEEQRRR
RAGRENGNKIFNKSRGRLNTTLGIPDPTE

Specific function: Unknown. [C]

COG id: COG1054

COG function: function code R; Predicted sulfurtransferase

Gene ontology:

Cell location: Cytoplasm [C]

Metaboloic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 rhodanese domain [H]

Homologues:

Organism=Homo sapiens, GI111038120, Length=298, Percent_Identity=28.8590604026846, Blast_Score=120, Evalue=1e-27,
Organism=Escherichia coli, GI1787294, Length=350, Percent_Identity=99.1428571428571, Blast_Score=727, Evalue=0.0,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR001763
- InterPro:   IPR020936 [H]

Pfam domain/function: PF00581 Rhodanese [H]

EC number: NA

Molecular weight: Translated: 39712; Mature: 39580

Theoretical pI: Translated: 6.44; Mature: 6.44

Prosite motif: PS50206 RHODANESE_3

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

3.7 %Cys     (Translated Protein)
2.9 %Met     (Translated Protein)
6.6 %Cys+Met (Translated Protein)
3.7 %Cys     (Mature Protein)
2.6 %Met     (Mature Protein)
6.3 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MPVLHNRISNDALKAKMLAESEPRTTISFYKYFHIADPKATRDALYQLFTALNVFGRVYL
CCCCHHHHCHHHHHHHHHHCCCCCCEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHEEEE
AHEGINAQISVPASNVETFRAQLYAFDSALDGLRLNIALDDDGKSFWVLRMKVRDRIVAD
EECCCCEEEECCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCEEEEEEEEHHHHHHHC
GIDDPHFDASNVGEYLQAAEVNAMLDDPDALFIDMRNHYEYEVGHFENALEIPADTFREQ
CCCCCCCCHHHHHHHHHHHHHHHEECCCCEEEEEECCCCEEECCCCCCCCCCCHHHHHHH
LPKAVEMMQAHKDKKIVMYCTGGIRCEKASAWMKHNGFNKVWHIEGGIIEYARKAREQGL
HHHHHHHHHHCCCCEEEEEECCCEEECHHHHHHHCCCCCEEEEECCCHHHHHHHHHHCCC
PVRFIGKNFVFDERMGERISDEIIAHCHQCGAPCDSHTNCKNDGCHLLFIQCPVCAEKYK
CEEECCCCCEEHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEECCHHHHHHH
GCCSEICCEESALPPEEQRRRRAGRENGNKIFNKSRGRLNTTLGIPDPTE
HHHHHHHHCCCCCCCHHHHHHHCCCCCCCHHHHCCCCCEEEEECCCCCCC
>Mature Secondary Structure 
PVLHNRISNDALKAKMLAESEPRTTISFYKYFHIADPKATRDALYQLFTALNVFGRVYL
CCCHHHHCHHHHHHHHHHCCCCCCEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHEEEE
AHEGINAQISVPASNVETFRAQLYAFDSALDGLRLNIALDDDGKSFWVLRMKVRDRIVAD
EECCCCEEEECCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCEEEEEEEEHHHHHHHC
GIDDPHFDASNVGEYLQAAEVNAMLDDPDALFIDMRNHYEYEVGHFENALEIPADTFREQ
CCCCCCCCHHHHHHHHHHHHHHHEECCCCEEEEEECCCCEEECCCCCCCCCCCHHHHHHH
LPKAVEMMQAHKDKKIVMYCTGGIRCEKASAWMKHNGFNKVWHIEGGIIEYARKAREQGL
HHHHHHHHHHCCCCEEEEEECCCEEECHHHHHHHCCCCCEEEEECCCHHHHHHHHHHCCC
PVRFIGKNFVFDERMGERISDEIIAHCHQCGAPCDSHTNCKNDGCHLLFIQCPVCAEKYK
CEEECCCCCEEHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEECCHHHHHHH
GCCSEICCEESALPPEEQRRRRAGRENGNKIFNKSRGRLNTTLGIPDPTE
HHHHHHHHCCCCCCCHHHHHHHCCCCCCCHHHHCCCCCEEEEECCCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA