Definition Escherichia coli O157:H7 str. EC4115, complete genome.
Accession NC_011353
Length 5,572,075


The map label for this gene is cysJ [H]

Identifier: 209400763

GI number: 209400763

Start: 3714614

End: 3716413

Strand: Reverse

Name: cysJ [H]

Synonym: ECH74115_4018

Alternate gene names: 209400763

Gene position: 3716413-3714614 (Counterclockwise)

Preceding gene: 209396000

Following gene: 209400070

Centisome position: 66.7
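
The length and centisome values above can be cross-checked from the coordinates. This is a minimal sketch, assuming that Start and End are 1-based inclusive coordinates and that the centisome position is the coordinate expressed as a percentage of the genome length; the record does not say which end of the gene it uses, and the function names are illustrative only.

```python
# Values copied from the Length, Start and End fields of this record.
GENOME_LENGTH = 5_572_075
START, END = 3_714_614, 3_716_413


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming 1-based inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length, to one decimal place."""
    return round(100.0 * position / genome_length, 1)


if __name__ == "__main__":
    print(gene_length(START, END))          # number of bases in the gene
    print(centisome(START, GENOME_LENGTH))  # centisome of the lower coordinate
    print(centisome(END, GENOME_LENGTH))    # centisome of the upper coordinate
```

Under these assumptions both ends of the gene round to the same centisome value, so the choice of coordinate does not affect the reported figure.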

GC content: 56.61

Gene sequence:

>1800_bases
ATGACGACACAGGTCCCACCTTCCGCGTTGCTTCCGTTGAACCCGGAGCAACTGGCACGCCTTCAGGCGGCCACGACCGA
TTTAACTCCCACCCAGCTTGCCTGGGTTTCTGGCTATTTCTGGGGCGTGCTCAATCAGCAGCCTGCTGCGCTTGCAGCGA
CGCCAGCGCCAGCCGCAGAAATGCCGGGTATAACTATTATCTCTGCTTCGCAAACCGGCAATGCGCGCCGGGTTGCTGAA
GCATTACGCGATGATTTATTGACGGCAAAACTGAACGTTAAGCTGGTGAACGCGGGCGACTATAAATTCAAACAAATCGC
CAGCGAAAAACTGCTTATCGTGGTGACGTCAACGCAAGGAGAAGGGGAACCGCCGGAAGAAGCCGTCGCGCTGCATAAGT
TCCTGTTCTCCAAAAAAGCGCCGAAGCTGGAAAATACCGCTTTTGCCGTGTTTAGCCTCGGCGATAGTTCTTATGAATTT
TTCTGCCAGTCCGGGAAAGATTTCGACAGCAAGCTGGCGGAACTGGGCGGTGAACGCCTGCTCGACCGTGTCGACGCCGA
CGTTGAATACCAGGCTGCTGCCAGCGAGTGGCGCGCCCGCGTGGTTGATGCGCTTAAATCGCGTGCGCCTGTCGCGGCAC
CTTCGCAATCCGTCGCTACTGGCGTGGTAAATGAAATCCACACCAGCCCGTACAGCAAAGACGCGCCGCTGGTAGCGAGC
CTTTCGGTTAACCAGAAAATTACCGGGCGTAACTCTGAAAAAGACGTTCGCCATATCGAAATTGACTTAGGTGACTCGGG
CCTGCGTTACCAGCCGGGTGACGCGCTGGGTGTCTGGTATCAGAACGATCCGGCACTGGTGAAAGAACTTGTCGAACTGC
TGTGGCTGAAAGGCGATGAACCTGTCACCGTCGAGGGCAAAACGTTGCCTCTGAACGAAGCGCTACAGTGGCACTTCGAA
CTGACCGTCAACACCGCCAATATTGTTGAGAACTATGCCACGCTTACCCGCAGCGAAACGCTGTTGCCGCTGGTGGGCGA
TAAAGCGAAGTTACAGCATTACGCCGCGACTACGCCGATTGTCGACATGGTGCGCTTCTCTCCGGCGCAACTGGACGCCG
AAGCGCTGATTAATCTGCTGCGCCCGCTGACGCCGCGTCTGTATTCCATCGCCTCCTCGCAGGAGGAAGTCGAGAACGAA
GTACACGTCACCGTTGGTGTGGTGCGTTACGACGTGGAAGGCCGCGCCCGTGCCGGTGGTGCCTCCAGCTTCCTCGCGGA
CCGCGTGGAAGAAGAGGGCGAAGTTCGCGTATTTATCGAACATAACGATAACTTCCGCCTGCCCGCTAACCCGGAAACCC
CGGTGATTATGATTGGCCCAGGCACCGGCATCGCGCCGTTCCGCGCCTTTATGCAGCAGCGCGCCGCTGACGAAGCGCCG
GGTAAAAACTGGCTGTTCTTTGGCAACCCGCACTTTACGGAAGATTTCCTCTACCAGGTGGAGTGGCAGCGTTACGTCAA
AGAGGGCGTGCTGACGCGTATCGATCTTGCCTGGTCGCGCGATCAAAAAGAAAAAGTTTACGTACAAGACAAACTGCGCG
AACAGGGCGCGGAGCTGTGGCGCTGGATCAATGACGGTGCCCACATTTATGTCTGCGGCGACGCTAATCGCATGGCGAAA
GACGTTGAGCAGGCACTTCTGGAAGTGATTGCCGAATTTGGTGGCATGGACACCGAAGCGGCGGATGAATTTTTAAGTGA
GCTGCGCGTAGAGCGCCGTTATCAGCGAGATGTCTACTAA
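
The GC content and the protein sequence reported in this record can be recomputed from the gene sequence above. The following is a sketch, assuming the sequence lines are joined into one string, that the sequence is already given in coding orientation (it starts with ATG), and that the standard codon assignments apply; `gc_content` and `translate` are hypothetical helper names, not part of any database tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def gc_content(dna: str) -> float:
    """Percentage of G and C bases, rounded to two decimal places."""
    dna = dna.upper()
    gc = dna.count("G") + dna.count("C")
    return round(100.0 * gc / len(dna), 2)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First six codons of the gene sequence in this record.
    print(translate("ATGACGACACAGGTCCCA"))
    # Last four codons, ending in the TAA stop codon.
    print(translate("GATGTCTACTAA"))
```

To apply this to the whole gene, join the 80-column sequence lines (without the `>1800_bases` header) into a single string before calling either function.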

Upstream 100 bases:

>100_bases
TTAATCCACACCGTTCACCCCGTTAACCTTACCTTCTCTTCTGTTTTATGGGCGCTGACAGGGCGCAGAAACAGCTTTGC
TTACTGGAACATAACGACGC

Downstream 100 bases:

>100_bases
TGAGCGAAAAACATCCAGGGCCTTTAGTGGTCGAAGGAAAACTGACAGACGCCGAGCGCATGAAGCTTGAAAGCAACTAC
CTGCGCGGCACCATTGCGGA

Product: sulfite reductase subunit alpha

Products: NA

Alternate protein names: SiR-FP [H]

Number of amino acids: Translated: 599; Mature: 598

Protein sequence:

>599_residues
MTTQVPPSALLPLNPEQLARLQAATTDLTPTQLAWVSGYFWGVLNQQPAALAATPAPAAEMPGITIISASQTGNARRVAE
ALRDDLLTAKLNVKLVNAGDYKFKQIASEKLLIVVTSTQGEGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDSSYEF
FCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDALKSRAPVAAPSQSVATGVVNEIHTSPYSKDAPLVAS
LSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVEGKTLPLNEALQWHFE
LTVNTANIVENYATLTRSETLLPLVGDKAKLQHYAATTPIVDMVRFSPAQLDAEALINLLRPLTPRLYSIASSQEEVENE
VHVTVGVVRYDVEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPETPVIMIGPGTGIAPFRAFMQQRAADEAP
GKNWLFFGNPHFTEDFLYQVEWQRYVKEGVLTRIDLAWSRDQKEKVYVQDKLREQGAELWRWINDGAHIYVCGDANRMAK
DVEQALLEVIAEFGGMDTEAADEFLSELRVERRYQRDVY

Sequences:

>Translated_599_residues
MTTQVPPSALLPLNPEQLARLQAATTDLTPTQLAWVSGYFWGVLNQQPAALAATPAPAAEMPGITIISASQTGNARRVAE
ALRDDLLTAKLNVKLVNAGDYKFKQIASEKLLIVVTSTQGEGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDSSYEF
FCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDALKSRAPVAAPSQSVATGVVNEIHTSPYSKDAPLVAS
LSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVEGKTLPLNEALQWHFE
LTVNTANIVENYATLTRSETLLPLVGDKAKLQHYAATTPIVDMVRFSPAQLDAEALINLLRPLTPRLYSIASSQEEVENE
VHVTVGVVRYDVEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPETPVIMIGPGTGIAPFRAFMQQRAADEAP
GKNWLFFGNPHFTEDFLYQVEWQRYVKEGVLTRIDLAWSRDQKEKVYVQDKLREQGAELWRWINDGAHIYVCGDANRMAK
DVEQALLEVIAEFGGMDTEAADEFLSELRVERRYQRDVY
>Mature_598_residues
TTQVPPSALLPLNPEQLARLQAATTDLTPTQLAWVSGYFWGVLNQQPAALAATPAPAAEMPGITIISASQTGNARRVAEA
LRDDLLTAKLNVKLVNAGDYKFKQIASEKLLIVVTSTQGEGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDSSYEFF
CQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDALKSRAPVAAPSQSVATGVVNEIHTSPYSKDAPLVASL
SVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVEGKTLPLNEALQWHFEL
TVNTANIVENYATLTRSETLLPLVGDKAKLQHYAATTPIVDMVRFSPAQLDAEALINLLRPLTPRLYSIASSQEEVENEV
HVTVGVVRYDVEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPETPVIMIGPGTGIAPFRAFMQQRAADEAPG
KNWLFFGNPHFTEDFLYQVEWQRYVKEGVLTRIDLAWSRDQKEKVYVQDKLREQGAELWRWINDGAHIYVCGDANRMAKD
VEQALLEVIAEFGGMDTEAADEFLSELRVERRYQRDVY

Specific function: Component of the sulfite reductase complex that catalyzes the 6-electron reduction of sulfite to sulfide. This is one of several activities required for the biosynthesis of L-cysteine from sulfate. The flavoprotein component catalyzes the electron flow f

COG id: COG0369

COG function: function code P; Sulfite reductase, alpha subunit (flavoprotein)

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 flavodoxin-like domain [H]

Homologues:

Organism=Homo sapiens, GI127139033, Length=601, Percent_Identity=30.4492512479201, Blast_Score=244, Evalue=2e-64,
Organism=Homo sapiens, GI7657393, Length=601, Percent_Identity=29.9500831946755, Blast_Score=231, Evalue=1e-60,
Organism=Homo sapiens, GI221316709, Length=601, Percent_Identity=29.9500831946755, Blast_Score=226, Evalue=3e-59,
Organism=Homo sapiens, GI24041029, Length=597, Percent_Identity=30.820770519263, Blast_Score=225, Evalue=9e-59,
Organism=Homo sapiens, GI221316705, Length=610, Percent_Identity=29.5081967213115, Blast_Score=224, Evalue=2e-58,
Organism=Homo sapiens, GI10835173, Length=651, Percent_Identity=29.6466973886329, Blast_Score=206, Evalue=5e-53,
Organism=Homo sapiens, GI221316707, Length=530, Percent_Identity=29.622641509434, Blast_Score=199, Evalue=4e-51,
Organism=Homo sapiens, GI40254422, Length=649, Percent_Identity=27.4268104776579, Blast_Score=168, Evalue=1e-41,
Organism=Homo sapiens, GI169790956, Length=437, Percent_Identity=28.604118993135, Blast_Score=161, Evalue=2e-39,
Organism=Homo sapiens, GI169790958, Length=437, Percent_Identity=28.604118993135, Blast_Score=160, Evalue=3e-39,
Organism=Escherichia coli, GI1789123, Length=599, Percent_Identity=99.1652754590985, Blast_Score=1218, Evalue=0.0,
Organism=Caenorhabditis elegans, GI17554134, Length=636, Percent_Identity=29.0880503144654, Blast_Score=233, Evalue=3e-61,
Organism=Caenorhabditis elegans, GI17566446, Length=583, Percent_Identity=26.5866209262436, Blast_Score=147, Evalue=1e-35,
Organism=Caenorhabditis elegans, GI17531441, Length=394, Percent_Identity=25.3807106598985, Blast_Score=97, Evalue=3e-20,
Organism=Saccharomyces cerevisiae, GI6321832, Length=640, Percent_Identity=28.75, Blast_Score=208, Evalue=3e-54,
Organism=Saccharomyces cerevisiae, GI6321143, Length=397, Percent_Identity=31.4861460957179, Blast_Score=166, Evalue=1e-41,
Organism=Saccharomyces cerevisiae, GI6325305, Length=626, Percent_Identity=23.3226837060703, Blast_Score=131, Evalue=4e-31,
Organism=Drosophila melanogaster, GI17137192, Length=624, Percent_Identity=28.2051282051282, Blast_Score=218, Evalue=1e-56,
Organism=Drosophila melanogaster, GI24582192, Length=510, Percent_Identity=30, Blast_Score=199, Evalue=4e-51,
Organism=Drosophila melanogaster, GI78706872, Length=524, Percent_Identity=32.4427480916031, Blast_Score=185, Evalue=1e-46,
Organism=Drosophila melanogaster, GI24583543, Length=524, Percent_Identity=32.4427480916031, Blast_Score=184, Evalue=1e-46,
Organism=Drosophila melanogaster, GI78706876, Length=524, Percent_Identity=32.4427480916031, Blast_Score=184, Evalue=1e-46,
Organism=Drosophila melanogaster, GI24660907, Length=593, Percent_Identity=26.4755480607083, Blast_Score=155, Evalue=9e-38,
Organism=Drosophila melanogaster, GI24660903, Length=593, Percent_Identity=26.4755480607083, Blast_Score=155, Evalue=9e-38,
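
The homologue lines above follow a regular comma-separated layout, so they can be turned into records for sorting or filtering. This is a minimal sketch, assuming the layout seen here (key=value fields, a bare `GI<number>` field, and a trailing comma); `parse_homologue` is a hypothetical helper name.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line of this record into a dictionary."""
    # Fields that should be converted from text to numbers.
    converters = {
        "Length": int,
        "Percent_Identity": float,
        "Blast_Score": int,
        "Evalue": float,
    }
    record = {}
    for field in line.strip().rstrip(",").split(","):
        field = field.strip()
        if not field:
            continue
        if "=" in field:
            key, value = field.split("=", 1)
            record[key] = converters.get(key, str)(value)
        elif field.startswith("GI"):
            # The GI number is written without an '=' sign in this record.
            record["GI"] = field[2:]
    return record


if __name__ == "__main__":
    line = ("Organism=Escherichia coli, GI1789123, Length=599, "
            "Percent_Identity=99.1652754590985, Blast_Score=1218, Evalue=0.0,")
    print(parse_homologue(line))
```

With the lines parsed, the hits can be ranked, for example with `sorted(records, key=lambda r: r["Evalue"])`.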

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR010199
- InterPro:   IPR003097
- InterPro:   IPR017927
- InterPro:   IPR001094
- InterPro:   IPR008254
- InterPro:   IPR001709
- InterPro:   IPR023173
- InterPro:   IPR001433
- InterPro:   IPR017938 [H]

Pfam domain/function: PF00667 FAD_binding_1; PF00258 Flavodoxin_1; PF00175 NAD_binding_1 [H]

EC number: 1.8.1.2 [H]

Molecular weight: Translated: 66382; Mature: 66251

Theoretical pI: Translated: 4.63; Mature: 4.63

Prosite motif: PS50902 FLAVODOXIN_LIKE

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.3 %Cys     (Translated Protein)
1.2 %Met     (Translated Protein)
1.5 %Cys+Met (Translated Protein)
0.3 %Cys     (Mature Protein)
1.0 %Met     (Mature Protein)
1.3 %Cys+Met (Mature Protein)
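
The Cys/Met percentages above are residue counts divided by sequence length. The sketch below shows one way to recompute them, assuming the mature protein is the translated protein without its initiator methionine (as the Mature_598_residues sequence in this record suggests) and that values are rounded to one decimal place; both function names are illustrative.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of the protein made up of the given one-letter residues."""
    protein = protein.upper()
    count = sum(protein.count(r) for r in residues.upper())
    return round(100.0 * count / len(protein), 1)


def mature_form(translated: str) -> str:
    """Drop the initiator methionine, if present (assumed processing step)."""
    return translated[1:] if translated.startswith("M") else translated


if __name__ == "__main__":
    toy = "MACMK"  # toy sequence, not taken from this record
    print(residue_percent(toy, "C"))                # %Cys
    print(residue_percent(toy, "M"))                # %Met
    print(residue_percent(mature_form(toy), "CM"))  # %Cys+Met of the mature form
```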

Secondary structure:

>Translated Secondary Structure
MTTQVPPSALLPLNPEQLARLQAATTDLTPTQLAWVSGYFWGVLNQQPAALAATPAPAAE
CCCCCCCCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCEEEECCCCHHH
MPGITIISASQTGNARRVAEALRDDLLTAKLNVKLVNAGDYKFKQIASEKLLIVVTSTQG
CCCEEEEECCCCCCHHHHHHHHHHHHEEEEEEEEEEECCCCHHHHHCCCCEEEEEECCCC
EGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDSSYEFFCQSGKDFDSKLAELGGERL
CCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEECCCCHHHHCCCCCCHHHHHHHHCHHHH
LDRVDADVEYQAAASEWRARVVDALKSRAPVAAPSQSVATGVVNEIHTSPYSKDAPLVAS
HHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCCCEEEE
LSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDE
EECCCEECCCCCCCCCEEEEEEECCCCCEECCCCEEEEEECCCHHHHHHHHHHHHCCCCC
PVTVEGKTLPLNEALQWHFELTVNTANIVENYATLTRSETLLPLVGDKAKLQHYAATTPI
CEEECCCCCCCCCCEEEEEEEEEEHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHCCCCH
VDMVRFSPAQLDAEALINLLRPLTPRLYSIASSQEEVENEVHVTVGVVRYDVEGRARAGG
HHHHHCCCHHCCHHHHHHHHHCCCHHHHHHHCCHHHHCCCEEEEEEEEEEECCCCCCCCC
ASSFLADRVEEEGEVRVFIEHNDNFRLPANPETPVIMIGPGTGIAPFRAFMQQRAADEAP
HHHHHHHHHCCCCCEEEEEEECCCEECCCCCCCCEEEEECCCCHHHHHHHHHHHCCCCCC
GKNWLFFGNPHFTEDFLYQVEWQRYVKEGVLTRIDLAWSRDQKEKVYVQDKLREQGAELW
CCCEEEECCCCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHH
RWINDGAHIYVCGDANRMAKDVEQALLEVIAEFGGMDTEAADEFLSELRVERRYQRDVY
HHHCCCCEEEEECCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
TTQVPPSALLPLNPEQLARLQAATTDLTPTQLAWVSGYFWGVLNQQPAALAATPAPAAE
CCCCCCCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCEEEECCCCHHH
MPGITIISASQTGNARRVAEALRDDLLTAKLNVKLVNAGDYKFKQIASEKLLIVVTSTQG
CCCEEEEECCCCCCHHHHHHHHHHHHEEEEEEEEEEECCCCHHHHHCCCCEEEEEECCCC
EGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDSSYEFFCQSGKDFDSKLAELGGERL
CCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEECCCCHHHHCCCCCCHHHHHHHHCHHHH
LDRVDADVEYQAAASEWRARVVDALKSRAPVAAPSQSVATGVVNEIHTSPYSKDAPLVAS
HHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCCCEEEE
LSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDE
EECCCEECCCCCCCCCEEEEEEECCCCCEECCCCEEEEEECCCHHHHHHHHHHHHCCCCC
PVTVEGKTLPLNEALQWHFELTVNTANIVENYATLTRSETLLPLVGDKAKLQHYAATTPI
CEEECCCCCCCCCCEEEEEEEEEEHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHCCCCH
VDMVRFSPAQLDAEALINLLRPLTPRLYSIASSQEEVENEVHVTVGVVRYDVEGRARAGG
HHHHHCCCHHCCHHHHHHHHHCCCHHHHHHHCCHHHHCCCEEEEEEEEEEECCCCCCCCC
ASSFLADRVEEEGEVRVFIEHNDNFRLPANPETPVIMIGPGTGIAPFRAFMQQRAADEAP
HHHHHHHHHCCCCCEEEEEEECCCEECCCCCCCCEEEEECCCCHHHHHHHHHHHCCCCCC
GKNWLFFGNPHFTEDFLYQVEWQRYVKEGVLTRIDLAWSRDQKEKVYVQDKLREQGAELW
CCCEEEECCCCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHH
RWINDGAHIYVCGDANRMAKDVEQALLEVIAEFGGMDTEAADEFLSELRVERRYQRDVY
HHHCCCCEEEEECCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCC
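
The state rows of the secondary structure blocks above can be summarized as percentages per state. This is a sketch, assuming the common convention H = helix, E = strand, C = coil (the record does not define the letters) and that the state rows have been joined into one string, leaving out the residue rows; `ss_composition` is a hypothetical helper name.

```python
from collections import Counter


def ss_composition(states: str) -> dict:
    """Percentage of positions in each of the states H, E and C."""
    counts = Counter(states.upper())
    total = len(states)
    return {s: round(100.0 * counts.get(s, 0) / total, 1) for s in "HEC"}


if __name__ == "__main__":
    # First state row of the translated secondary structure in this record.
    row = "CCCCCCCCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCCEEEECCCCHHH"
    print(ss_composition(row))
```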

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA