Definition Prochlorococcus marinus str. MIT 9301, complete genome.
Accession NC_009091
Length 1,641,879

Click here to switch to the map view.

The map label for this gene is petH [H]

Identifier: 126696520

GI number: 126696520

Start: 994779

End: 995744

Strand: Reverse

Name: petH [H]

Synonym: P9301_11821

Alternate gene names: 126696520

Gene position: 995744-994779 (Counterclockwise)

Preceding gene: 126696522

Following gene: 126696519

Centisome position: 60.65

GC content: 36.44

Gene sequence:

>966_bases
GTGGAACCTAAAAAAGCCATCGTTTCAGAAACTGAAGCTCCAAAAACAGAGGCTCCAAAAGTAGTTAAGAAAAAACATGC
AGATGTACCAGTAAATATTTATCGTCCTAAGACACCCTACGAGGGAACCGTCATTGAAAACTATAGCCTTCTCAAAGAAG
GAGCAATTGGTAGAGTTAATCATATAACTTTCGACCTTAAAGATAGTGATCCATTTTTAAATTATGTAGAAGGCCAAAGT
ATTGGTATTATGCCTGCCGGTGAAGATGCTAATGGAAAACCTCACAAATTAAGACTTTATTCAATAGCTAGTACTAGACA
CGGAGATGACTTTAATGGAAATACAGTTTCTCTTTGTGTAAGACAGCTTCAATATGAAAAAGATGGTGAAACCATTAATG
GTGTCTGCTCTACTTACTTATGTGATATTAAGCCAGGAGATAAAGTAAAAATAACAGGTCCTGTAGGTAAAGAAATGCTT
CTCCCTGAGGAAGAGGATGCGAACATTGTTATGTTGGCCACAGGAACTGGAATAGCACCAATGAGGGCTTATTTAAGAAG
AATGTTCGAACCAACTGAAAAAGAAAAAAATAAATGGAATTTCAAAGGTAAAGCTTGGTTATTTATGGGTGCTCCAAAAT
CAGCTAATTTGTTATACGAGGAAGATCTTCAAAGATACCTTTCTGATTATCCTGATAATTTCAAATATACAAAAGCTATT
AGTCGCGAGCAGCAAAATACAAAAGGTGGAAGAATGTACATTCAGGACAGAGTTTTAGAGTCAGCCAATGAACTTTTCAA
CATGATTGAAGATGAAAAGACACACATATATCTTTGTGGATTAAAGGGTATGGAACCTGGAATAGATGAAGCAATGACTA
AGGCAGCAGAAGAAAAAGGCTTGAACTGGTCAGAACTAAGACCTCAACTAAAAAAAGCAGGGAGATGGCACGTAGAAACT
TACTAA

Upstream 100 bases:

>100_bases
AGTAGTAATAGCTGTTTTTTATTTTATACTCACAACTTTCAATAAAAGAGCTTTAAAATTTGTTGAAGAAGCCAAAACAA
AAAAACCTGAGGTAAAAGCT

Downstream 100 bases:

>100_bases
ACTTTGATATTTAGATTTAAGTTTAACCTTCTAAATACTTTTTGGTTTAGACACAATATGTAGCTAATATTTGAAAAAAA
GAGGATTTTAAATAAAGAAT

Product: ferredoxin-NADP oxidoreductase (FNR)

Products: NA

Alternate protein names: FNR [H]

Number of amino acids: Translated: 321; Mature: 321

Protein sequence:

>321_residues
MEPKKAIVSETEAPKTEAPKVVKKKHADVPVNIYRPKTPYEGTVIENYSLLKEGAIGRVNHITFDLKDSDPFLNYVEGQS
IGIMPAGEDANGKPHKLRLYSIASTRHGDDFNGNTVSLCVRQLQYEKDGETINGVCSTYLCDIKPGDKVKITGPVGKEML
LPEEEDANIVMLATGTGIAPMRAYLRRMFEPTEKEKNKWNFKGKAWLFMGAPKSANLLYEEDLQRYLSDYPDNFKYTKAI
SREQQNTKGGRMYIQDRVLESANELFNMIEDEKTHIYLCGLKGMEPGIDEAMTKAAEEKGLNWSELRPQLKKAGRWHVET
Y

Sequences:

>Translated_321_residues
MEPKKAIVSETEAPKTEAPKVVKKKHADVPVNIYRPKTPYEGTVIENYSLLKEGAIGRVNHITFDLKDSDPFLNYVEGQS
IGIMPAGEDANGKPHKLRLYSIASTRHGDDFNGNTVSLCVRQLQYEKDGETINGVCSTYLCDIKPGDKVKITGPVGKEML
LPEEEDANIVMLATGTGIAPMRAYLRRMFEPTEKEKNKWNFKGKAWLFMGAPKSANLLYEEDLQRYLSDYPDNFKYTKAI
SREQQNTKGGRMYIQDRVLESANELFNMIEDEKTHIYLCGLKGMEPGIDEAMTKAAEEKGLNWSELRPQLKKAGRWHVET
Y
>Mature_321_residues
MEPKKAIVSETEAPKTEAPKVVKKKHADVPVNIYRPKTPYEGTVIENYSLLKEGAIGRVNHITFDLKDSDPFLNYVEGQS
IGIMPAGEDANGKPHKLRLYSIASTRHGDDFNGNTVSLCVRQLQYEKDGETINGVCSTYLCDIKPGDKVKITGPVGKEML
LPEEEDANIVMLATGTGIAPMRAYLRRMFEPTEKEKNKWNFKGKAWLFMGAPKSANLLYEEDLQRYLSDYPDNFKYTKAI
SREQQNTKGGRMYIQDRVLESANELFNMIEDEKTHIYLCGLKGMEPGIDEAMTKAAEEKGLNWSELRPQLKKAGRWHVET
Y

Specific function: Essential for growth [H]

COG id: COG0369

COG function: function code P; Sulfite reductase, alpha subunit (flavoprotein)

Gene ontology:

Cell location: Cellular thylakoid membrane; Peripheral membrane protein; Cytoplasmic side. Note=May be bound to the thylakoid membrane or anchored to the thylakoid-bound phycobilisomes [H]

Metaboloic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Contains 1 FAD-binding FR-type domain [H]

Homologues:

Organism=Escherichia coli, GI1789123, Length=228, Percent_Identity=29.3859649122807, Blast_Score=86, Evalue=2e-18,
Organism=Caenorhabditis elegans, GI17554134, Length=209, Percent_Identity=28.2296650717703, Blast_Score=73, Evalue=2e-13,
Organism=Saccharomyces cerevisiae, GI6321832, Length=168, Percent_Identity=29.1666666666667, Blast_Score=70, Evalue=5e-13,
Organism=Saccharomyces cerevisiae, GI6321143, Length=237, Percent_Identity=29.535864978903, Blast_Score=70, Evalue=6e-13,
Organism=Drosophila melanogaster, GI24582192, Length=267, Percent_Identity=27.3408239700375, Blast_Score=76, Evalue=3e-14,
Organism=Drosophila melanogaster, GI17137192, Length=267, Percent_Identity=26.9662921348315, Blast_Score=75, Evalue=6e-14,
Organism=Drosophila melanogaster, GI24583543, Length=212, Percent_Identity=28.7735849056604, Blast_Score=74, Evalue=1e-13,
Organism=Drosophila melanogaster, GI78706876, Length=212, Percent_Identity=28.7735849056604, Blast_Score=74, Evalue=1e-13,
Organism=Drosophila melanogaster, GI78706872, Length=212, Percent_Identity=28.7735849056604, Blast_Score=74, Evalue=1e-13,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR017927
- InterPro:   IPR001709
- InterPro:   IPR012146
- InterPro:   IPR015701
- InterPro:   IPR008333
- InterPro:   IPR001433
- InterPro:   IPR008213
- InterPro:   IPR017938 [H]

Pfam domain/function: PF01383 CpcD; PF00970 FAD_binding_6; PF00175 NAD_binding_1 [H]

EC number: =1.18.1.2 [H]

Molecular weight: Translated: 36350; Mature: 36350

Theoretical pI: Translated: 6.80; Mature: 6.80

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.2 %Cys     (Translated Protein)
3.4 %Met     (Translated Protein)
4.7 %Cys+Met (Translated Protein)
1.2 %Cys     (Mature Protein)
3.4 %Met     (Mature Protein)
4.7 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MEPKKAIVSETEAPKTEAPKVVKKKHADVPVNIYRPKTPYEGTVIENYSLLKEGAIGRVN
CCCHHHHHCCCCCCCCCCCHHHHHHCCCCCEEEECCCCCCCCEEECCHHHHHCCCCCEEE
HITFDLKDSDPFLNYVEGQSIGIMPAGEDANGKPHKLRLYSIASTRHGDDFNGNTVSLCV
EEEEEECCCCCCCEEECCCEEEEEECCCCCCCCCCEEEEEEEECCCCCCCCCCCHHHHHH
RQLQYEKDGETINGVCSTYLCDIKPGDKVKITGPVGKEMLLPEEEDANIVMLATGTGIAP
HHHHHCCCCCHHHHHHHHEEEECCCCCEEEEECCCCCCCCCCCCCCCCEEEEECCCCCHH
MRAYLRRMFEPTEKEKNKWNFKGKAWLFMGAPKSANLLYEEDLQRYLSDYPDNFKYTKAI
HHHHHHHHCCCCHHHCCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHCCCCCCHHHHH
SREQQNTKGGRMYIQDRVLESANELFNMIEDEKTHIYLCGLKGMEPGIDEAMTKAAEEKG
HHHHCCCCCCEEEEHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHHHC
LNWSELRPQLKKAGRWHVETY
CCHHHHHHHHHHCCCEEECCC
>Mature Secondary Structure
MEPKKAIVSETEAPKTEAPKVVKKKHADVPVNIYRPKTPYEGTVIENYSLLKEGAIGRVN
CCCHHHHHCCCCCCCCCCCHHHHHHCCCCCEEEECCCCCCCCEEECCHHHHHCCCCCEEE
HITFDLKDSDPFLNYVEGQSIGIMPAGEDANGKPHKLRLYSIASTRHGDDFNGNTVSLCV
EEEEEECCCCCCCEEECCCEEEEEECCCCCCCCCCEEEEEEEECCCCCCCCCCCHHHHHH
RQLQYEKDGETINGVCSTYLCDIKPGDKVKITGPVGKEMLLPEEEDANIVMLATGTGIAP
HHHHHCCCCCHHHHHHHHEEEECCCCCEEEEECCCCCCCCCCCCCCCCEEEEECCCCCHH
MRAYLRRMFEPTEKEKNKWNFKGKAWLFMGAPKSANLLYEEDLQRYLSDYPDNFKYTKAI
HHHHHHHHCCCCHHHCCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHCCCCCCHHHHH
SREQQNTKGGRMYIQDRVLESANELFNMIEDEKTHIYLCGLKGMEPGIDEAMTKAAEEKG
HHHHCCCCCCEEEEHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHHHC
LNWSELRPQLKKAGRWHVETY
CCHHHHHHHHHHCCCEEECCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 7.0

TargetDB status: NA

Availability: NA

References: 8905231 [H]