Definition Shewanella sp. ANA-3 chromosome chromosome 1, complete sequence.
Accession NC_008577
Length 4,972,204

Click here to switch to the map view.

The map label for this gene is 117918723

Identifier: 117918723

GI number: 117918723

Start: 299571

End: 300554

Strand: Direct

Name: 117918723

Synonym: Shewana3_0266

Alternate gene names: NA

Gene position: 299571-300554 (Clockwise)

Preceding gene: 117918721

Following gene: 117918724

Centisome position: 6.02

GC content: 41.16

Gene sequence:

>984_bases
ATGCAGCAGCTATTCATTCAATTATTAGAAGAACGTTATTTCAACCGACTTCCTGAAAAGGTCGTTATGAAATTTGCAAC
AGAATCAGCTGATTTTGTTCGTTTTTGGCTAGTTCATTGTGTGCGGAGTTCTGCTTCAGAAAGAGAGTTGGCATTGGCTT
TGTTTTTTAGTGCAGAGTGGCAGTTAGAAGAGCTTTCTCGCTTAAAAGCGTTCTGGAATGATTATGTTGTGATTGAAAGT
CTTCAACAACATGAACGTTTGAAAGCGCGTTTTTGTGAGTTGAACCCTTGGTTAACAACCATCTCCCCTTATCATGCCGA
TGCTATTTGTGTCTTCTCTGCTTTATTGGCTTTTGAGCAAAGTCCTATATCAGTTGGACAGCAGCTACAATCGGCTAAGT
TGTTTTACTCCCAATTACCTTCGAACCTATTAGAAGAGGTTAAGGAATCCTCATTACCTTATTTACAACCCATTACAGTT
TATGGACATCAAGGTAAGGGGAGTATTGAAGCTGGGATCCGTAACAATATGCATTATCCGACACCCATACCTAATGGAAG
CGTTGCACTGGCTTGTGCAGAAAGAGTGACAGCAAAATTTTCAGGGATAAAGATTGCCTATGCAGAGCCAATGGTGGTGT
TACGGTACGAGCCGGGTCAGTTTTATCAATGGCATTATGATGCAATACATGCGCACACTTCCGAGATTAAAGCTGAATTG
GAACGTTTTGGGCAAAGGTGCCGAACAGGTATTTTATACCTTAATGATGATTTTCAAGGGGGAGAAACCGAGTTCAAAGC
GCCTTATATTCAAGTAAAGCCTCAGGCTGCTGCTATTTTAGTTTTTGATAATACGGATAAGTCAGGCAAACCAATACCTT
CCTCTTTGCATCGCGGTTGCGAAGTAACATCTGGTCATAAATGGGTTTGCACGCAATGGTTTAGGGATAAACCCTTTTGG
TTAAGATCGGGATTGCTAACCTAA

Upstream 100 bases:

>100_bases
GCCATTAAAGAATGGATTGTAAATAAAATGTTTCATTGAAAGTGTGGTTCAGTTTTAATGGATATAGTATTTGTAGTGTC
GGAAAGAGGACGTTCGTTTT

Downstream 100 bases:

>100_bases
CTTTAGGTATTTACTGAAATTGATTGCCGTTTTAAAGATGACTGCAGAAGGTTAAAATTATTTTATCCCTGACCAGTATT
TTATCTGCCAAGGTTAAGGC

Product: 2OG-Fe(II) oxygenase

Products: NA

Alternate protein names: Prolyl 4-Hydroxylase Alpha Subunit; Procollagen-Proline Dioxygenase; Oxidoreductase 2OG-Fe(II) Oxygenase Family; Procollagen-Proline 2-Oxoglutarate-4-Dioxygenase; Response Regulator Receiver Domain-Containing Protein; Oxygenase; 2OG-Fe(II) Oxygenase Family Oxidoreductase; 2OG-Fe(II) Oxygenase Family Protein; Procollagen-Proline 2-Oxoglutarate-4- Dioxygenase; Prolyl 4-Hydroxylase

Number of amino acids: Translated: 327; Mature: 327

Protein sequence:

>327_residues
MQQLFIQLLEERYFNRLPEKVVMKFATESADFVRFWLVHCVRSSASERELALALFFSAEWQLEELSRLKAFWNDYVVIES
LQQHERLKARFCELNPWLTTISPYHADAICVFSALLAFEQSPISVGQQLQSAKLFYSQLPSNLLEEVKESSLPYLQPITV
YGHQGKGSIEAGIRNNMHYPTPIPNGSVALACAERVTAKFSGIKIAYAEPMVVLRYEPGQFYQWHYDAIHAHTSEIKAEL
ERFGQRCRTGILYLNDDFQGGETEFKAPYIQVKPQAAAILVFDNTDKSGKPIPSSLHRGCEVTSGHKWVCTQWFRDKPFW
LRSGLLT

Sequences:

>Translated_327_residues
MQQLFIQLLEERYFNRLPEKVVMKFATESADFVRFWLVHCVRSSASERELALALFFSAEWQLEELSRLKAFWNDYVVIES
LQQHERLKARFCELNPWLTTISPYHADAICVFSALLAFEQSPISVGQQLQSAKLFYSQLPSNLLEEVKESSLPYLQPITV
YGHQGKGSIEAGIRNNMHYPTPIPNGSVALACAERVTAKFSGIKIAYAEPMVVLRYEPGQFYQWHYDAIHAHTSEIKAEL
ERFGQRCRTGILYLNDDFQGGETEFKAPYIQVKPQAAAILVFDNTDKSGKPIPSSLHRGCEVTSGHKWVCTQWFRDKPFW
LRSGLLT
>Mature_327_residues
MQQLFIQLLEERYFNRLPEKVVMKFATESADFVRFWLVHCVRSSASERELALALFFSAEWQLEELSRLKAFWNDYVVIES
LQQHERLKARFCELNPWLTTISPYHADAICVFSALLAFEQSPISVGQQLQSAKLFYSQLPSNLLEEVKESSLPYLQPITV
YGHQGKGSIEAGIRNNMHYPTPIPNGSVALACAERVTAKFSGIKIAYAEPMVVLRYEPGQFYQWHYDAIHAHTSEIKAEL
ERFGQRCRTGILYLNDDFQGGETEFKAPYIQVKPQAAAILVFDNTDKSGKPIPSSLHRGCEVTSGHKWVCTQWFRDKPFW
LRSGLLT

Specific function: Unknown

COG id: NA

COG function: NA

Gene ontology:

Cell location: Cytoplasmic

Metaboloic importance: NA

Operon status: Not Known

Operon components: None

Similarity: NA

Homologues:

Organism=Drosophila melanogaster, GI24651418, Length=173, Percent_Identity=28.3236994219653, Blast_Score=74, Evalue=1e-13,
Organism=Drosophila melanogaster, GI24651477, Length=187, Percent_Identity=27.807486631016, Blast_Score=74, Evalue=1e-13,
Organism=Drosophila melanogaster, GI116008128, Length=133, Percent_Identity=31.5789473684211, Blast_Score=74, Evalue=1e-13,
Organism=Drosophila melanogaster, GI116008432, Length=133, Percent_Identity=31.5789473684211, Blast_Score=74, Evalue=1e-13,
Organism=Drosophila melanogaster, GI116008130, Length=117, Percent_Identity=31.6239316239316, Blast_Score=71, Evalue=1e-12,
Organism=Drosophila melanogaster, GI116008537, Length=117, Percent_Identity=31.6239316239316, Blast_Score=71, Evalue=1e-12,
Organism=Drosophila melanogaster, GI221460681, Length=127, Percent_Identity=33.8582677165354, Blast_Score=69, Evalue=4e-12,
Organism=Drosophila melanogaster, GI21358233, Length=184, Percent_Identity=25, Blast_Score=69, Evalue=5e-12,
Organism=Drosophila melanogaster, GI281361323, Length=130, Percent_Identity=32.3076923076923, Blast_Score=68, Evalue=9e-12,
Organism=Drosophila melanogaster, GI24651430, Length=116, Percent_Identity=31.8965517241379, Blast_Score=68, Evalue=1e-11,
Organism=Drosophila melanogaster, GI24651424, Length=114, Percent_Identity=30.7017543859649, Blast_Score=65, Evalue=7e-11,

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

NA

Pfam domain/function: NA

EC number: NA

Molecular weight: Translated: 37508; Mature: 37508

Theoretical pI: Translated: 7.04; Mature: 7.04

Prosite motif: NA

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

2.1 %Cys     (Translated Protein)
1.2 %Met     (Translated Protein)
3.4 %Cys+Met (Translated Protein)
2.1 %Cys     (Mature Protein)
1.2 %Met     (Mature Protein)
3.4 %Cys+Met (Mature Protein)

Secondary structure:

>Translated Secondary Structure
MQQLFIQLLEERYFNRLPEKVVMKFATESADFVRFWLVHCVRSSASERELALALFFSAEW
CHHHHHHHHHHHHHHHCCHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHEEEEEEECCCC
QLEELSRLKAFWNDYVVIESLQQHERLKARFCELNPWLTTISPYHADAICVFSALLAFEQ
CHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCHHHHHHHHHHHHCC
SPISVGQQLQSAKLFYSQLPSNLLEEVKESSLPYLQPITVYGHQGKGSIEAGIRNNMHYP
CHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCEEEEEEEECCCCCCEEHHHCCCCCCC
TPIPNGSVALACAERVTAKFSGIKIAYAEPMVVLRYEPGQFYQWHYDAIHAHTSEIKAEL
CCCCCCCEEHHHHHHHHHHHCCEEEEEECCEEEEEECCCCEEEEEHHHHHHHHHHHHHHH
ERFGQRCRTGILYLNDDFQGGETEFKAPYIQVKPQAAAILVFDNTDKSGKPIPSSLHRGC
HHHHHHHCCCEEEECCCCCCCCCCCCCCEEEECCCEEEEEEEECCCCCCCCCCHHHHCCC
EVTSGHKWVCTQWFRDKPFWLRSGLLT
CCCCCCCEEEEHHHCCCCHHHHCCCCC
>Mature Secondary Structure
MQQLFIQLLEERYFNRLPEKVVMKFATESADFVRFWLVHCVRSSASERELALALFFSAEW
CHHHHHHHHHHHHHHHCCHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHEEEEEEECCCC
QLEELSRLKAFWNDYVVIESLQQHERLKARFCELNPWLTTISPYHADAICVFSALLAFEQ
CHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCHHHHHHHHHHHHCC
SPISVGQQLQSAKLFYSQLPSNLLEEVKESSLPYLQPITVYGHQGKGSIEAGIRNNMHYP
CHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCEEEEEEEECCCCCCEEHHHCCCCCCC
TPIPNGSVALACAERVTAKFSGIKIAYAEPMVVLRYEPGQFYQWHYDAIHAHTSEIKAEL
CCCCCCCEEHHHHHHHHHHHCCEEEEEECCEEEEEECCCCEEEEEHHHHHHHHHHHHHHH
ERFGQRCRTGILYLNDDFQGGETEFKAPYIQVKPQAAAILVFDNTDKSGKPIPSSLHRGC
HHHHHHHCCCEEEECCCCCCCCCCCCCCEEEECCCEEEEEEEECCCCCCCCCCHHHHCCC
EVTSGHKWVCTQWFRDKPFWLRSGLLT
CCCCCCCEEEEHHHCCCCHHHHCCCCC

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: NA