Definition Bacillus cereus AH820, complete genome.
Accession NC_011773
Length 5,302,683

The map label for this gene is proC3 [H]

Identifier: 218905276

GI number: 218905276

Start: 3971508

End: 3972347

Strand: Direct

Name: proC3 [H]

Synonym: BCAH820_4160

Alternate gene names: 218905276

Gene position: 3971508-3972347 (Clockwise)

Preceding gene: 218905273

Following gene: 218905281

Centisome position: 74.9
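
The coordinates above are enough to recompute the gene length and the centisome position. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming the centisome is the start coordinate expressed as a percentage of the genome length (the record does not state its formula). The function names are illustrative and not part of the record.

```python
# Values copied from the Length, Start and End fields of this record.
GENOME_LENGTH = 5_302_683
START, END = 3_971_508, 3_972_347


def gene_length(start: int, end: int) -> int:
    """Length in bases of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of the genome length (assumed definition)."""
    return 100.0 * start / genome_length


print(gene_length(START, END))                    # 840
print(round(centisome(START, GENOME_LENGTH), 1))  # 74.9
```

Both results agree with the 840-base gene sequence and the reported centisome position of 74.9.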

GC content: 37.26

Gene sequence:

>840_bases
ATGTCTATTCAAAACATTTCCTTTCTCGGTGCAGGCTCTATTGCTGAAGCTATTATTGGTGGCTTGTTACATGCAAATGT
TGTGAAAGGCGAACAAATTACCGTAAGTAATCGTTCTAACGAGACAAGGTTACAGGAGCTACATCAAAAATATGGAGTCA
AAGGTACGCATAATAAAAAAGAACTACTTACTGATACAAATATTCTTTTTCTAGCTATGAAACCTAAGGATATTGCAGAA
GCGCTTATCCCTTTTAAAGAATATATACATCATAACGTACTTATTATTTCGTTATTAGCGGGTGTTTCTACTCACTCGAT
TAAAAACTTACTTCAAAAAGACGTTCCGATTATTCGAGCAATGCCAAATACATCTGCAGCTATTTTAAAATCAGCTACTG
CTATCTCACCTTCAAAGCATGCAACAGCGGAACATATTCAGACTGCCATAGCTTTATTTAAAACGATCGGCCTCGTCTCT
GTTGTAGAGGAAGAAGATATGCATGCTGTCACTGCATTATCTGGAAGTGGGCCTGCTTATATTTATTACGTAGTAGAAGC
GATGGAAGCAGCCGCAAAAAAAATCGGTTTAAAAGAAGATGTTGCAAAGTCACTTATTCTTCAGACGATGATTGGTGCTG
CTGAAATGCTAAAAGCAAGTGAAAAACACCCTTCTATTTTGCGAAAGGAAATTACTTCTCCTGGTGGAACGACCGAAGCG
GGCATTGAAGTATTACAAGAACATAAATTTCAACAAGCACTTATTTCTTGTATTACACAAGCAGCGCAACGATCGCATAA
CCTCGGGAAAACATTAGAACAACTAACAAAAGAAAAATAA
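
The GC content field can be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content is simply the percentage of G and C bases over the whole sequence; `gc_content` is a hypothetical helper, and its result on the full 840-base sequence has not been checked here against the reported 37.26.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace and line breaks (as in the wrapped FASTA body) are ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Toy input; paste the sequence lines of the record to check the real value.
print(gc_content("ATGC"))  # 50.0
```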

Upstream 100 bases:

>100_bases
ATATTTGTCATTATTTTCAGATAAAAAGAGGTGTTTCCTATTAAAAAACGAATGTATGTACATTTATAAGAATTTCTACC
AAATTTAAAGGAGGAACATC

Downstream 100 bases:

>100_bases
GAAATGGAGCTCAATCTTACTTGAGCTCCATTTCTTATAAACATATGTTTTTAGTTATTCGTCTTCCCAAAAATCTCTTC
TTGCAGACGACGACCAGTCG

Product: pyrroline-5-carboxylate reductase

Products: NA

Alternate protein names: P5C reductase 2; P5CR 2 [H]

Number of amino acids: Translated: 279; Mature: 278
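
The two residue counts follow from the 840-base gene length. The sketch below assumes the coding sequence includes its stop codon (the gene sequence ends in TAA) and that the mature form differs only by removal of the initiator methionine, which matches the mature sequence listed in this record. The function name and parameters are illustrative.

```python
def residue_counts(cds_length: int,
                   has_stop_codon: bool = True,
                   initiator_met_removed: bool = True) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence."""
    if cds_length % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    # One codon per residue; the stop codon encodes no residue.
    translated = cds_length // 3 - (1 if has_stop_codon else 0)
    # The mature protein here lacks the leading Met.
    mature = translated - (1 if initiator_met_removed else 0)
    return translated, mature


print(residue_counts(840))  # (279, 278)
```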

Protein sequence:

>279_residues
MSIQNISFLGAGSIAEAIIGGLLHANVVKGEQITVSNRSNETRLQELHQKYGVKGTHNKKELLTDTNILFLAMKPKDIAE
ALIPFKEYIHHNVLIISLLAGVSTHSIKNLLQKDVPIIRAMPNTSAAILKSATAISPSKHATAEHIQTAIALFKTIGLVS
VVEEEDMHAVTALSGSGPAYIYYVVEAMEAAAKKIGLKEDVAKSLILQTMIGAAEMLKASEKHPSILRKEITSPGGTTEA
GIEVLQEHKFQQALISCITQAAQRSHNLGKTLEQLTKEK

Sequences:

>Translated_279_residues
MSIQNISFLGAGSIAEAIIGGLLHANVVKGEQITVSNRSNETRLQELHQKYGVKGTHNKKELLTDTNILFLAMKPKDIAE
ALIPFKEYIHHNVLIISLLAGVSTHSIKNLLQKDVPIIRAMPNTSAAILKSATAISPSKHATAEHIQTAIALFKTIGLVS
VVEEEDMHAVTALSGSGPAYIYYVVEAMEAAAKKIGLKEDVAKSLILQTMIGAAEMLKASEKHPSILRKEITSPGGTTEA
GIEVLQEHKFQQALISCITQAAQRSHNLGKTLEQLTKEK
>Mature_278_residues
SIQNISFLGAGSIAEAIIGGLLHANVVKGEQITVSNRSNETRLQELHQKYGVKGTHNKKELLTDTNILFLAMKPKDIAEA
LIPFKEYIHHNVLIISLLAGVSTHSIKNLLQKDVPIIRAMPNTSAAILKSATAISPSKHATAEHIQTAIALFKTIGLVSV
VEEEDMHAVTALSGSGPAYIYYVVEAMEAAAKKIGLKEDVAKSLILQTMIGAAEMLKASEKHPSILRKEITSPGGTTEAG
IEVLQEHKFQQALISCITQAAQRSHNLGKTLEQLTKEK

Specific function: Proline biosynthesis; third (last) step. [C]

COG id: COG0345

COG function: function code E; Pyrroline-5-carboxylate reductase

Gene ontology:

Cell location: Cytoplasm [H]

Metabolic importance: Non_Essential [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the pyrroline-5-carboxylate reductase family [H]

Homologues:

Organism=Homo sapiens, GI24797097, Length=273, Percent_Identity=33.3333333333333, Blast_Score=144, Evalue=1e-34,
Organism=Homo sapiens, GI24797095, Length=273, Percent_Identity=33.3333333333333, Blast_Score=144, Evalue=1e-34,
Organism=Homo sapiens, GI21361454, Length=273, Percent_Identity=31.8681318681319, Blast_Score=139, Evalue=2e-33,
Organism=Homo sapiens, GI198041662, Length=271, Percent_Identity=28.4132841328413, Blast_Score=124, Evalue=1e-28,
Organism=Escherichia coli, GI1786585, Length=268, Percent_Identity=32.089552238806, Blast_Score=159, Evalue=2e-40,
Organism=Caenorhabditis elegans, GI17569021, Length=264, Percent_Identity=28.030303030303, Blast_Score=137, Evalue=9e-33,
Organism=Caenorhabditis elegans, GI17540664, Length=282, Percent_Identity=25.886524822695, Blast_Score=94, Evalue=7e-20,
Organism=Saccharomyces cerevisiae, GI6320861, Length=206, Percent_Identity=30.0970873786408, Blast_Score=95, Evalue=1e-20,
Organism=Drosophila melanogaster, GI24648116, Length=279, Percent_Identity=29.7491039426523, Blast_Score=135, Evalue=2e-32,
Organism=Drosophila melanogaster, GI21358587, Length=272, Percent_Identity=29.7794117647059, Blast_Score=134, Evalue=7e-32,
Organism=Drosophila melanogaster, GI24647700, Length=172, Percent_Identity=33.1395348837209, Blast_Score=105, Evalue=2e-23,
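
Each homologue line above is a comma-separated list of `key=value` fields plus a bare GI identifier. The following sketch shows one way to parse such a line into a dictionary. The `parse_homologue` helper and the `GI` key are assumptions for illustration, not part of any tool that produced this record.

```python
def parse_homologue(line: str) -> dict:
    """Parse one 'Homologues' line into a dict with typed numeric fields."""
    record = {}
    for field in line.strip().rstrip(",").split(", "):
        key, sep, value = field.partition("=")
        if sep:
            record[key.strip()] = value.strip()
        elif field.startswith("GI"):
            # The GI identifier has no '=' in this record format.
            record["GI"] = field[2:]
    record["Length"] = int(record["Length"])
    record["Percent_Identity"] = float(record["Percent_Identity"])
    record["Blast_Score"] = int(record["Blast_Score"])
    record["Evalue"] = float(record["Evalue"])
    return record


line = ("Organism=Escherichia coli, GI1786585, Length=268, "
        "Percent_Identity=32.089552238806, Blast_Score=159, Evalue=2e-40,")
hit = parse_homologue(line)
print(hit["Organism"], hit["GI"], round(hit["Percent_Identity"], 2))
```

Parsing the fields this way makes it straightforward to sort hits by E-value or to round the percent identity for display.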

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): NA

Other databases:

- InterPro:   IPR008927
- InterPro:   IPR016040
- InterPro:   IPR004455
- InterPro:   IPR000304 [H]

Pfam domain/function: PF03807 F420_oxidored [H]

EC number: 1.5.1.2 [H]

Molecular weight: Translated: 30138; Mature: 30007
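
The 131 Da difference between the two values is consistent with the loss of one methionine residue (about 131.19 Da). The sketch below shows how an average molecular weight can be estimated from a protein sequence, assuming standard average residue masses plus one water molecule for the free termini. The record does not say which method produced its values, so exact agreement is not guaranteed.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528


def molecular_weight(seq: str) -> float:
    """Average molecular weight of an unmodified linear peptide."""
    cleaned = "".join(seq.split()).upper()
    return sum(RESIDUE_MASS[aa] for aa in cleaned) + WATER


# Toy check: glycylglycine has a molecular weight of about 132.12 Da.
print(round(molecular_weight("GG"), 2))  # 132.12
```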

Theoretical pI: Translated: 8.93; Mature: 8.93

Prosite motif: PS00521 P5CR

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

0.4 %Cys     (Translated Protein)
2.5 %Met     (Translated Protein)
2.9 %Cys+Met (Translated Protein)
0.4 %Cys     (Mature Protein)
2.2 %Met     (Mature Protein)
2.5 %Cys+Met (Mature Protein)
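
The percentages above can be reproduced by counting C and M residues in the protein sequence from this record. This is a minimal sketch, assuming each percentage is computed from raw counts and rounded to one decimal place; `cys_met_percent` is an illustrative name.

```python
# Translated protein sequence, copied from the "Protein sequence" field.
TRANSLATED = (
    "MSIQNISFLGAGSIAEAIIGGLLHANVVKGEQITVSNRSNETRLQELHQKYGVKGTHNKKELLTDTNILFLAMKPKDIAE"
    "ALIPFKEYIHHNVLIISLLAGVSTHSIKNLLQKDVPIIRAMPNTSAAILKSATAISPSKHATAEHIQTAIALFKTIGLVS"
    "VVEEEDMHAVTALSGSGPAYIYYVVEAMEAAAKKIGLKEDVAKSLILQTMIGAAEMLKASEKHPSILRKEITSPGGTTEA"
    "GIEVLQEHKFQQALISCITQAAQRSHNLGKTLEQLTKEK"
)
MATURE = TRANSLATED[1:]  # initiator methionine removed


def cys_met_percent(seq: str) -> tuple[float, float, float]:
    """Return (%Cys, %Met, %Cys+Met), each rounded to one decimal place."""
    n = len(seq)
    cys, met = seq.count("C"), seq.count("M")
    return tuple(round(100.0 * k / n, 1) for k in (cys, met, cys + met))


print(cys_met_percent(TRANSLATED))  # (0.4, 2.5, 2.9)
print(cys_met_percent(MATURE))      # (0.4, 2.2, 2.5)
```

The translated protein contains 1 Cys and 7 Met in 279 residues; the mature form has 1 Cys and 6 Met in 278.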

Secondary structure:

>Translated Secondary Structure
MSIQNISFLGAGSIAEAIIGGLLHANVVKGEQITVSNRSNETRLQELHQKYGVKGTHNKK
CCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHCCCCCCCHH
ELLTDTNILFLAMKPKDIAEALIPFKEYIHHNVLIISLLAGVSTHSIKNLLQKDVPIIRA
HHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCEEEE
MPNTSAAILKSATAISPSKHATAEHIQTAIALFKTIGLVSVVEEEDMHAVTALSGSGPAY
CCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCCEE
IYYVVEAMEAAAKKIGLKEDVAKSLILQTMIGAAEMLKASEKHPSILRKEITSPGGTTEA
EHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCHHH
GIEVLQEHKFQQALISCITQAAQRSHNLGKTLEQLTKEK
HHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHCCC
>Mature Secondary Structure 
SIQNISFLGAGSIAEAIIGGLLHANVVKGEQITVSNRSNETRLQELHQKYGVKGTHNKK
CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHCCCCCCCHH
ELLTDTNILFLAMKPKDIAEALIPFKEYIHHNVLIISLLAGVSTHSIKNLLQKDVPIIRA
HHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCEEEE
MPNTSAAILKSATAISPSKHATAEHIQTAIALFKTIGLVSVVEEEDMHAVTALSGSGPAY
CCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCCEE
IYYVVEAMEAAAKKIGLKEDVAKSLILQTMIGAAEMLKASEKHPSILRKEITSPGGTTEA
EHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCHHH
GIEVLQEHKFQQALISCITQAAQRSHNLGKTLEQLTKEK
HHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHCCC

PDB accession: NA

Resolution: NA

Structure class: Unstructured

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 10.0

TargetDB status: NA

Availability: NA

References: 8969508; 9384377; 11418582 [H]