| Definition | Shigella flexneri 2a str. 2457T, complete genome. |
|---|---|
| Accession | NC_004741 |
| Length | 4,599,354 |
The map label for this gene is proC
Identifier: 30061882
GI number: 30061882
Start: 333212
End: 334021
Strand: Reverse
Name: proC
Synonym: S0330
Alternate gene names: 30061882
Gene position: 334021-333212 (Counterclockwise)
Preceding gene: 30061890
Following gene: 30061880
Centisome position: 7.26
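The coordinate fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes inclusive 1-based coordinates and assumes that the centisome position is the gene's 5' coordinate (the larger coordinate, since the gene lies on the reverse strand) expressed as a percentage of the genome length. Under those assumptions it reproduces the listed values; the function names are hypothetical.

```python
# Values copied from the record above.
GENOME_LENGTH = 4_599_354
START, END = 333_212, 334_021  # reverse-strand gene, so END is the 5' end


def gene_length(start: int, end: int) -> int:
    """Length in bases, assuming inclusive coordinates."""
    return end - start + 1


def centisome(position: int, genome_length: int) -> float:
    """Position as a percentage of the genome length (assumed definition)."""
    return 100.0 * position / genome_length


length = gene_length(START, END)
codons = length // 3          # includes the stop codon
residues = codons - 1         # amino acids encoded before the stop codon
position = round(centisome(END, GENOME_LENGTH), 2)

print(length, residues, position)  # 810 269 7.26
```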
GC content: 54.81
Gene sequence:
>810_bases ATGGAAAAGAAAATCGGTTTTATTGGCTGCGGCAATATGGGAAAAGCCATTCTCGGCGGTCTGATTGCCAGCGGTCAGGT GCTTCCAGGGCAAATCTGGGTATACACCCCCTCCCCGGATAAAGTCGCCGCCCTGCATGACCAGTTCGGCATCAACGCCG CAGAATCGGCGCAAGAAGTGGCGCAAATCGCCGACATCATTTTTGCTGCCGTTAAACCTGGCATCATGATTAAAGTGCTT AGCGAAATCACCTCCAGCCTGAATAAAGACTCTCTGGTCGTTTCTATTGCTGCAGGTGTCACGCTCGACCAGCTTGCCCG CGCGCTGGGCCATGACCGGAAAATTATCCGCGCCATGCCGAACACTCCCGCGCTGGTTAATGCCGGGATGACCTCCGTAA CGCCAAACGCGCTGGTAACCCCAGAAGATACCGCTGATGTGCTGAATATTTTCCGCTGCTTTGGCGAAGCGGAAGTAATT GCTGAGCCGATGATCCACCCGGTGGTCGGTGTGAGCGGTTCTTCGCCAGCCTACGTATTTATGTTTATCGAAGCGATGGC CGACGCCGCCGTGCTGGGCGGGATGCCACGCGCCCAGGCGTATAAATTTGCCGCTCAGGCGGTAATGGGTTCCGCAAAAA TGGTGCTGGAAACGGGAGAACATCCGGGGGCACTGAAAGATATGGTCTGCTCACCGGGAGGCACCACCATTGAAGCGGTA CGCGTACTGGAAGAGAAAGGCTTCCGTGCCGCAGTGATCGAAGCGATGACGAAGTGTATGGAAAAATCAGAAAAACTCAG CAAATCCTGA
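As a rough illustration of how the GC content field could be recomputed from the gene sequence, the sketch below counts G and C bases as a percentage of all bases. The helper name `gc_content` is hypothetical, and the example call uses only the first 18 bases of the sequence above, so its result is not the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace in the input is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc / len(cleaned)


# First 18 bases of the gene sequence listed above (6 of 18 are G or C).
prefix = "ATGGAAAAGAAAATCGGT"
print(round(gc_content(prefix), 2))  # 33.33
```

Passing the full 810-base sequence to the same function gives a value that can be compared against the GC content field.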
Upstream 100 bases:
>100_bases TAATCCTCTATTGTGTCGCGCTTTTGCCTTCCGGCATAGTTCTGTTTATGCTTCTGCCAGCGATTATCAAAACAATGAAT TTCACGGCAGGAGTGAGGCA
Downstream 100 bases:
>100_bases TGACTTTCGCCGGACGTCAGGCCGCCACTTCGGTGCGGTTACGTCCGGCTTTCTTTGCTTTGTAAAGCGCCAAATCTGCC GATTTCAACCACTCACGATA
Product: pyrroline-5-carboxylate reductase
Products: NA
Alternate protein names: P5C reductase; P5CR
Number of amino acids: Translated: 269; Mature: 269
Protein sequence:
>269_residues MEKKIGFIGCGNMGKAILGGLIASGQVLPGQIWVYTPSPDKVAALHDQFGINAAESAQEVAQIADIIFAAVKPGIMIKVL SEITSSLNKDSLVVSIAAGVTLDQLARALGHDRKIIRAMPNTPALVNAGMTSVTPNALVTPEDTADVLNIFRCFGEAEVI AEPMIHPVVGVSGSSPAYVFMFIEAMADAAVLGGMPRAQAYKFAAQAVMGSAKMVLETGEHPGALKDMVCSPGGTTIEAV RVLEEKGFRAAVIEAMTKCMEKSEKLSKS
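The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal example that assumes the standard genetic code and a sequence already given in coding orientation (as the gene sequence above is); it is not the annotation pipeline's method. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGGAAAAGAAAATCGGTTTTATTGGCTGC"  # first 30 bases of the gene
last_codons = "AAATCCTGA"                         # last 9 bases of the gene

print(translate(first_codons))  # MEKKIGFIGC
print(translate(last_codons))   # KS
```

The two fragments match the start (MEKKIGFIGC) and end (KS) of the protein sequence listed above.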
Sequences:
>Translated_269_residues MEKKIGFIGCGNMGKAILGGLIASGQVLPGQIWVYTPSPDKVAALHDQFGINAAESAQEVAQIADIIFAAVKPGIMIKVL SEITSSLNKDSLVVSIAAGVTLDQLARALGHDRKIIRAMPNTPALVNAGMTSVTPNALVTPEDTADVLNIFRCFGEAEVI AEPMIHPVVGVSGSSPAYVFMFIEAMADAAVLGGMPRAQAYKFAAQAVMGSAKMVLETGEHPGALKDMVCSPGGTTIEAV RVLEEKGFRAAVIEAMTKCMEKSEKLSKS
>Mature_269_residues MEKKIGFIGCGNMGKAILGGLIASGQVLPGQIWVYTPSPDKVAALHDQFGINAAESAQEVAQIADIIFAAVKPGIMIKVL SEITSSLNKDSLVVSIAAGVTLDQLARALGHDRKIIRAMPNTPALVNAGMTSVTPNALVTPEDTADVLNIFRCFGEAEVI AEPMIHPVVGVSGSSPAYVFMFIEAMADAAVLGGMPRAQAYKFAAQAVMGSAKMVLETGEHPGALKDMVCSPGGTTIEAV RVLEEKGFRAAVIEAMTKCMEKSEKLSKS
Specific function: Proline biosynthesis; third (last) step. [C]
COG id: COG0345
COG function: function code E; Pyrroline-5-carboxylate reductase
Gene ontology:
Cell location: Cytoplasm
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the pyrroline-5-carboxylate reductase family
Homologues:
Organism=Homo sapiens, GI24797097, Length=266, Percent_Identity=37.9699248120301, Blast_Score=169, Evalue=2e-42
Organism=Homo sapiens, GI24797095, Length=266, Percent_Identity=37.9699248120301, Blast_Score=169, Evalue=3e-42
Organism=Homo sapiens, GI21361454, Length=266, Percent_Identity=37.9699248120301, Blast_Score=169, Evalue=3e-42
Organism=Homo sapiens, GI198041662, Length=266, Percent_Identity=34.5864661654135, Blast_Score=168, Evalue=6e-42
Organism=Escherichia coli, GI1786585, Length=269, Percent_Identity=100, Blast_Score=545, Evalue=1e-157
Organism=Caenorhabditis elegans, GI17569021, Length=264, Percent_Identity=40.1515151515151, Blast_Score=185, Evalue=2e-47
Organism=Caenorhabditis elegans, GI17540664, Length=285, Percent_Identity=37.1929824561403, Blast_Score=152, Evalue=1e-37
Organism=Saccharomyces cerevisiae, GI6320861, Length=272, Percent_Identity=29.0441176470588, Blast_Score=111, Evalue=1e-25
Organism=Drosophila melanogaster, GI24648116, Length=277, Percent_Identity=38.2671480144404, Blast_Score=180, Evalue=1e-45
Organism=Drosophila melanogaster, GI21358587, Length=265, Percent_Identity=39.2452830188679, Blast_Score=179, Evalue=1e-45
Organism=Drosophila melanogaster, GI24647700, Length=169, Percent_Identity=41.4201183431953, Blast_Score=131, Evalue=5e-31
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): P5CR_ECOLI (P0A9L8)
Other databases:
- EMBL: J01665
- EMBL: U73857
- EMBL: U00096
- EMBL: AP009048
- PIR: A00385
- RefSeq: AP_001037.1
- RefSeq: NP_414920.1
- ProteinModelPortal: P0A9L8
- SMR: P0A9L8
- DIP: DIP-47863N
- STRING: P0A9L8
- EnsemblBacteria: EBESCT00000003053
- EnsemblBacteria: EBESCT00000014492
- GeneID: 945034
- GenomeReviews: AP009048_GR
- GenomeReviews: U00096_GR
- KEGG: ecj:JW0377
- KEGG: eco:b0386
- EchoBASE: EB0762
- EcoGene: EG10769
- eggNOG: COG0345
- GeneTree: EBGT00050000011553
- HOGENOM: HBG726602
- OMA: RTFNEHQ
- ProtClustDB: PRK11880
- BioCyc: EcoCyc:PYRROLINECARBREDUCT-MONOMER
- BioCyc: MetaCyc:PYRROLINECARBREDUCT-MONOMER
- Genevestigator: P0A9L8
- GO: GO:0005737
- InterPro: IPR008927
- InterPro: IPR016040
- InterPro: IPR004455
- InterPro: IPR000304
- Gene3D: G3DSA:3.40.50.720
- PANTHER: PTHR11645
- PIRSF: PIRSF000193
- TIGRFAMs: TIGR00112
Pfam domain/function: PF03807 F420_oxidored; SSF48179 6DGDH_C_like
EC number: 1.5.1.2
Molecular weight: Translated: 28145; Mature: 28145
Theoretical pI: Translated: 5.71; Mature: 5.71
Prosite motif: PS00521 P5CR
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.5 %Cys (Translated Protein)
5.2 %Met (Translated Protein)
6.7 %Cys+Met (Translated Protein)
1.5 %Cys (Mature Protein)
5.2 %Met (Mature Protein)
6.7 %Cys+Met (Mature Protein)
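The Cys/Met percentages can be recomputed from the protein sequence by simple residue counting. The following is a small sketch under the assumption that each percentage is the residue count divided by the total number of residues, rounded to one decimal place; the helper name `residue_percent` is hypothetical.

```python
# Protein sequence copied from the record (269 residues).
PROTEIN = (
    "MEKKIGFIGCGNMGKAILGGLIASGQVLPGQIWVYTPSPDKVAALHDQFGINAAESAQEVAQIADIIFAAVKPGIMIKVL"
    "SEITSSLNKDSLVVSIAAGVTLDQLARALGHDRKIIRAMPNTPALVNAGMTSVTPNALVTPEDTADVLNIFRCFGEAEVI"
    "AEPMIHPVVGVSGSSPAYVFMFIEAMADAAVLGGMPRAQAYKFAAQAVMGSAKMVLETGEHPGALKDMVCSPGGTTIEAV"
    "RVLEEKGFRAAVIEAMTKCMEKSEKLSKS"
)


def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions occupied by any of the given residues."""
    hits = sum(protein.count(r) for r in residues)
    return round(100.0 * hits / len(protein), 1)


print(residue_percent(PROTEIN, "C"))   # 1.5
print(residue_percent(PROTEIN, "M"))   # 5.2
print(residue_percent(PROTEIN, "CM"))  # 6.7
```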
Secondary structure:
>Translated Secondary Structure
MEKKIGFIGCGNMGKAILGGLIASGQVLPGQIWVYTPSPDKVAALHDQFGINAAESAQEV
CCCCEEEEECCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHCCCHHHHHHHH
AQIADIIFAAVKPGIMIKVLSEITSSLNKDSLVVSIAAGVTLDQLARALGHDRKIIRAMP
HHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHCCCCHHHHHCC
NTPALVNAGMTSVTPNALVTPEDTADVLNIFRCFGEAEVIAEPMIHPVVGVSGSSPAYVF
CCCCEEECCCCCCCCCCEECCCHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHH
MFIEAMADAAVLGGMPRAQAYKFAAQAVMGSAKMVLETGEHPGALKDMVCSPGGTTIEAV
HHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHEEECCCCCCHHHHHHCCCCCCHHHHH
RVLEEKGFRAAVIEAMTKCMEKSEKLSKS
HHHHHCCCHHHHHHHHHHHHHHHHHHCCC
>Mature Secondary Structure
MEKKIGFIGCGNMGKAILGGLIASGQVLPGQIWVYTPSPDKVAALHDQFGINAAESAQEV
CCCCEEEEECCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHCCCHHHHHHHH
AQIADIIFAAVKPGIMIKVLSEITSSLNKDSLVVSIAAGVTLDQLARALGHDRKIIRAMP
HHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHCCCCHHHHHCC
NTPALVNAGMTSVTPNALVTPEDTADVLNIFRCFGEAEVIAEPMIHPVVGVSGSSPAYVF
CCCCEEECCCCCCCCCCEECCCHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHH
MFIEAMADAAVLGGMPRAQAYKFAAQAVMGSAKMVLETGEHPGALKDMVCSPGGTTIEAV
HHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHEEECCCCCCHHHHHHCCCCCCHHHHH
RVLEEKGFRAAVIEAMTKCMEKSEKLSKS
HHHHHCCCHHHHHHHHHHHHHHHHHHCCC
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 6296787; 9278503