Definition Geobacillus thermodenitrificans NG80-2 chromosome, complete genome.
Accession NC_009328
Length 3,550,319

The map label for this gene is pyrC

Identifier: 138894677

GI number: 138894677

Start: 1072763

End: 1074049

Strand: Direct

Name: pyrC

Synonym: GTNG_1007

Alternate gene names: 138894677

Gene position: 1072763-1074049 (Clockwise)

Preceding gene: 138894676

Following gene: 138894678

Centisome position: 30.22
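
The gene length and the centisome value can be cross-checked from the coordinates above. This is a minimal sketch, assuming 1-based inclusive coordinates and assuming that the centisome position is the gene start expressed as a percentage of the chromosome length; the function names are illustrative and not part of the database.

```python
GENOME_LENGTH = 3_550_319          # "Length" field of this record
START, END = 1_072_763, 1_074_049  # "Start" and "End" fields of this record


def gene_length(start: int, end: int) -> int:
    """Length in bases for 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the chromosome length (assumed definition)."""
    return round(100.0 * start / genome_length, 2)


print(gene_length(START, END))          # length of the gene in bases
print(centisome(START, GENOME_LENGTH))  # centisome position
```

Under these assumptions the sketch reproduces the 1287-base gene length and the centisome position of 30.22 listed in this record.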

GC content: 54.55
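
The GC content is the percentage of G and C bases in the gene sequence. The following sketch shows one way to compute it, assuming an unambiguous DNA string (only A, C, G, T); `gc_content` is an illustrative helper, not a database function.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, rounded to two decimals."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return round(100.0 * gc / len(seq), 2)


# First 12 bases of the gene sequence in this record: 7 of 12 are G or C.
print(gc_content("ATGGCCATGTGG"))
```

As an arithmetic check, a value of 54.55 over 1287 bases corresponds to 702 G or C bases (702 / 1287 = 54.55 %).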

Gene sequence:

>1287_bases
ATGGCCATGTGGTTGAAAAATGGCATGTCGTTCAATGAAAACGGGCAATTGGTGCGAACTCATATCAAAATAGAGCACGG
AAACATTGCAGCGATCCATCATGAACAGCTGTTTGAAGCAAATGGAGAAGACGTGATCGACGTTGGGGGCAAACTCATCG
CTCCCGGCTTGATTGATGTGCATGTCCATTTGCGCGAGCCGGGTGGAGAAGCGAAAGAAACGATTGAAACCGGCACGCTC
GCCGCTGCGAAAGGTGGCTTTACGACAGTGGCGGCGATGCCAAACACAAACCCGGTGCCGGACCGGAAAGAGCAGATGGA
ATGGCTTGCGCGACGCATCCAAGAAACGGCGCATGTCCGTGTGTTGCCGTATGCTTCGATCACGCTCGGGCAAAAAGGGG
AGGAGCTGACCGATTTTGCCGCGTTAAAAGAAGCGGGAGCGTTCGCCTTTACTGATGACGGCGTCGGAGTGCAATCAGCG
GGGATGATGTTTGAAGCGATGAAACGGGCCGCCGCCCTCGATATGGCGATCGTTGCCCACTGCGAGGACGATACGTTAAA
AAACGGCGGTGCGGTGCATGACGGCGATTTTGCGCGCCGATACGGGATCGCTGGCATCCCGTCGGTCTGTGAAGCGGTGC
ACATCGCCCGCGATGTTTTGCTCGCCGAAGCGACCGGGTGCCACTATCATGTCTGCCATATTAGCACGAAAGAATCGGTG
CGCGTCGTCCGTGACGCAAAGCGGGCTGGTATCTGCGTCACCGCGGAAGTGACGCCGCATCATCTCCTCCTATGCGATGA
GGACATTCCGAGGCTTGATGCGAACTATAAAATGAATCCGCCGCTGCGCAGCCGCGCTGACCGTGAAGCGTTAATTGAAG
GGCTGCTCGACGGCACGATCGATTTCATTGCCACCGACCACGCGCCGCATACAGCAGCGGAAAAAGCGAAAGGAATGGAG
GCGGCGCCGTTTGGCATCGTCGGATTGGAAACGGCATTCCCGCTTTTGTACACCCATTTTGTCAAAAAGAACGTGTTTAC
GTTAAAACAGCTTGTCGATTGGTTGACGATCAAACCAGCGCAATGTTTCGGCTTGCAAACAGGGCGACTCGAGGTCGGAG
CACCGGCGGATATCACGGTCATTGATTTAGAAACAGAAGAACCGATTGATCCAGAGACATTTGCCTCCAAAGGAAACAAT
ACGCCGTTTGCCGGATGGAGATGTCAAGGATGGCCGGTGATGACGTTTGTTGGTGGAACACTCGTATGGGAGAAAGGAAG
GGCATAA

Upstream 100 bases:

>100_bases
CTTGTCGAAGCGAAACCGTCACGCATTTTTAAACAAATGGAAAATGGCGTCTATGTGCGGATGGCGGTCTTAAAACGGGC
AATAGAAGGGAGAATGCAGC

Downstream 100 bases:

>100_bases
CATGAAGCGGCAGCTTATTTTAGAAGATGGTTCGTTTTTTGTTGGGGAAGCATTTGGTAGTTTGAAAGAGACGACGGGTG
AAGTCGTCTTTAACACCGGG
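
The upstream and downstream blocks are the 100 bases on either side of the gene. A minimal sketch of that extraction is given below, assuming 1-based inclusive coordinates, a gene on the direct strand, and no wrap-around at the chromosome origin; `flanks` and the toy genome are hypothetical and only illustrate the indexing.

```python
def flanks(genome: str, start: int, end: int, size: int = 100) -> tuple[str, str]:
    """Return (upstream, downstream) flanks for a direct-strand gene.

    start and end are 1-based and inclusive; circular wrap-around is ignored.
    """
    upstream = genome[max(0, start - 1 - size):start - 1]
    downstream = genome[end:end + size]
    return upstream, downstream


# Toy example: a 6-base "gene" at positions 5-10 with 4-base flanks.
toy_genome = "AAAA" + "CCCGGG" + "TTTT"
print(flanks(toy_genome, start=5, end=10, size=4))
```

For a gene on the complementary strand the two flanks would swap roles and need reverse-complementing, which this sketch does not do.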

Product: dihydroorotase

Products: NA

Alternate protein names: DHOase

Number of amino acids: Translated: 428; Mature: 427

Protein sequence:

>428_residues
MAMWLKNGMSFNENGQLVRTHIKIEHGNIAAIHHEQLFEANGEDVIDVGGKLIAPGLIDVHVHLREPGGEAKETIETGTL
AAAKGGFTTVAAMPNTNPVPDRKEQMEWLARRIQETAHVRVLPYASITLGQKGEELTDFAALKEAGAFAFTDDGVGVQSA
GMMFEAMKRAAALDMAIVAHCEDDTLKNGGAVHDGDFARRYGIAGIPSVCEAVHIARDVLLAEATGCHYHVCHISTKESV
RVVRDAKRAGICVTAEVTPHHLLLCDEDIPRLDANYKMNPPLRSRADREALIEGLLDGTIDFIATDHAPHTAAEKAKGME
AAPFGIVGLETAFPLLYTHFVKKNVFTLKQLVDWLTIKPAQCFGLQTGRLEVGAPADITVIDLETEEPIDPETFASKGNN
TPFAGWRCQGWPVMTFVGGTLVWEKGRA
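
The protein sequence is the translation of the 1287-base gene sequence: 429 codons, of which the last is a stop codon, giving 428 residues. The sketch below translates DNA with the standard codon assignments, assuming an ATG start and translation up to the first stop codon; it does not handle the alternative start codons that the bacterial genetic code permits.

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGGCCATGTGGTTGAAAAATGGCATGTCG"))
print(translate("GAGAAAGGAAGGGCATAA"))
```

The two fragments give the first ten residues (MAMWLKNGMS) and the last five residues (EKGRA) of the protein sequence above.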

Sequences:

>Translated_428_residues
MAMWLKNGMSFNENGQLVRTHIKIEHGNIAAIHHEQLFEANGEDVIDVGGKLIAPGLIDVHVHLREPGGEAKETIETGTL
AAAKGGFTTVAAMPNTNPVPDRKEQMEWLARRIQETAHVRVLPYASITLGQKGEELTDFAALKEAGAFAFTDDGVGVQSA
GMMFEAMKRAAALDMAIVAHCEDDTLKNGGAVHDGDFARRYGIAGIPSVCEAVHIARDVLLAEATGCHYHVCHISTKESV
RVVRDAKRAGICVTAEVTPHHLLLCDEDIPRLDANYKMNPPLRSRADREALIEGLLDGTIDFIATDHAPHTAAEKAKGME
AAPFGIVGLETAFPLLYTHFVKKNVFTLKQLVDWLTIKPAQCFGLQTGRLEVGAPADITVIDLETEEPIDPETFASKGNN
TPFAGWRCQGWPVMTFVGGTLVWEKGRA
>Mature_427_residues
AMWLKNGMSFNENGQLVRTHIKIEHGNIAAIHHEQLFEANGEDVIDVGGKLIAPGLIDVHVHLREPGGEAKETIETGTLA
AAKGGFTTVAAMPNTNPVPDRKEQMEWLARRIQETAHVRVLPYASITLGQKGEELTDFAALKEAGAFAFTDDGVGVQSAG
MMFEAMKRAAALDMAIVAHCEDDTLKNGGAVHDGDFARRYGIAGIPSVCEAVHIARDVLLAEATGCHYHVCHISTKESVR
VVRDAKRAGICVTAEVTPHHLLLCDEDIPRLDANYKMNPPLRSRADREALIEGLLDGTIDFIATDHAPHTAAEKAKGMEA
APFGIVGLETAFPLLYTHFVKKNVFTLKQLVDWLTIKPAQCFGLQTGRLEVGAPADITVIDLETEEPIDPETFASKGNNT
PFAGWRCQGWPVMTFVGGTLVWEKGRA

Specific function: Involved in the anaerobic utilization of allantoin. [C]

COG id: COG0044

COG function: function code F; Dihydroorotase and related cyclic amidohydrolases

Gene ontology:

Cell location: Cytoplasm [C]

Metabolic importance: Unknown [C]

Operon status: Not Known

Operon components: None

Similarity: Belongs to the DHOase family. Type 2 subfamily

Homologues:

Organism=Homo sapiens, GI18105007, Length=381, Percent_Identity=33.5958005249344, Blast_Score=144, Evalue=2e-34,
Organism=Homo sapiens, GI4503375, Length=461, Percent_Identity=24.7288503253796, Blast_Score=97, Evalue=3e-20,
Organism=Homo sapiens, GI19923821, Length=429, Percent_Identity=25.4079254079254, Blast_Score=96, Evalue=9e-20,
Organism=Homo sapiens, GI4503051, Length=455, Percent_Identity=23.956043956044, Blast_Score=86, Evalue=8e-17,
Organism=Homo sapiens, GI62422571, Length=455, Percent_Identity=23.956043956044, Blast_Score=85, Evalue=1e-16,
Organism=Homo sapiens, GI4503377, Length=457, Percent_Identity=24.2888402625821, Blast_Score=79, Evalue=1e-14,
Organism=Escherichia coli, GI1786722, Length=442, Percent_Identity=27.1493212669683, Blast_Score=156, Evalue=2e-39,
Organism=Escherichia coli, GI87082175, Length=460, Percent_Identity=28.2608695652174, Blast_Score=124, Evalue=1e-29,
Organism=Caenorhabditis elegans, GI193204318, Length=365, Percent_Identity=32.8767123287671, Blast_Score=144, Evalue=1e-34,
Organism=Caenorhabditis elegans, GI71989490, Length=461, Percent_Identity=24.295010845987, Blast_Score=89, Evalue=4e-18,
Organism=Caenorhabditis elegans, GI17539558, Length=459, Percent_Identity=23.9651416122004, Blast_Score=86, Evalue=5e-17,
Organism=Caenorhabditis elegans, GI86575075, Length=448, Percent_Identity=23.6607142857143, Blast_Score=79, Evalue=6e-15,
Organism=Saccharomyces cerevisiae, GI6322218, Length=391, Percent_Identity=28.3887468030691, Blast_Score=116, Evalue=6e-27,
Organism=Drosophila melanogaster, GI24642586, Length=366, Percent_Identity=33.3333333333333, Blast_Score=141, Evalue=9e-34,
Organism=Drosophila melanogaster, GI18859883, Length=423, Percent_Identity=28.1323877068558, Blast_Score=98, Evalue=1e-20,
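
Each homologue line is a comma-separated list of `key=value` fields, plus a bare GI identifier and a trailing comma. A small parsing sketch follows, assuming exactly the field names shown in this record; `parse_homologue` is an illustrative helper.

```python
def parse_homologue(line: str) -> dict:
    """Parse one homologue line into a dictionary with typed values."""
    record = {}
    for field in line.split(","):
        field = field.strip()
        if not field:
            continue  # skip the empty field left by the trailing comma
        if "=" in field:
            key, value = field.split("=", 1)
            record[key.lower()] = value
        elif field.startswith("GI"):
            record["gi"] = field[2:]  # the GI number has no "key=" prefix
    record["length"] = int(record["length"])
    for key in ("percent_identity", "blast_score", "evalue"):
        record[key] = float(record[key])
    return record


line = ("Organism=Escherichia coli, GI1786722, Length=442, "
        "Percent_Identity=27.1493212669683, Blast_Score=156, Evalue=2e-39,")
hit = parse_homologue(line)
print(hit["organism"], hit["gi"], round(hit["percent_identity"], 1))
```

Parsed records can then be sorted by `evalue` or `blast_score` to rank the hits.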

Paralogues:

None

Copy number: NA

Swissprot (AC and ID): PYRC_GEOTN (A4IM31)

Other databases:

- EMBL:   CP000557
- RefSeq:   YP_001125130.1
- ProteinModelPortal:   A4IM31
- SMR:   A4IM31
- STRING:   A4IM31
- MEROPS:   M38.972
- GeneID:   4965950
- GenomeReviews:   CP000557_GR
- KEGG:   gtn:GTNG_1007
- NMPDR:   fig|420246.5.peg.976
- eggNOG:   COG0044
- HOGENOM:   HBG724623
- OMA:   GIFAEKE
- PhylomeDB:   A4IM31
- ProtClustDB:   PRK09357
- BioCyc:   GTHE420246:GTNG_1007-MONOMER
- HAMAP:   MF_00220_B
- InterPro:   IPR006680
- InterPro:   IPR004722
- InterPro:   IPR002195
- InterPro:   IPR011059
- TIGRFAMs:   TIGR00857

Pfam domain/function: PF01979 Amidohydro_1; SSF51338 Metalo_hydrolase

EC number: 3.5.2.3

Molecular weight: Translated: 46454; Mature: 46322

Theoretical pI: Translated: 5.66; Mature: 5.66

Prosite motif: PS00482 DIHYDROOROTASE_1; PS00483 DIHYDROOROTASE_2

Important sites: NA

Signals:

None

Transmembrane regions:

None

Cys/Met content:

1.9 %Cys     (Translated Protein)
2.8 %Met     (Translated Protein)
4.7 %Cys+Met (Translated Protein)
1.9 %Cys     (Mature Protein)
2.6 %Met     (Mature Protein)
4.4 %Cys+Met (Mature Protein)
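
The percentages above follow from counting cysteine (C) and methionine (M) residues in the translated and mature sequences. A minimal sketch, assuming the mature protein is the translated sequence without its initiator methionine (as the Mature_427_residues entry shows); `percent` is an illustrative helper.

```python
# Translated protein sequence, copied from this record (428 residues).
PROTEIN = (
    "MAMWLKNGMSFNENGQLVRTHIKIEHGNIAAIHHEQLFEANGEDVIDVGGKLIAPGLIDVHVHLREPGGEAKETIETGTL"
    "AAAKGGFTTVAAMPNTNPVPDRKEQMEWLARRIQETAHVRVLPYASITLGQKGEELTDFAALKEAGAFAFTDDGVGVQSA"
    "GMMFEAMKRAAALDMAIVAHCEDDTLKNGGAVHDGDFARRYGIAGIPSVCEAVHIARDVLLAEATGCHYHVCHISTKESV"
    "RVVRDAKRAGICVTAEVTPHHLLLCDEDIPRLDANYKMNPPLRSRADREALIEGLLDGTIDFIATDHAPHTAAEKAKGME"
    "AAPFGIVGLETAFPLLYTHFVKKNVFTLKQLVDWLTIKPAQCFGLQTGRLEVGAPADITVIDLETEEPIDPETFASKGNN"
    "TPFAGWRCQGWPVMTFVGGTLVWEKGRA"
)
MATURE = PROTEIN[1:]  # assumed: initiator Met removed


def percent(seq: str, residues: str) -> float:
    """Percentage of the given residue types in seq, rounded to one decimal."""
    count = sum(seq.count(r) for r in residues)
    return round(100.0 * count / len(seq), 1)


for name, seq in (("Translated", PROTEIN), ("Mature", MATURE)):
    print(name, percent(seq, "C"), percent(seq, "M"), percent(seq, "CM"))
```

The translated sequence contains 8 Cys and 12 Met residues; removing the initiator Met leaves 11 Met in the mature form, which accounts for the drop from 2.8 % to 2.6 %.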

Secondary structure:

>Translated Secondary Structure
MAMWLKNGMSFNENGQLVRTHIKIEHGNIAAIHHEQLFEANGEDVIDVGGKLIAPGLIDV
CCEEECCCCCCCCCCCEEEEEEEEECCCEEEEEHHHHHHCCCCCEECCCCEEECCCEEEE
HVHLREPGGEAKETIETGTLAAAKGGFTTVAAMPNTNPVPDRKEQMEWLARRIQETAHVR
EEEEECCCCHHHHHHHHCCEEECCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCEE
VLPYASITLGQKGEELTDFAALKEAGAFAFTDDGVGVQSAGMMFEAMKRAAALDMAIVAH
EEEEEEEEECCCCCHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHHHHHHHCEEEEEE
CEDDTLKNGGAVHDGDFARRYGIAGIPSVCEAVHIARDVLLAEATGCHYHVCHISTKESV
CCCCCCCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEEEECCHHHH
RVVRDAKRAGICVTAEVTPHHLLLCDEDIPRLDANYKMNPPLRSRADREALIEGLLDGTI
HHHHHHHHCCEEEEEECCCCEEEEECCCCCCCCCCCCCCCCHHHCCCHHHHHHHHHCCCE
DFIATDHAPHTAAEKAKGMEAAPFGIVGLETAFPLLYTHFVKKNVFTLKQLVDWLTIKPA
EEEEECCCCCHHHHHHCCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCH
QCFGLQTGRLEVGAPADITVIDLETEEPIDPETFASKGNNTPFAGWRCQGWPVMTFVGGT
HHHCCCCCCEEECCCCCEEEEEECCCCCCCHHHHHCCCCCCCCCCEEECCCEEEEEECCE
LVWEKGRA
EEEECCCC
>Mature Secondary Structure
AMWLKNGMSFNENGQLVRTHIKIEHGNIAAIHHEQLFEANGEDVIDVGGKLIAPGLIDV
CEEECCCCCCCCCCCEEEEEEEEECCCEEEEEHHHHHHCCCCCEECCCCEEECCCEEEE
HVHLREPGGEAKETIETGTLAAAKGGFTTVAAMPNTNPVPDRKEQMEWLARRIQETAHVR
EEEEECCCCHHHHHHHHCCEEECCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCEE
VLPYASITLGQKGEELTDFAALKEAGAFAFTDDGVGVQSAGMMFEAMKRAAALDMAIVAH
EEEEEEEEECCCCCHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHHHHHHHCEEEEEE
CEDDTLKNGGAVHDGDFARRYGIAGIPSVCEAVHIARDVLLAEATGCHYHVCHISTKESV
CCCCCCCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEEEECCHHHH
RVVRDAKRAGICVTAEVTPHHLLLCDEDIPRLDANYKMNPPLRSRADREALIEGLLDGTI
HHHHHHHHCCEEEEEECCCCEEEEECCCCCCCCCCCCCCCCHHHCCCHHHHHHHHHCCCE
DFIATDHAPHTAAEKAKGMEAAPFGIVGLETAFPLLYTHFVKKNVFTLKQLVDWLTIKPA
EEEEECCCCCHHHHHHCCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCH
QCFGLQTGRLEVGAPADITVIDLETEEPIDPETFASKGNNTPFAGWRCQGWPVMTFVGGT
HHHCCCCCCEEECCCCCEEEEEECCCCCCCHHHHHCCCCCCCCCCEEECCCEEEEEECCE
LVWEKGRA
EEEECCCC
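
The secondary structure blocks interleave a line of residues with a line of per-residue states. The sketch below separates the two and summarizes the state composition, assuming the usual three-state convention (H = helix, E = strand, C = coil), which this record does not state explicitly; both function names are illustrative.

```python
from collections import Counter


def split_interleaved(lines: list[str]) -> tuple[str, str]:
    """Join alternating residue/state lines into (sequence, states)."""
    sequence = "".join(lines[0::2])
    states = "".join(lines[1::2])
    if len(sequence) != len(states):
        raise ValueError("sequence and state strings differ in length")
    return sequence, states


def ss_composition(states: str) -> dict:
    """Percentage of each state (H, E, C), rounded to one decimal."""
    counts = Counter(states)
    return {s: round(100.0 * counts.get(s, 0) / len(states), 1) for s in "HEC"}


# Last residue/state pair of the blocks in this record.
seq, states = split_interleaved(["LVWEKGRA", "EEEECCCC"])
print(seq, ss_composition(states))
```

Passing all lines of a block (without its `>` header) gives the composition for the whole protein.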

PDB accession: NA

Resolution: NA

Structure class: Alpha Beta

Cofactors: NA

Metal ions: NA

Kcat value (1/min): NA

Specific activity: NA

Km value (mM): NA

Substrates: NA

Specific reaction: NA

General reaction: NA

Inhibitor: NA

Structure determination priority: 9.0

TargetDB status: NA

Availability: NA

References: NA