| Definition | Trichodesmium erythraeum IMS101 chromosome, complete genome. |
|---|---|
| Accession | NC_008312 |
| Length | 7,750,108 |
The map label for this gene is pyrC [H]
Identifier: 113475820
GI number: 113475820
Start: 3400076
End: 3401407
Strand: Reverse
Name: pyrC [H]
Synonym: Tery_2177
Alternate gene names: 113475820
Gene position: 3401407-3400076 (Counterclockwise)
Preceding gene: 113475822
Following gene: 113475819
Centisome position: 43.89
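The coordinate fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming that coordinates are 1-based and inclusive and that the centisome position is the gene's first transcribed base (the End coordinate on the reverse strand) expressed as a percentage of the chromosome length. That definition is inferred from the numbers in this record, not taken from the database's documentation, and the function names are hypothetical.

```python
# Values copied from this record.
GENOME_LENGTH = 7_750_108
START, END = 3_400_076, 3_401_407
STRAND = "Reverse"


def gene_length(start: int, end: int) -> int:
    """Length in bases of an inclusive, 1-based coordinate range."""
    return abs(end - start) + 1


def centisome(start: int, end: int, strand: str, genome_length: int) -> float:
    """Position of the gene's first base as a percentage of the chromosome.

    Assumption: for a reverse-strand gene the first transcribed base is the
    larger coordinate; this reproduces the value listed in the record.
    """
    first_base = end if strand == "Reverse" else start
    return round(100 * first_base / genome_length, 2)


length = gene_length(START, END)
print(length)        # 1332
print(length // 3)   # 444 codons: 443 amino acids plus one stop codon
print(centisome(START, END, STRAND, GENOME_LENGTH))  # 43.89
```

Using the Start coordinate instead would give 43.87, which is why the reverse-strand convention is assumed here.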
GC content: 39.86
Gene sequence:
>1332_bases ATGGCTCCTATCTCTTCTTTGTTGATTCGTCGTGCCCGCATCCTTTTACCGGATGGGACATTTTTGATTGGTGATGTGCA AACTCAAGGCCGAGAAATTATTCAGGTTGCTCCAGAGATTTCATCTTCTGAAGCTCCAGAAAAAATCATAGATGCTGAAG GTTTAACTTTATTACCGGGAGTAATTGATCCTCAAGTACATTTTCGGGAACCTGGGTTGGAACATAAGGAGGACTTATTT ACTGCTAGTCGGGCTTGTGTGAAGGGTGGTGTTACTTCTTTTCTAGAAATGCCTAATACTAAACCTTTGACTACAACTCA AGGAGCATTGGATGATAAGTTACGACGGGCAGAGCAAAAATGCGTGGCTAATTTTGGTTTTTTTATTGGGGCAACGGCAG AGAATTTACCTGATTTATTAACAGCTAATCCTACTCCTGGAATTAAAATTTTTATGGGTTCTATGCACGGACCTTTGTTG GTGGATACTGAGGAAAAGTTAGAGCCAATTTTTGCTAGGGGAAAAAGATTAATTGCTGTTCATGCTGAAAATCAAGCTCG AATTGATGAAAGGAAAAAACAGTTTGCTGGTATTTCTGATCCTGCTATTCATTCACAAATTCAGGATAATGAAGCTGCTT TGTTGGCAACTAAGATGGCTTTGAAATTATCTAAGAAGTATGAGCGGAGGTTACATATTTTGCATACTTCGACTGGAGAT GAGGCTGAATTGTTGCGTCAGGATAAGCCTAGTTGGGTAACTGCTGAAGTTACGCCTCAACATTTGTTCTTAAATACAAG TGCTTATGAAAAAATTGGTACTTTAGCTCAGATGAATCCTCCTTTAAAATCTGCTGGTGATAATGATATTTTATGGCGAG CTTTGTTGGATGGAGTAATTGATTTTATTGCAACGGATCATGCACCTCATACTTTGGCAGAAAAGGGTAAAGGTTATCCG AATACTCCTTCGGGAATGCCTGGGGTGGAAACTTCTTTGCCTTTGATGTTGACTCAAGCTATTGAGGGAAGATGTTCTGT TGCTCAGGTTTCGAATTGGATGTCTACTGCTGTGGCTAAGGGTTATGGAATTCTGAAAAAGGGTGCGATCGCTCCTGGTT TTGATGCAGATTTAGTGTTAGTTGATTTAAATAATTATCGCCCTGTTTTAAGAGAGGAATTGATGACAAAATGTAGGTGG AGTCCTTTTGAAGGATGGAGTTTAACTGGTTGGCCTGTTGTTACTATTGTTGGGGGAGAGGTTGCTTTTAATCGTGGTGA ATTTAATTCTGAAGTTCGGGGTCGAGCTTTGATTTTTTCGGAAATTGCCTAA
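The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is the percentage of G and C bases over the full sequence length; `gc_content` is a hypothetical helper, and the example string is only the first 30 bases of the gene, so its result differs from the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    return 100 * gc / len(seq)


# First 30 bases of the gene sequence listed above.
prefix = "ATGGCTCCTATCTCTTCTTTGTTGATTCGT"
print(gc_content(prefix))  # 40.0
```

Applied to all 1332 bases, the listed value of 39.86 would correspond to 531 G or C bases under this definition.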
Upstream 100 bases:
>100_bases TGAATTACTAGCTCCTAGTCCTTTATCTTTAATTCCTAAAATAATATAATAAGTTTCAAACCAAATTTTCTTTCTCTAGT TTTCAAACCAAATTTTTATT
Downstream 100 bases:
>100_bases ACAATTTTGACAGTAATTTTCGGGATTTTTTTGCTGCTTTGTTGCACAAAAGATGAAAATGGACTAAAAATATAGTTATA GTACAAAAAATCTGCACCTA
Product: dihydroorotase
Products: NA
Alternate protein names: DHOase [H]
Number of amino acids: Translated: 443; Mature: 442
Protein sequence:
>443_residues MAPISSLLIRRARILLPDGTFLIGDVQTQGREIIQVAPEISSSEAPEKIIDAEGLTLLPGVIDPQVHFREPGLEHKEDLF TASRACVKGGVTSFLEMPNTKPLTTTQGALDDKLRRAEQKCVANFGFFIGATAENLPDLLTANPTPGIKIFMGSMHGPLL VDTEEKLEPIFARGKRLIAVHAENQARIDERKKQFAGISDPAIHSQIQDNEAALLATKMALKLSKKYERRLHILHTSTGD EAELLRQDKPSWVTAEVTPQHLFLNTSAYEKIGTLAQMNPPLKSAGDNDILWRALLDGVIDFIATDHAPHTLAEKGKGYP NTPSGMPGVETSLPLMLTQAIEGRCSVAQVSNWMSTAVAKGYGILKKGAIAPGFDADLVLVDLNNYRPVLREELMTKCRW SPFEGWSLTGWPVVTIVGGEVAFNRGEFNSEVRGRALIFSEIA
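The protein sequence follows from the gene sequence by codon-wise translation. The sketch below illustrates this on the first and last bases of the gene, assuming the standard codon assignments; the record does not state which translation table was used, and `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCTCCTATCTCTTCTTTGTTGATTCGT"))  # MAPISSLLIR
print(translate("GAAATTGCCTAA"))                    # EIA
```

The two fragments reproduce the first ten and last three residues of the listed protein, with TAA as the stop codon.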
Sequences:
>Translated_443_residues MAPISSLLIRRARILLPDGTFLIGDVQTQGREIIQVAPEISSSEAPEKIIDAEGLTLLPGVIDPQVHFREPGLEHKEDLF TASRACVKGGVTSFLEMPNTKPLTTTQGALDDKLRRAEQKCVANFGFFIGATAENLPDLLTANPTPGIKIFMGSMHGPLL VDTEEKLEPIFARGKRLIAVHAENQARIDERKKQFAGISDPAIHSQIQDNEAALLATKMALKLSKKYERRLHILHTSTGD EAELLRQDKPSWVTAEVTPQHLFLNTSAYEKIGTLAQMNPPLKSAGDNDILWRALLDGVIDFIATDHAPHTLAEKGKGYP NTPSGMPGVETSLPLMLTQAIEGRCSVAQVSNWMSTAVAKGYGILKKGAIAPGFDADLVLVDLNNYRPVLREELMTKCRW SPFEGWSLTGWPVVTIVGGEVAFNRGEFNSEVRGRALIFSEIA >Mature_442_residues APISSLLIRRARILLPDGTFLIGDVQTQGREIIQVAPEISSSEAPEKIIDAEGLTLLPGVIDPQVHFREPGLEHKEDLFT ASRACVKGGVTSFLEMPNTKPLTTTQGALDDKLRRAEQKCVANFGFFIGATAENLPDLLTANPTPGIKIFMGSMHGPLLV DTEEKLEPIFARGKRLIAVHAENQARIDERKKQFAGISDPAIHSQIQDNEAALLATKMALKLSKKYERRLHILHTSTGDE AELLRQDKPSWVTAEVTPQHLFLNTSAYEKIGTLAQMNPPLKSAGDNDILWRALLDGVIDFIATDHAPHTLAEKGKGYPN TPSGMPGVETSLPLMLTQAIEGRCSVAQVSNWMSTAVAKGYGILKKGAIAPGFDADLVLVDLNNYRPVLREELMTKCRWS PFEGWSLTGWPVVTIVGGEVAFNRGEFNSEVRGRALIFSEIA
Specific function: Involved in the anaerobic utilization of allantoin. [C]
COG id: COG0044
COG function: function code F; Dihydroorotase and related cyclic amidohydrolases
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the DHOase family. Type 2 subfamily [H]
Homologues:
Organism=Homo sapiens, GI18105007, Length=384, Percent_Identity=31.25, Blast_Score=159, Evalue=5e-39
Organism=Homo sapiens, GI4503051, Length=447, Percent_Identity=24.1610738255034, Blast_Score=99, Evalue=6e-21
Organism=Homo sapiens, GI62422571, Length=447, Percent_Identity=24.1610738255034, Blast_Score=98, Evalue=1e-20
Organism=Homo sapiens, GI4503375, Length=458, Percent_Identity=24.6724890829694, Blast_Score=95, Evalue=1e-19
Organism=Homo sapiens, GI4503379, Length=446, Percent_Identity=24.2152466367713, Blast_Score=90, Evalue=3e-18
Organism=Homo sapiens, GI4503377, Length=456, Percent_Identity=24.1228070175439, Blast_Score=90, Evalue=4e-18
Organism=Homo sapiens, GI190194363, Length=447, Percent_Identity=24.1610738255034, Blast_Score=87, Evalue=2e-17
Organism=Homo sapiens, GI19923821, Length=454, Percent_Identity=22.9074889867841, Blast_Score=81, Evalue=2e-15
Organism=Escherichia coli, GI1786722, Length=437, Percent_Identity=26.7734553775744, Blast_Score=162, Evalue=5e-41
Organism=Escherichia coli, GI87082175, Length=453, Percent_Identity=28.2560706401766, Blast_Score=151, Evalue=8e-38
Organism=Caenorhabditis elegans, GI193204318, Length=420, Percent_Identity=30.7142857142857, Blast_Score=162, Evalue=4e-40
Organism=Caenorhabditis elegans, GI71989490, Length=452, Percent_Identity=25.6637168141593, Blast_Score=108, Evalue=6e-24
Organism=Caenorhabditis elegans, GI17539558, Length=451, Percent_Identity=26.6075388026608, Blast_Score=104, Evalue=9e-23
Organism=Caenorhabditis elegans, GI86575075, Length=448, Percent_Identity=25, Blast_Score=73, Evalue=3e-13
Organism=Saccharomyces cerevisiae, GI6322218, Length=394, Percent_Identity=28.1725888324873, Blast_Score=137, Evalue=5e-33
Organism=Drosophila melanogaster, GI24642586, Length=378, Percent_Identity=32.010582010582, Blast_Score=162, Evalue=5e-40
Organism=Drosophila melanogaster, GI18859883, Length=476, Percent_Identity=26.890756302521, Blast_Score=112, Evalue=4e-25
Organism=Drosophila melanogaster, GI221377917, Length=409, Percent_Identity=25.4278728606357, Blast_Score=97, Evalue=2e-20
Organism=Drosophila melanogaster, GI17137462, Length=404, Percent_Identity=25.990099009901, Blast_Score=94, Evalue=1e-19
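The homologue entries follow a regular `Organism=..., GI..., Length=..., Percent_Identity=..., Blast_Score=..., Evalue=...` pattern, so they can be pulled into structured records for sorting or filtering. The following is a rough sketch using a regular expression; the pattern and the `parse_homologues` name are assumptions made for illustration, and the sample text contains only two of the entries listed above.

```python
import re

# Hypothetical pattern matching one homologue entry of this record.
HIT_PATTERN = re.compile(
    r"Organism=(?P<organism>[^,]+),\s*"
    r"GI(?P<gi>\d+),\s*"
    r"Length=(?P<length>\d+),\s*"
    r"Percent_Identity=(?P<identity>[\d.]+),\s*"
    r"Blast_Score=(?P<score>\d+),\s*"
    r"Evalue=(?P<evalue>[\d.eE+-]+)"
)


def parse_homologues(text: str) -> list[dict]:
    """Return one dict per homologue entry found in `text`."""
    hits = []
    for m in HIT_PATTERN.finditer(text):
        hits.append({
            "organism": m["organism"].strip(),
            "gi": m["gi"],
            "length": int(m["length"]),
            "identity": float(m["identity"]),
            "score": int(m["score"]),
            "evalue": float(m["evalue"]),
        })
    return hits


sample = (
    "Organism=Homo sapiens, GI18105007, Length=384, Percent_Identity=31.25, "
    "Blast_Score=159, Evalue=5e-39, "
    "Organism=Escherichia coli, GI1786722, Length=437, "
    "Percent_Identity=26.7734553775744, Blast_Score=162, Evalue=5e-41,"
)
hits = parse_homologues(sample)
best = min(hits, key=lambda h: h["evalue"])
print(best["organism"], best["gi"])  # Escherichia coli 1786722
```

Because whitespace between fields is matched with `\s*`, the same pattern works whether the entries sit on one line or on separate lines.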
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR006680
- InterPro: IPR004722
- InterPro: IPR002195
- InterPro: IPR011059 [H]
Pfam domain/function: PF01979 Amidohydro_1 [H]
EC number: 3.5.2.3 [H]
Molecular weight: Translated: 48444; Mature: 48313
Theoretical pI: Translated: 6.20; Mature: 6.20
Prosite motif: PS00483 DIHYDROOROTASE_2
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
0.9 %Cys (Translated Protein)
2.3 %Met (Translated Protein)
3.2 %Cys+Met (Translated Protein)
0.9 %Cys (Mature Protein)
2.0 %Met (Mature Protein)
2.9 %Cys+Met (Mature Protein)
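The Cys/Met percentages are residue counts divided by protein length. The sketch below shows the calculation on a short fragment of the listed protein, assuming one-decimal rounding; `residue_percent` is a hypothetical helper. The whole-protein totals in the second half (4 Cys and 10 Met in 443 residues, with one Met fewer in the mature form) are a hand count of the sequence above and should be treated as an assumption.

```python
def residue_percent(protein: str, residues: str) -> float:
    """Percentage of positions in `protein` occupied by any letter in `residues`."""
    protein = protein.upper()
    hits = sum(protein.count(r) for r in residues.upper())
    return round(100 * hits / len(protein), 1)


# Ten-residue fragment taken from the protein sequence in this record.
fragment = "TASRACVKGG"
print(residue_percent(fragment, "C"))   # 10.0
print(residue_percent(fragment, "CM"))  # 10.0

# Hand-counted totals (assumption): 4 Cys and 10 Met in the translated protein;
# the mature protein lacks the initiator Met, leaving 9 Met in 442 residues.
print(round(100 * 4 / 443, 1), round(100 * 10 / 443, 1), round(100 * 14 / 443, 1))  # 0.9 2.3 3.2
print(round(100 * 4 / 442, 1), round(100 * 9 / 442, 1), round(100 * 13 / 442, 1))   # 0.9 2.0 2.9
```

Under those counts the computed values agree with all six percentages listed in this field.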
Secondary structure:
>Translated Secondary Structure
MAPISSLLIRRARILLPDGTFLIGDVQTQGREIIQVAPEISSSEAPEKIIDAEGLTLLPG
CCCHHHHHHHHHHEECCCCCEEEECCCCCCCEEEEECCCCCCCCCCHHHHCCCCCEECCC
VIDPQVHFREPGLEHKEDLFTASRACVKGGVTSFLEMPNTKPLTTTQGALDDKLRRAEQK
CCCCCCEECCCCCCHHHHHHHHHHHHHHCCHHHHHHCCCCCCCCCCCCHHHHHHHHHHHH
CVANFGFFIGATAENLPDLLTANPTPGIKIFMGSMHGPLLVDTEEKLEPIFARGKRLIAV
HHHHCCCEEECCHHCCCHHHCCCCCCCEEEEEECCCCCEEECCHHHHHHHHHCCCEEEEE
HAENQARIDERKKQFAGISDPAIHSQIQDNEAALLATKMALKLSKKYERRLHILHTSTGD
ECCCCHHHHHHHHHHCCCCCCHHHHHHCCCCHHHHHHHHHHHHHHHHHHEEEEEECCCCC
EAELLRQDKPSWVTAEVTPQHLFLNTSAYEKIGTLAQMNPPLKSAGDNDILWRALLDGVI
HHHHHHCCCCCEEEEEECCCEEEECCHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHH
DFIATDHAPHTLAEKGKGYPNTPSGMPGVETSLPLMLTQAIEGRCSVAQVSNWMSTAVAK
HHHHCCCCCHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHH
GYGILKKGAIAPGFDADLVLVDLNNYRPVLREELMTKCRWSPFEGWSLTGWPVVTIVGGE
CCCHHHCCCCCCCCCCCEEEEECCCCCHHHHHHHHHHHCCCCCCCCCCCCCEEEEEECCE
VAFNRGEFNSEVRGRALIFSEIA
EEECCCCCCCCCCCCEEEEECCC
>Mature Secondary Structure
APISSLLIRRARILLPDGTFLIGDVQTQGREIIQVAPEISSSEAPEKIIDAEGLTLLPG
CCHHHHHHHHHHEECCCCCEEEECCCCCCCEEEEECCCCCCCCCCHHHHCCCCCEECCC
VIDPQVHFREPGLEHKEDLFTASRACVKGGVTSFLEMPNTKPLTTTQGALDDKLRRAEQK
CCCCCCEECCCCCCHHHHHHHHHHHHHHCCHHHHHHCCCCCCCCCCCCHHHHHHHHHHHH
CVANFGFFIGATAENLPDLLTANPTPGIKIFMGSMHGPLLVDTEEKLEPIFARGKRLIAV
HHHHCCCEEECCHHCCCHHHCCCCCCCEEEEEECCCCCEEECCHHHHHHHHHCCCEEEEE
HAENQARIDERKKQFAGISDPAIHSQIQDNEAALLATKMALKLSKKYERRLHILHTSTGD
ECCCCHHHHHHHHHHCCCCCCHHHHHHCCCCHHHHHHHHHHHHHHHHHHEEEEEECCCCC
EAELLRQDKPSWVTAEVTPQHLFLNTSAYEKIGTLAQMNPPLKSAGDNDILWRALLDGVI
HHHHHHCCCCCEEEEEECCCEEEECCHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHH
DFIATDHAPHTLAEKGKGYPNTPSGMPGVETSLPLMLTQAIEGRCSVAQVSNWMSTAVAK
HHHHCCCCCHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHH
GYGILKKGAIAPGFDADLVLVDLNNYRPVLREELMTKCRWSPFEGWSLTGWPVVTIVGGE
CCCHHHCCCCCCCCCCCEEEEECCCCCHHHHHHHHHHHCCCCCCCCCCCCCEEEEEECCE
VAFNRGEFNSEVRGRALIFSEIA
EEECCCCCCCCCCCCEEEEECCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 9.0
TargetDB status: NA
Availability: NA
References: 11932238 [H]