Definition | Methanosarcina mazei Go1 chromosome, complete genome. |
---|---|
Accession | NC_003901 |
Length | 4,096,345 |
The map label for this gene is argF [H]
Identifier: 161485684
GI number: 161485684
Start: 180118
End: 181026
Strand: Direct
Name: argF [H]
Synonym: MM_0145
Alternate gene names: 161485684
Gene position: 180118-181026 (Clockwise)
Preceding gene: 21226246
Following gene: 21226248
Centisome position: 4.4
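The coordinate fields above are related by simple arithmetic, which can be sketched as a consistency check. The function names are illustrative and not part of this record, and the centisome formula (start position as a percentage of chromosome length) is an assumed definition that is consistent with the listed value.

```python
def gene_length(start: int, end: int) -> int:
    """Length in bases of a gene given 1-based, inclusive coordinates."""
    return end - start + 1


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of chromosome length (assumed definition)."""
    return round(100.0 * start / genome_length, 1)


# Values copied from this record.
START, END, GENOME_LENGTH = 180118, 181026, 4096345

length = gene_length(START, END)   # bases in the coding region
codons = length // 3               # includes the terminal stop codon
residues = codons - 1              # protein length without the stop codon

print(length, residues, centisome(START, GENOME_LENGTH))  # 909 302 4.4
```

The results agree with the 909-base gene sequence, the 302-residue protein, and the centisome position of 4.4 reported in this record.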
GC content: 49.39
Gene sequence:
>909_bases
ATGAAGAGAGATGTACTTTCTATAACTGACCTGTCCAGGGAGGAGATATACGAACTCCTTGAATCGGCAGCGGACCTGAA
GAAAAAACGTAAGGCGGGAGAACCTACCGAGTACCTGAAACATAAAAGCCTTGGAATGATTTTCGAAAAATCCTCCACAA
GGACAAGGGTATCCTTTGAGGTTGCAATGAGCGATTTCGGAGGACACGCCCTTTACCTTAACTCCAGGGATATCCAGGTA
GGAAGAGGTGAGACAATTGAAGATACAGCCAGGACCCTTTCGGGATACCTGCACGGGATCATGGCAAGGGTTATGAGCCA
TGACACCGTGGAAAAGCTTGCCAGATTCTCGACCATACCCGTGATAAATGCCCTCTCGGACCGGGAACACCCATGCCAGA
TCCTTGGCGATTTCATGACCATCATGGAATATAAAAACAGGTTTGAAGGCCTGAAGTTCGCCTGGATAGGAGACGGGAAC
AATGTCTGCAATTCCGCTCTCCTGGGCTCTGCAATCATGGGAATGGAGTTTGTCATTGCCTGCCCTGAAGGTTACGAACC
CGGAGCCGAGTTCCTCGAGAAAGCGAAAGCCCTGGGAGGCAAATTCTCAATAACAGATGACCCCAAAACTGCCGCAAAAG
ACGCAGATATAATCTATACTGATGTCTGGGTCTCCATGGGGGACGAAGCAGAGCAGGAAAAGCGCCTGAAAGATTTTGGT
TCTTTCCAGGTCAACACCGAACTCCTCGGAGTCGCAAAACCGGATGTAATAGTAATGCACTGCCTTCCTGCCAGGAGAGG
CCTTGAAATCACGGACGAAGTTATGGACGGCCCAAACTCCGTGATCTTCGAAGAGGCAGAAAACCGCCTTCACGCGCAGA
AAGCCCTTATCCTGAAATTGATGAGATAA
Upstream 100 bases:
>100_bases
GCCTGGAAACAAAACAGAGCCTGGAAATAAAGCCCAGATTTAAAGTTTGGTTTCAGCCAGAAGTTTAAACTCAGCCGTAA
AGAAGTCGGGATTTGAGACG
Downstream 100 bases:
>100_bases
AATTTCATCAATTTCTCCTCTATATTTCTCCTCTATGATTGAGAGGACTCATATGAAATAAATATATATTAGAAGTTATC
GGGTTTTTTCACCTTCCGAG
Product: ornithine carbamoyltransferase
Products: NA
Alternate protein names: OTCase [H]
Number of amino acids: Translated: 302; Mature: 302
Protein sequence:
>302_residues
MKRDVLSITDLSREEIYELLESAADLKKKRKAGEPTEYLKHKSLGMIFEKSSTRTRVSFEVAMSDFGGHALYLNSRDIQV
GRGETIEDTARTLSGYLHGIMARVMSHDTVEKLARFSTIPVINALSDREHPCQILGDFMTIMEYKNRFEGLKFAWIGDGN
NVCNSALLGSAIMGMEFVIACPEGYEPGAEFLEKAKALGGKFSITDDPKTAAKDADIIYTDVWVSMGDEAEQEKRLKDFG
SFQVNTELLGVAKPDVIVMHCLPARRGLEITDEVMDGPNSVIFEEAENRLHAQKALILKLMR
Sequences:
>Translated_302_residues
MKRDVLSITDLSREEIYELLESAADLKKKRKAGEPTEYLKHKSLGMIFEKSSTRTRVSFEVAMSDFGGHALYLNSRDIQV
GRGETIEDTARTLSGYLHGIMARVMSHDTVEKLARFSTIPVINALSDREHPCQILGDFMTIMEYKNRFEGLKFAWIGDGN
NVCNSALLGSAIMGMEFVIACPEGYEPGAEFLEKAKALGGKFSITDDPKTAAKDADIIYTDVWVSMGDEAEQEKRLKDFG
SFQVNTELLGVAKPDVIVMHCLPARRGLEITDEVMDGPNSVIFEEAENRLHAQKALILKLMR
>Mature_302_residues
MKRDVLSITDLSREEIYELLESAADLKKKRKAGEPTEYLKHKSLGMIFEKSSTRTRVSFEVAMSDFGGHALYLNSRDIQV
GRGETIEDTARTLSGYLHGIMARVMSHDTVEKLARFSTIPVINALSDREHPCQILGDFMTIMEYKNRFEGLKFAWIGDGN
NVCNSALLGSAIMGMEFVIACPEGYEPGAEFLEKAKALGGKFSITDDPKTAAKDADIIYTDVWVSMGDEAEQEKRLKDFG
SFQVNTELLGVAKPDVIVMHCLPARRGLEITDEVMDGPNSVIFEEAENRLHAQKALILKLMR
Specific function: Arginine biosynthesis; sixth step. [C]
COG id: COG0078
COG function: function code E; Ornithine carbamoyltransferase
Gene ontology:
Cell location: Cytoplasm (Probable) [H]
Metabolic importance: Non_Essential [C]
Operon status: Not Known
Operon components: None
Similarity: Belongs to the ATCase/OTCase family [H]
Homologues:
Organism | GI | Length | Percent identity | BLAST score | E-value |
---|---|---|---|---|---|
Homo sapiens | GI38788445 | 304 | 42.4342105263158 | 253 | 1e-67 |
Homo sapiens | GI18105007 | 306 | 27.7777777777778 | 96 | 4e-20 |
Escherichia coli | GI1786469 | 324 | 36.4197530864198 | 209 | 2e-55 |
Escherichia coli | GI1790703 | 323 | 35.9133126934984 | 207 | 6e-55 |
Escherichia coli | GI2367364 | 308 | 31.4935064935065 | 110 | 1e-25 |
Caenorhabditis elegans | GI193204318 | 326 | 27.6073619631902 | 80 | 1e-15 |
Saccharomyces cerevisiae | GI6322373 | 314 | 37.8980891719745 | 217 | 2e-57 |
Saccharomyces cerevisiae | GI6322331 | 306 | 30.718954248366 | 103 | 3e-23 |
Drosophila melanogaster | GI24642586 | 304 | 28.9473684210526 | 98 | 8e-21 |
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR006132
- InterPro: IPR006130
- InterPro: IPR006131
- InterPro: IPR002292 [H]
Pfam domain/function: PF00185 OTCace; PF02729 OTCace_N [H]
EC number: 2.1.3.3 [H]
Molecular weight: Translated: 33707; Mature: 33707
Theoretical pI: Translated: 5.16; Mature: 5.16
Prosite motif: PS00097 CARBAMOYLTRANSFERASE
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.3 %Cys (Translated Protein)
4.3 %Met (Translated Protein)
5.6 %Cys+Met (Translated Protein)
1.3 %Cys (Mature Protein)
4.3 %Met (Mature Protein)
5.6 %Cys+Met (Mature Protein)
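The percentages above are residue counts divided by protein length. A minimal sketch of that calculation follows; the function name `cys_met_content` is hypothetical, and rounding to one decimal place is an assumption chosen to match how the values are displayed here. Applied to the 302-residue sequence in this record, it should give the listed values.

```python
def cys_met_content(protein: str) -> dict:
    """Percent Cys, Met, and Cys+Met in a one-letter amino acid sequence."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return {
        "Cys": round(100.0 * cys / n, 1),
        "Met": round(100.0 * met / n, 1),
        "Cys+Met": round(100.0 * (cys + met) / n, 1),
    }


# Toy 10-residue input (not from this record): 1 Cys and 2 Met.
print(cys_met_content("MKCAMLLLLL"))
```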
Secondary structure:
>Translated Secondary Structure
MKRDVLSITDLSREEIYELLESAADLKKKRKAGEPTEYLKHKSLGMIFEKSSTRTRVSFE
CCCCCCHHHCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCEEEECCCCCEEEEEE
VAMSDFGGHALYLNSRDIQVGRGETIEDTARTLSGYLHGIMARVMSHDTVEKLARFSTIP
EEECCCCCEEEEEECCCEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCH
VINALSDREHPCQILGDFMTIMEYKNRFEGLKFAWIGDGNNVCNSALLGSAIMGMEFVIA
HHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHCCEEEEE
CPEGYEPGAEFLEKAKALGGKFSITDDPKTAAKDADIIYTDVWVSMGDEAEQEKRLKDFG
CCCCCCHHHHHHHHHHHCCCCEEECCCCCHHCCCCCEEEEEEEEECCCHHHHHHHHHHCC
SFQVNTELLGVAKPDVIVMHCLPARRGLEITDEVMDGPNSVIFEEAENRLHAQKALILKL
CEEECCEEEECCCCCEEEEEECCCCCCCCCCHHHHCCCCCEEHHHHHHHHHHHHHHHHHH
MR
CC
>Mature Secondary Structure
MKRDVLSITDLSREEIYELLESAADLKKKRKAGEPTEYLKHKSLGMIFEKSSTRTRVSFE
CCCCCCHHHCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCEEEECCCCCEEEEEE
VAMSDFGGHALYLNSRDIQVGRGETIEDTARTLSGYLHGIMARVMSHDTVEKLARFSTIP
EEECCCCCEEEEEECCCEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCH
VINALSDREHPCQILGDFMTIMEYKNRFEGLKFAWIGDGNNVCNSALLGSAIMGMEFVIA
HHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHCCEEEEE
CPEGYEPGAEFLEKAKALGGKFSITDDPKTAAKDADIIYTDVWVSMGDEAEQEKRLKDFG
CCCCCCHHHHHHHHHHHCCCCEEECCCCCHHCCCCCEEEEEEEEECCCHHHHHHHHHHCC
SFQVNTELLGVAKPDVIVMHCLPARRGLEITDEVMDGPNSVIFEEAENRLHAQKALILKL
CEEECCEEEECCCCCEEEEEECCCCCCCCCCHHHHCCCCCEEHHHHHHHHHHHHHHHHHH
MR
CC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 11932238 [H]