| Definition | Prochlorococcus marinus str. MIT 9312, complete genome. |
|---|---|
| Accession | NC_007577 |
| Length | 1,709,204 |
The map label for this gene is yafV [C]
Identifier: 78779000
GI number: 78779000
Start: 580616
End: 581443
Strand: Direct
Name: yafV [C]
Synonym: PMT9312_0615
Alternate gene names: 78779000
Gene position: 580616-581443 (Clockwise)
Preceding gene: 78778999
Following gene: 78779001
Centisome position: 33.97
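The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes the centisome position is the gene start expressed as a percentage of the genome length. This record does not state that definition, but it reproduces the listed value. The function names are illustrative only.

```python
# Values copied from this record.
GENOME_LENGTH = 1_709_204  # Length field
GENE_START = 580_616       # Start field
GENE_END = 581_443         # End field


def centisome(start: int, genome_length: int) -> float:
    """Start position as a percentage of genome length (assumed definition)."""
    return round(100 * start / genome_length, 2)


def span_length(start: int, end: int) -> int:
    """Number of bases in an inclusive, 1-based coordinate range."""
    return end - start + 1


print(centisome(GENE_START, GENOME_LENGTH))  # 33.97
print(span_length(GENE_START, GENE_END))     # 828
```

Both results agree with the record: a centisome position of 33.97 and the 828 bases given in the gene sequence header.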
GC content: 38.89
Gene sequence:
>828_bases
TTGACTGATTTTTTGGTGGCTGCATTGCAAATTACAAGCACTTCTAATGTAGAAGCAAATTTTGTTGAAGCGGAGGAACA
GATTGAATTAGCAGCTCGAAGAGGTGCTGAGTTAATCGGATTGCCTGAGAATTTTGCTTTTTTAGGAGAAGATGACGAAA
AACTTAGATTAGCTCCTGAATTGTCAATGAAGTGTACAAACTTCCTAAAAACTATGTCACAGAGATATCAAGTTTTTCTA
TTGGGAGGAGGATATCCTGTTCCAGCTGGTGATGATAGACATACTTTAAATAGATCAGCACTCTTTGGAAGAGATGGACA
GGTTTTGGCAAAATATGACAAAATCCATTTGTTCGATGTTGATTTGCCAGACGGAAATTTATATAAGGAATCATCTACTA
TTTTATCTGGGGAGGAGTATCCACCTGTTGTAGATGTCCCAGGTTTATGCAAAATAGGGTTATCGATTTGTTACGACGTT
AGATTCCCTGAACTTTATAGATATTTGTCTTCTAATGGTGCAGAGCTAATTATGATTCCCGCAGCTTTTACAGCATTTAC
TGGAAAAGATCATTGGCAAATCCTATTACAAGCAAGAGCGATTGAGAATACAGCATATGTAGTTGCTCCAGCGCAAACTG
GGGTTCATTATGGAAGAAGGCAAAGTCATGGCCATGCAATGGTAATTGACCCATGGGGCACTGTTTTGTCTGATGCTGGG
AAAACTCAGGGGGCCGCAATAGCGCCTGCGGATAAAAAAAGAGTAAAGAAGATTAGGGAGCAGATGCCAAGCCTTAAACA
TAGAAAAAATAAATTGTTTTCAAACTAA
Upstream 100 bases:
>100_bases
TTGCTCAGAACTTGATTGCTTAGATATTGTTCCTGCTCAAGTTGAAAGAGGTGTGATTCGTGCCTCATGATTTACGAATT
GGTTTATTAGGAGTACTGTT
Downstream 100 bases:
>100_bases
TGATAAAGTTTTTAGATAATAAACTTTTTCGTTATATATCCGTTTTTTTATTTTTAAATTCTGCAATCCTCCCTTTGAAA
TCTTCAAGTGCTCTGGCAGC
Product: putative nitrilase
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 275; Mature: 274
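The two residue counts follow from the 828-base gene length. A minimal sketch, assuming the final codon is a stop codon (the gene sequence ends in TAA) and that the mature form differs only by removal of the initiator methionine, as the Mature sequence in this record suggests. The function name is hypothetical.

```python
GENE_BASES = 828  # from the gene sequence header in this record


def residue_counts(n_bases: int) -> tuple[int, int]:
    """Return (translated, mature) residue counts for a coding sequence.

    Assumes one terminal stop codon and removal of the initiator Met
    in the mature protein.
    """
    if n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = n_bases // 3
    translated = codons - 1   # the stop codon encodes no residue
    mature = translated - 1   # initiator Met removed
    return translated, mature


print(residue_counts(GENE_BASES))  # (275, 274)
```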
Protein sequence:
>275_residues
MTDFLVAALQITSTSNVEANFVEAEEQIELAARRGAELIGLPENFAFLGEDDEKLRLAPELSMKCTNFLKTMSQRYQVFL
LGGGYPVPAGDDRHTLNRSALFGRDGQVLAKYDKIHLFDVDLPDGNLYKESSTILSGEEYPPVVDVPGLCKIGLSICYDV
RFPELYRYLSSNGAELIMIPAAFTAFTGKDHWQILLQARAIENTAYVVAPAQTGVHYGRRQSHGHAMVIDPWGTVLSDAG
KTQGAAIAPADKKRVKKIREQMPSLKHRKNKLFSN
Sequences:
>Translated_275_residues
MTDFLVAALQITSTSNVEANFVEAEEQIELAARRGAELIGLPENFAFLGEDDEKLRLAPELSMKCTNFLKTMSQRYQVFL
LGGGYPVPAGDDRHTLNRSALFGRDGQVLAKYDKIHLFDVDLPDGNLYKESSTILSGEEYPPVVDVPGLCKIGLSICYDV
RFPELYRYLSSNGAELIMIPAAFTAFTGKDHWQILLQARAIENTAYVVAPAQTGVHYGRRQSHGHAMVIDPWGTVLSDAG
KTQGAAIAPADKKRVKKIREQMPSLKHRKNKLFSN
>Mature_274_residues
TDFLVAALQITSTSNVEANFVEAEEQIELAARRGAELIGLPENFAFLGEDDEKLRLAPELSMKCTNFLKTMSQRYQVFLL
GGGYPVPAGDDRHTLNRSALFGRDGQVLAKYDKIHLFDVDLPDGNLYKESSTILSGEEYPPVVDVPGLCKIGLSICYDVR
FPELYRYLSSNGAELIMIPAAFTAFTGKDHWQILLQARAIENTAYVVAPAQTGVHYGRRQSHGHAMVIDPWGTVLSDAGK
TQGAAIAPADKKRVKKIREQMPSLKHRKNKLFSN
Specific function: Unknown
COG id: COG0388
COG function: function code R; Predicted amidohydrolase
Gene ontology:
Cell location: Cytoplasm [C]
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 1 CN hydrolase domain [H]
Homologues:
Organism=Homo sapiens, GI297632348, Length=275, Percent_Identity=39.2727272727273, Blast_Score=196, Evalue=2e-50
Organism=Homo sapiens, GI297632350, Length=275, Percent_Identity=39.2727272727273, Blast_Score=195, Evalue=3e-50
Organism=Homo sapiens, GI5031947, Length=275, Percent_Identity=39.2727272727273, Blast_Score=195, Evalue=3e-50
Organism=Homo sapiens, GI9910460, Length=274, Percent_Identity=33.5766423357664, Blast_Score=174, Evalue=9e-44
Organism=Homo sapiens, GI297632346, Length=193, Percent_Identity=37.3056994818653, Blast_Score=114, Evalue=9e-26
Organism=Caenorhabditis elegans, GI17556280, Length=272, Percent_Identity=39.7058823529412, Blast_Score=207, Evalue=5e-54
Organism=Saccharomyces cerevisiae, GI6322335, Length=303, Percent_Identity=33.003300330033, Blast_Score=166, Evalue=3e-42
Organism=Saccharomyces cerevisiae, GI6323383, Length=282, Percent_Identity=31.5602836879433, Blast_Score=146, Evalue=4e-36
Organism=Drosophila melanogaster, GI17933642, Length=271, Percent_Identity=35.4243542435424, Blast_Score=164, Evalue=4e-41
Organism=Drosophila melanogaster, GI21355835, Length=276, Percent_Identity=31.5217391304348, Blast_Score=147, Evalue=8e-36
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): NA
Other databases:
- InterPro: IPR003010
- InterPro: IPR001110 [H]
Pfam domain/function: PF00795 CN_hydrolase [H]
EC number: 3.5.-.- [C]
Molecular weight: Translated: 30474; Mature: 30343
Theoretical pI: Translated: 6.88; Mature: 6.88
Prosite motif: PS50263 CN_HYDROLASE ; PS01227 UPF0012
Important sites: NA
Signals:
None
Transmembrane regions:
None
Cys/Met content:
1.1 %Cys (Translated Protein)
2.2 %Met (Translated Protein)
3.3 %Cys+Met (Translated Protein)
1.1 %Cys (Mature Protein)
1.8 %Met (Mature Protein)
2.9 %Cys+Met (Mature Protein)
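The Cys/Met percentages can be recomputed from the protein sequence given in this record. The sketch below assumes each value is simply the residue count divided by the sequence length, rounded to one decimal place, and that the mature protein is the translated sequence without its first residue. The helper name is illustrative.

```python
# Translated protein sequence as listed under "Protein sequence" in this record.
TRANSLATED = (
    "MTDFLVAALQITSTSNVEANFVEAEEQIELAARRGAELIGLPENFAFLGEDDEKLRLAPELSMKCTNFLKTMSQRYQVFL"
    "LGGGYPVPAGDDRHTLNRSALFGRDGQVLAKYDKIHLFDVDLPDGNLYKESSTILSGEEYPPVVDVPGLCKIGLSICYDV"
    "RFPELYRYLSSNGAELIMIPAAFTAFTGKDHWQILLQARAIENTAYVVAPAQTGVHYGRRQSHGHAMVIDPWGTVLSDAG"
    "KTQGAAIAPADKKRVKKIREQMPSLKHRKNKLFSN"
)
MATURE = TRANSLATED[1:]  # initiator Met removed (assumption matching the Mature entry)


def cys_met_percent(protein: str) -> tuple[float, float, float]:
    """Return (%Cys, %Met, %Cys+Met) of a protein, rounded to one decimal."""
    n = len(protein)
    cys = protein.count("C")
    met = protein.count("M")
    return (
        round(100 * cys / n, 1),
        round(100 * met / n, 1),
        round(100 * (cys + met) / n, 1),
    )


print(cys_met_percent(TRANSLATED))  # (1.1, 2.2, 3.3)
print(cys_met_percent(MATURE))      # (1.1, 1.8, 2.9)
```

The sequence contains 3 Cys and 6 Met residues (5 Met in the mature form), which reproduces the six values listed above.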
Secondary structure:
>Translated Secondary Structure
MTDFLVAALQITSTSNVEANFVEAEEQIELAARRGAELIGLPENFAFLGEDDEKLRLAPE
CCCCEEEEEEEECCCCCEEEEECHHHHHHHHHHCCCEEEECCCCCEEECCCCCEEEECCC
LSMKCTNFLKTMSQRYQVFLLGGGYPVPAGDDRHTLNRSALFGRDGQVLAKYDKIHLFDV
HHHHHHHHHHHHHHCEEEEEEECCCCCCCCCCCCCCCCHHEECCCCCEEEEECEEEEEEE
DLPDGNLYKESSTILSGEEYPPVVDVPGLCKIGLSICYDVRFPELYRYLSSNGAELIMIP
ECCCCCCEECCCCEECCCCCCCEECCCCHHHHHHHHEEECCCHHHHHHHHCCCCEEEEEE
AAFTAFTGKDHWQILLQARAIENTAYVVAPAQTGVHYGRRQSHGHAMVIDPWGTVLSDAG
CEEEECCCCCHHEEEEEHEEECCEEEEEEECHHCCCCCCCCCCCCEEEECCCHHHHHCCC
KTQGAAIAPADKKRVKKIREQMPSLKHRKNKLFSN
CCCCCEECCCHHHHHHHHHHHCCHHHHHHHHCCCC
>Mature Secondary Structure
TDFLVAALQITSTSNVEANFVEAEEQIELAARRGAELIGLPENFAFLGEDDEKLRLAPE
CCCEEEEEEEECCCCCEEEEECHHHHHHHHHHCCCEEEECCCCCEEECCCCCEEEECCC
LSMKCTNFLKTMSQRYQVFLLGGGYPVPAGDDRHTLNRSALFGRDGQVLAKYDKIHLFDV
HHHHHHHHHHHHHHCEEEEEEECCCCCCCCCCCCCCCCHHEECCCCCEEEEECEEEEEEE
DLPDGNLYKESSTILSGEEYPPVVDVPGLCKIGLSICYDVRFPELYRYLSSNGAELIMIP
ECCCCCCEECCCCEECCCCCCCEECCCCHHHHHHHHEEECCCHHHHHHHHCCCCEEEEEE
AAFTAFTGKDHWQILLQARAIENTAYVVAPAQTGVHYGRRQSHGHAMVIDPWGTVLSDAG
CEEEECCCCCHHEEEEEHEEECCEEEEEEECHHCCCCCCCCCCCCEEEECCCHHHHHCCC
KTQGAAIAPADKKRVKKIREQMPSLKHRKNKLFSN
CCCCCEECCCHHHHHHHHHHHCCHHHHHHHHCCCC
PDB accession: NA
Resolution: NA
Structure class: Alpha Beta
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 10.0
TargetDB status: NA
Availability: NA
References: 8590279; 8905231 [H]