| Definition | Buchnera aphidicola str. Sg (Schizaphis graminum), complete genome. |
|---|---|
| Accession | NC_004061 |
| Length | 641,454 |
Map label: yoaE [C]
Identifier: 21672590
GI number: 21672590
Start: 360592
End: 362139
Strand: Direct
Name: yoaE [C]
Synonym: BUsg314
Alternate gene names: 21672590
Gene position: 360592-362139 (Clockwise)
Preceding gene: 21672589
Following gene: 21672591
Centisome position: 56.21
GC content: 28.68
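The two derived fields above can be rechecked from the record itself. The sketch below assumes that the centisome position is the gene start expressed as a percentage of the genome length (this assumption reproduces the listed 56.21) and that GC content is the percentage of G and C bases in the gene sequence. The function names `centisome` and `gc_content` are illustrative and are not part of the database.

```python
def centisome(start: int, genome_length: int) -> float:
    """Gene start as a percentage of the genome length, rounded to 2 decimals.

    Assumption: this is how the record derives its 'Centisome position'.
    """
    return round(100.0 * start / genome_length, 2)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace is ignored, since the record wraps sequences with spaces.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return round(100.0 * gc / len(bases), 2)


# Start 360592 on a 641,454 bp genome, as listed in this record.
print(centisome(360592, 641454))  # 56.21

# First ten bases of the gene sequence, used only as a short demonstration.
print(gc_content("ATGGAGTTTT"))
```

Passing the full 1548-base gene sequence to `gc_content` would be the way to compare against the listed GC content.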
Gene sequence:
>1548_bases ATGGAGTTTTTTTTAGACCCGTCAATTTGGGCCGGCTTATTAACATTAGTTGTTTTAGAAGTAGTATTAGGGATTGATAA TTTAATATTTGTAGCAATTTTATCAGAAAAATTACCTCCTAATCAAAGAGATAAGGCACGTTTAATTGGTTTAGGACTGG CTTTAATTATGCGATTAGCGTTATTATCATTGATATCTTGGGTAGTAACACTCACTTCTCCTATTATTAGTAATAATTTT TTTTCTTTGTCAATACGTGATTTAATATTACTTATTGGTGGTTTATTTCTTTTATTTAAAGCTACAATTGAATTACATGA AAGACTAGAAAATGAAGACCATGAAAATACAGAAAATAAAAATTATGCCAGTTTTTGGGCTGTAGTTATTCAAATAGTTG TATTAGATGCAGTGTTTTCCTTAGATGCAATAATTACAGCAGTGGGCATGGTAAATCAATTATTAATCATGATGATAGCA GTTGTATTAGCTACAATATTAATGTTATTAGCATCGAAAGCATTAACAAATTTTATTAATATACATCAAACTGTAGTTGT ATTATGCCTTAGTTTCTTATTAATGATTGGTTTTAGTTTAGTCGCAGAAGCTTTAAAGTTTTATATTCCAAAAGGATATT TATATGCAGCAATAGGTTTTTCTATTTTAATCGAGATTTTTAATCAAATTGCTCGTCATAATTTTATGAAAAATCAATCT AGAAAACCTATGAGACAAAGAGCAGCTGAAGCAATTTTACGTTTAATGATAAGAGAAAAGAATAACAATAAAAATAGAAT AAAAACTGATAATAAAGCAGAAATAGTACTTTCATCTTCCTTAGAAACAGAAACTTTTAAAGATGAAGAAAAATATATGA TTAATGGAGTTCTTACTTTAGCCGGTCGATCAATTAAAAGTATTATGACTCCACGAAGTAATATATCTTGGGTAAATACA GAAAAAACAATCAATGAAATTCGATTACAATTATTAGATACACCTCATAATTTATTTCCTGTTTGTAAAGGTGAATTAGA TGAAATAATCGGTATTGTACGAGCTAAGGAATTATTAGTCGCTATTGAAAAAAATATAGACGTTTATACATTTGCCTCTC AAATACCACCTATTATTATACCAGATACTCTTGATCCTATAAATTTACTTGGAGTACTTCGTCGTGCTCAAGGTAGTTTT GTAATTGTCAGTAATGAATTCGGTGTTGTTCAAGGATTAATTACACCTTTAGATGTTTTAGAAGCTATAGCAGGTGAATT TCCAGACGCAGATGAAACTCCAGATATTATAAAAGAACAAAACAGCTGGTTAGTTAAGGGAGAAACAGATTTACATTCTT TACAACAATTACTTAATACTAAAGAATTAATTAAACAAGATGACTGTGCTTCTTTAGGAGGATTGCTAATTTCTCAAAAA GGTCAATTACCTCTCCCAGGAGAAACAATTAAGATTAATTCTTTTTCTTTTCACATTGTTAATGCTACAGAATATCGTAT CGATTTAGTGAGAATAACTAAAAATTAA
Upstream 100 bases:
>100_bases GATATAATAAAATATTATATAATCTCAAATTACATTATATCTTAAATATTTAGATATATTAAATATTGGGCTGTCCTATT TTTTTTACGGAGTTTTTCTG
Downstream 100 bases:
>100_bases GATATAAATAGCAAGTTTTCTAAGTAATTTTTTGTTTAAAAAAATTTTTTACTGAACTTATTTCGAGTAAGATATGTCTG ACATAATTTTAGCCATTGAT
Product: hypothetical protein
Products: NA
Alternate protein names: NA
Number of amino acids: Translated: 515; Mature: 515
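The residue count is consistent with the gene coordinates: positions 360592 to 362139 span 1548 bases, or 516 codons, the last of which is the TAA stop codon. A minimal sketch of that arithmetic follows, assuming inclusive 1-based coordinates and a single terminal stop codon; the function name `protein_length` is hypothetical.

```python
def protein_length(start: int, end: int) -> int:
    """Number of residues encoded between inclusive gene coordinates.

    Assumes the span is a whole number of codons and ends in one stop codon.
    """
    n_bases = end - start + 1
    if n_bases % 3 != 0:
        raise ValueError("gene span is not a multiple of three bases")
    return n_bases // 3 - 1  # drop the terminal stop codon


print(protein_length(360592, 362139))  # 515
```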
Protein sequence:
>515_residues MEFFLDPSIWAGLLTLVVLEVVLGIDNLIFVAILSEKLPPNQRDKARLIGLGLALIMRLALLSLISWVVTLTSPIISNNF FSLSIRDLILLIGGLFLLFKATIELHERLENEDHENTENKNYASFWAVVIQIVVLDAVFSLDAIITAVGMVNQLLIMMIA VVLATILMLLASKALTNFINIHQTVVVLCLSFLLMIGFSLVAEALKFYIPKGYLYAAIGFSILIEIFNQIARHNFMKNQS RKPMRQRAAEAILRLMIREKNNNKNRIKTDNKAEIVLSSSLETETFKDEEKYMINGVLTLAGRSIKSIMTPRSNISWVNT EKTINEIRLQLLDTPHNLFPVCKGELDEIIGIVRAKELLVAIEKNIDVYTFASQIPPIIIPDTLDPINLLGVLRRAQGSF VIVSNEFGVVQGLITPLDVLEAIAGEFPDADETPDIIKEQNSWLVKGETDLHSLQQLLNTKELIKQDDCASLGGLLISQK GQLPLPGETIKINSFSFHIVNATEYRIDLVRITKN
Sequences:
>Translated_515_residues MEFFLDPSIWAGLLTLVVLEVVLGIDNLIFVAILSEKLPPNQRDKARLIGLGLALIMRLALLSLISWVVTLTSPIISNNF FSLSIRDLILLIGGLFLLFKATIELHERLENEDHENTENKNYASFWAVVIQIVVLDAVFSLDAIITAVGMVNQLLIMMIA VVLATILMLLASKALTNFINIHQTVVVLCLSFLLMIGFSLVAEALKFYIPKGYLYAAIGFSILIEIFNQIARHNFMKNQS RKPMRQRAAEAILRLMIREKNNNKNRIKTDNKAEIVLSSSLETETFKDEEKYMINGVLTLAGRSIKSIMTPRSNISWVNT EKTINEIRLQLLDTPHNLFPVCKGELDEIIGIVRAKELLVAIEKNIDVYTFASQIPPIIIPDTLDPINLLGVLRRAQGSF VIVSNEFGVVQGLITPLDVLEAIAGEFPDADETPDIIKEQNSWLVKGETDLHSLQQLLNTKELIKQDDCASLGGLLISQK GQLPLPGETIKINSFSFHIVNATEYRIDLVRITKN
>Mature_515_residues MEFFLDPSIWAGLLTLVVLEVVLGIDNLIFVAILSEKLPPNQRDKARLIGLGLALIMRLALLSLISWVVTLTSPIISNNF FSLSIRDLILLIGGLFLLFKATIELHERLENEDHENTENKNYASFWAVVIQIVVLDAVFSLDAIITAVGMVNQLLIMMIA VVLATILMLLASKALTNFINIHQTVVVLCLSFLLMIGFSLVAEALKFYIPKGYLYAAIGFSILIEIFNQIARHNFMKNQS RKPMRQRAAEAILRLMIREKNNNKNRIKTDNKAEIVLSSSLETETFKDEEKYMINGVLTLAGRSIKSIMTPRSNISWVNT EKTINEIRLQLLDTPHNLFPVCKGELDEIIGIVRAKELLVAIEKNIDVYTFASQIPPIIIPDTLDPINLLGVLRRAQGSF VIVSNEFGVVQGLITPLDVLEAIAGEFPDADETPDIIKEQNSWLVKGETDLHSLQQLLNTKELIKQDDCASLGGLLISQK GQLPLPGETIKINSFSFHIVNATEYRIDLVRITKN
Specific function: Unknown
COG id: COG1253
COG function: function code R; Hemolysins and related proteins containing CBS domains
Gene ontology:
Cell location: Cell membrane; Multi-pass membrane protein (Potential)
Metabolic importance: Unknown [C]
Operon status: Not Known
Operon components: None
Similarity: Contains 2 CBS domains
Homologues:
- Organism=Homo sapiens, GI310128564, Length=243, Percent_Identity=25.1028806584362, Blast_Score=68, Evalue=2e-11
- Organism=Escherichia coli, GI1788119, Length=515, Percent_Identity=66.4077669902913, Blast_Score=689, Evalue=0.0
- Organism=Escherichia coli, GI87082033, Length=523, Percent_Identity=45.3154875717017, Blast_Score=404, Evalue=1e-113
- Organism=Escherichia coli, GI1789197, Length=225, Percent_Identity=47.5555555555556, Blast_Score=188, Evalue=8e-49
- Organism=Escherichia coli, GI1790664, Length=254, Percent_Identity=26.7716535433071, Blast_Score=118, Evalue=7e-28
- Organism=Escherichia coli, GI1786879, Length=244, Percent_Identity=26.6393442622951, Blast_Score=94, Evalue=1e-20
- Organism=Escherichia coli, GI145693175, Length=281, Percent_Identity=25.2669039145907, Blast_Score=75, Evalue=1e-14
Paralogues:
None
Copy number: NA
Swissprot (AC and ID): Y314_BUCAP (Q8K9M0)
Other databases:
- EMBL: AE013218
- RefSeq: NP_660657.1
- ProteinModelPortal: Q8K9M0
- SMR: Q8K9M0
- EnsemblBacteria: EBBUCT00000000463
- GeneID: 1005519
- GenomeReviews: AE013218_GR
- KEGG: bas:BUsg314
- GeneTree: EBGT00050000007724
- HOGENOM: HBG470183
- OMA: QIARHNF
- ProtClustDB: CLSK866497
- BioCyc: BAPH198804:BUSG314-MONOMER
- InterPro: IPR016169
- InterPro: IPR000644
- InterPro: IPR005496
- InterPro: IPR005170
- Gene3D: G3DSA:3.30.465.10
- SMART: SM00116
Pfam domain/function: PF00571 CBS; PF03471 CorC_HlyC; PF03741 TerC
EC number: NA
Molecular weight: Translated: 57609; Mature: 57609
Theoretical pI: Translated: 5.53; Mature: 5.53
Prosite motif: PS51371 CBS; PS00027 HOMEOBOX_1
Important sites: NA
Signals:
None
Transmembrane regions:
7 predicted regions (coordinates not available)
Cys/Met content:
- 0.6 %Cys (Translated Protein)
- 2.3 %Met (Translated Protein)
- 2.9 %Cys+Met (Translated Protein)
- 0.6 %Cys (Mature Protein)
- 2.3 %Met (Mature Protein)
- 2.9 %Cys+Met (Mature Protein)
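The Cys/Met percentages are simple residue frequencies. The sketch below assumes each value is the residue count divided by the protein length, rounded to one decimal place; the function name `cys_met_content` and the short test peptide are hypothetical and are not taken from the database.

```python
def cys_met_content(protein: str) -> dict:
    """Percentage of Cys (C), Met (M), and Cys+Met residues in a protein.

    Whitespace is ignored, since the record wraps sequences with spaces.
    """
    residues = "".join(protein.split()).upper()
    if not residues:
        raise ValueError("empty sequence")
    n_cys = residues.count("C")
    n_met = residues.count("M")
    total = len(residues)
    return {
        "Cys": round(100.0 * n_cys / total, 1),
        "Met": round(100.0 * n_met / total, 1),
        "Cys+Met": round(100.0 * (n_cys + n_met) / total, 1),
    }


# Hypothetical four-residue peptide, used only to show the calculation.
print(cys_met_content("MCAM"))
```

Running the function on the 515-residue sequence listed under "Protein sequence" would be the way to compare against the values in this record.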
Secondary structure:
>Translated Secondary Structure
MEFFLDPSIWAGLLTLVVLEVVLGIDNLIFVAILSEKLPPNQRDKARLIGLGLALIMRLA
CCCEECHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHH
LLSLISWVVTLTSPIISNNFFSLSIRDLILLIGGLFLLFKATIELHERLENEDHENTENK
HHHHHHHHHHHHHHHHCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC
NYASFWAVVIQIVVLDAVFSLDAIITAVGMVNQLLIMMIAVVLATILMLLASKALTNFIN
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
IHQTVVVLCLSFLLMIGFSLVAEALKFYIPKGYLYAAIGFSILIEIFNQIARHNFMKNQS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC
RKPMRQRAAEAILRLMIREKNNNKNRIKTDNKAEIVLSSSLETETFKDEEKYMINGVLTL
CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCCCHHCCCCHHHHHHHHHHH
AGRSIKSIMTPRSNISWVNTEKTINEIRLQLLDTPHNLFPVCKGELDEIIGIVRAKELLV
HCHHHHHHHCCCCCCCEECHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHH
AIEKNIDVYTFASQIPPIIIPDTLDPINLLGVLRRAQGSFVIVSNEFGVVQGLITPLDVL
HHHCCCCEEEEHHCCCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHH
EAIAGEFPDADETPDIIKEQNSWLVKGETDLHSLQQLLNTKELIKQDDCASLGGLLISQK
HHHHCCCCCCCCCCHHHHCCCCEEEECCCHHHHHHHHHHHHHHHCCCCHHHHCCHHCCCC
GQLPLPGETIKINSFSFHIVNATEYRIDLVRITKN
CCCCCCCCEEEEECEEEEEEECEEEEEEEEEEECH
>Mature Secondary Structure
MEFFLDPSIWAGLLTLVVLEVVLGIDNLIFVAILSEKLPPNQRDKARLIGLGLALIMRLA
CCCEECHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHH
LLSLISWVVTLTSPIISNNFFSLSIRDLILLIGGLFLLFKATIELHERLENEDHENTENK
HHHHHHHHHHHHHHHHCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC
NYASFWAVVIQIVVLDAVFSLDAIITAVGMVNQLLIMMIAVVLATILMLLASKALTNFIN
HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
IHQTVVVLCLSFLLMIGFSLVAEALKFYIPKGYLYAAIGFSILIEIFNQIARHNFMKNQS
HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC
RKPMRQRAAEAILRLMIREKNNNKNRIKTDNKAEIVLSSSLETETFKDEEKYMINGVLTL
CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCCCHHCCCCHHHHHHHHHHH
AGRSIKSIMTPRSNISWVNTEKTINEIRLQLLDTPHNLFPVCKGELDEIIGIVRAKELLV
HCHHHHHHHCCCCCCCEECHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHH
AIEKNIDVYTFASQIPPIIIPDTLDPINLLGVLRRAQGSFVIVSNEFGVVQGLITPLDVL
HHHCCCCEEEEHHCCCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHH
EAIAGEFPDADETPDIIKEQNSWLVKGETDLHSLQQLLNTKELIKQDDCASLGGLLISQK
HHHHCCCCCCCCCCHHHHCCCCEEEECCCHHHHHHHHHHHHHHHCCCCHHHHCCHHCCCC
GQLPLPGETIKINSFSFHIVNATEYRIDLVRITKN
CCCCCCCCEEEEECEEEEEEECEEEEEEEEEEECH
PDB accession: NA
Resolution: NA
Structure class: Unstructured
Cofactors: NA
Metal ions: NA
Kcat value (1/min): NA
Specific activity: NA
Km value (mM): NA
Substrates: NA
Specific reaction: NA
General reaction: NA
Inhibitor: NA
Structure determination priority: 6.0
TargetDB status: NA
Availability: NA
References: 12089438