{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T06:45:53Z","timestamp":1776062753482,"version":"3.50.1"},"reference-count":49,"publisher":"Oxford University Press (OUP)","issue":"2","license":[{"start":{"date-parts":[[2024,1,26]],"date-time":"2024-01-26T00:00:00Z","timestamp":1706227200000},"content-version":"vor","delay-in-days":1,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100004063","name":"Knut and Alice Wallenberg Foundation","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100004063","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001862","name":"Swedish Research Council Formas","doi-asserted-by":"publisher","award":["2020-01690"],"award-info":[{"award-number":["2020-01690"]}],"id":[{"id":"10.13039\/501100001862","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,2,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Proteomic profiles reflect the functional readout of the physiological state of an organism. An increased understanding of what controls and defines protein abundances is of high scientific interest. Saccharomyces cerevisiae is a well-studied model organism, and there is a large amount of structured knowledge on yeast systems biology in databases such as the Saccharomyces Genome Database, and highly curated genome-scale metabolic models like Yeast8. These datasets, the result of decades of experiments, are abundant in information, and adhere to semantically meaningful ontologies.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>By representing this knowledge in an expressive Datalog database we generated data descriptors using relational learning that, when combined with supervised machine learning, enables us to predict protein abundances in an explainable manner. We learnt predictive relationships between protein abundances, function and phenotype; such as \u03b1-amino acid accumulations and deviations in chronological lifespan. We further demonstrate the power of this methodology on the proteins His4 and Ilv2, connecting qualitative biological concepts to quantified abundances.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>All data and processing scripts are available at the following Github repository: https:\/\/github.com\/DanielBrunnsaker\/ProtPredict.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btae050","type":"journal-article","created":{"date-parts":[[2024,1,26]],"date-time":"2024-01-26T06:46:09Z","timestamp":1706251569000},"source":"Crossref","is-referenced-by-count":1,"title":["Interpreting protein abundance in <i>Saccharomyces cerevisiae<\/i> through relational learning"],"prefix":"10.1093","volume":"40","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5167-0536","authenticated-orcid":false,"given":"Daniel","family":"Brunns\u00e5ker","sequence":"first","affiliation":[{"name":"Department of Computer Science and Engineering, Chalmers University of Technology , Gothenburg 412 96, Sweden"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3011-5541","authenticated-orcid":false,"given":"Filip","family":"Kronstr\u00f6m","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, Chalmers University of Technology , Gothenburg 412 96, Sweden"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0408-3515","authenticated-orcid":false,"given":"Ievgeniia A","family":"Tiukova","sequence":"additional","affiliation":[{"name":"Department of Life Sciences, Chalmers University of Technology , Gothenburg 412 96, Sweden"},{"name":"Department of Industrial Biotechnology, KTH Royal Institute of Technology , Stockholm 106 91, Sweden"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7208-4387","authenticated-orcid":false,"given":"Ross D","family":"King","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, Chalmers University of Technology , Gothenburg 412 96, Sweden"},{"name":"Department of Chemical Engineering and Biotechnology, University of Cambridge , Cambridge CB3 0AS, United Kingdom"},{"name":"The Alan Turing Institute , London NW1 2DB, United Kingdom"}]}],"member":"286","published-online":{"date-parts":[[2024,1,25]]},"reference":[{"key":"2024020805385482600_btae050-B1","doi-asserted-by":"crossref","first-page":"bar062","DOI":"10.1093\/database\/bar062","article-title":"YeastMine\u2014an integrated data warehouse for Saccharomyces cerevisiae data as a multipurpose tool-kit","volume":"2012","author":"Balakrishnan","year":"2012","journal-title":"Database (Oxford)"},{"key":"2024020805385482600_btae050-B2","doi-asserted-by":"crossref","first-page":"495","DOI":"10.1126\/science.1153716","article-title":"From genotype to phenotype: systems biology meets natural variation","volume":"320","author":"Benfey","year":"2008","journal-title":"Science (New York, N.Y.)"},{"key":"2024020805385482600_btae050-B3","doi-asserted-by":"crossref","first-page":"70","DOI":"10.1016\/j.tma.2019.09.001","article-title":"Amino acids in the regulation of aging and aging-related diseases","volume":"3","author":"Canfield","year":"2019","journal-title":"Transl Med Aging"},{"key":"2024020805385482600_btae050-B4","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1038\/nature18919","article-title":"The TORC1 pathway to protein destruction","volume":"536","author":"Chantranupong","year":"2016","journal-title":"Nature"},{"key":"2024020805385482600_btae050-B5","first-page":"785","author":"Chen","year":"2016"},{"key":"2024020805385482600_btae050-B6","doi-asserted-by":"crossref","first-page":"D700","DOI":"10.1093\/nar\/gkr1029","article-title":"Saccharomyces genome database: the genomics resource of budding yeast","volume":"40","author":"Cherry","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2024020805385482600_btae050-B7","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1023\/A:1009863704807","article-title":"Discovery of frequent DATALOG patterns","volume":"3","author":"Dehaspe","year":"1999","journal-title":"Data Min Knowledge Discov"},{"key":"2024020805385482600_btae050-B8","doi-asserted-by":"crossref","first-page":"349","DOI":"10.1016\/j.molcel.2012.05.043","article-title":"Glutaminolysis activates rag-mTORC1 signaling","volume":"47","author":"Dur\u00e1n","year":"2012","journal-title":"Mol Cell"},{"key":"2024020805385482600_btae050-B9","doi-asserted-by":"crossref","first-page":"665490","DOI":"10.3389\/ffunb.2021.665490","article-title":"Understanding the impact of industrial stress conditions on replicative aging in Saccharomyces cerevisiae","volume":"2","author":"Eigenfeld","year":"2021","journal-title":"Front Fungal Biol"},{"key":"2024020805385482600_btae050-B10","doi-asserted-by":"crossref","first-page":"348","DOI":"10.1038\/s41586-021-03922-4","article-title":"Biologically informed deep neural network for prostate cancer discovery","volume":"598","author":"Elmarakeby","year":"2021","journal-title":"Nature"},{"key":"2024020805385482600_btae050-B11","doi-asserted-by":"crossref","first-page":"4011","DOI":"10.1093\/nar\/13.11.4011","article-title":"Nucleotide sequence of the yeast ILV2 gene which encodes acetolactate synthase","volume":"13","author":"Falco","year":"1985","journal-title":"Nucleic Acids Res"},{"key":"2024020805385482600_btae050-B12","doi-asserted-by":"crossref","first-page":"190","DOI":"10.1186\/s13059-020-02100-5","article-title":"Knowledge-primed neural networks enable biologically interpretable deep learning on single-cell sequencing data","volume":"21","author":"Fortelny","year":"2020","journal-title":"Genome Biol"},{"key":"2024020805385482600_btae050-B13","doi-asserted-by":"crossref","first-page":"461901","DOI":"10.1155\/2013\/461901","article-title":"Metabolic fate of the increased yeast amino acid uptake subsequent to catabolite derepression","volume":"2013","author":"Hothersall","year":"2013","journal-title":"J Amino Acids"},{"key":"2024020805385482600_btae050-B14","doi-asserted-by":"crossref","first-page":"194","DOI":"10.1126\/science.1259472","article-title":"Differential regulation of mTORC1 by leucine and glutamine","volume":"347","author":"Jewell","year":"2015","journal-title":"Science"},{"key":"2024020805385482600_btae050-B15","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1023\/A:1008171016861","article-title":"Warmr: a data mining tool for chemical data","volume":"15","author":"King","year":"2001","journal-title":"J Comput Aided Mol Des"},{"key":"2024020805385482600_btae050-B16","doi-asserted-by":"crossref","first-page":"276","DOI":"10.1038\/s42256-021-00332-z","article-title":"Cross-validation is safe to use","volume":"3","author":"King","year":"2021","journal-title":"Nat Mach Intell"},{"key":"2024020805385482600_btae050-B17","doi-asserted-by":"crossref","first-page":"262","DOI":"10.1007\/978-3-662-04599-2_11","volume-title":"Relational Data Mining","author":"Kramer","year":"2001"},{"key":"2024020805385482600_btae050-B18","doi-asserted-by":"crossref","first-page":"495","DOI":"10.1016\/j.cels.2017.03.003","article-title":"Absolute quantification of protein and mRNA abundances demonstrate variability in gene-specific translation efficiency in yeast","volume":"4","author":"Lahtvee","year":"2017","journal-title":"Cell Syst"},{"key":"2024020805385482600_btae050-B19","doi-asserted-by":"crossref","first-page":"1465","DOI":"10.1007\/s10994-020-05890-8","article-title":"Propositionalization and embeddings: two sides of the same coin","volume":"109","author":"Lavra\u010d","year":"2020","journal-title":"Mach Learn"},{"key":"2024020805385482600_btae050-B20","doi-asserted-by":"crossref","first-page":"D583","DOI":"10.1093\/nar\/gkac831","article-title":"GotEnzymes: an extensive database of enzyme parameter predictions","volume":"51","author":"Li","year":"2023","journal-title":"Nucleic Acids Res"},{"key":"2024020805385482600_btae050-B21","doi-asserted-by":"crossref","first-page":"1194","DOI":"10.7150\/ijbs.40769","article-title":"LncRNAs regulate metabolism in cancer","volume":"16","author":"Lin","year":"2020","journal-title":"Int J Biol Sci"},{"key":"2024020805385482600_btae050-B22","volume-title":"Foundations of Logic Programming","author":"Lloyd","year":"2012"},{"key":"2024020805385482600_btae050-B23","doi-asserted-by":"crossref","first-page":"3586","DOI":"10.1038\/s41467-019-11581-3","article-title":"A consensus S. cerevisiae metabolic model Yeast8 and its ecosystem for comprehensively probing cellular metabolism","volume":"10","author":"Lu","year":"2019","journal-title":"Nat Commun"},{"key":"2024020805385482600_btae050-B24","first-page":"4768","article-title":"A unified approach to interpreting model predictions","volume":"30","author":"Lundberg","year":"2017","journal-title":"Adv Neural Inform Process Syst"},{"key":"2024020805385482600_btae050-B25","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1038\/s42256-019-0138-9","article-title":"From local explanations to global understanding with explainable AI for trees","volume":"2","author":"Lundberg","year":"2020","journal-title":"Nat Mach Intell"},{"key":"2024020805385482600_btae050-B26","doi-asserted-by":"crossref","first-page":"290","DOI":"10.1038\/nmeth.4627","article-title":"Using deep learning to model the hierarchical structure and function of a cell","volume":"15","author":"Ma","year":"2018","journal-title":"Nat Methods"},{"key":"2024020805385482600_btae050-B27","doi-asserted-by":"crossref","first-page":"1776","DOI":"10.15252\/embr.201642943","article-title":"Molecular insights into protein synthesis with proline residues","volume":"17","author":"Melnikov","year":"2016","journal-title":"EMBO Rep"},{"key":"2024020805385482600_btae050-B28","first-page":"2018","volume-title":"Cell","author":"Messner","year":"2023"},{"key":"2024020805385482600_btae050-B29","doi-asserted-by":"crossref","first-page":"e2200013","DOI":"10.1002\/pmic.202200013","article-title":"Mass spectrometry-based high-throughput proteomics and its role in biomedical studies and systems biology","volume":"23","author":"Messner","year":"2022","journal-title":"Proteomics"},{"key":"2024020805385482600_btae050-B30","doi-asserted-by":"crossref","first-page":"553","DOI":"10.1016\/j.cell.2016.09.007","article-title":"Functional metabolomics describes the yeast biosynthetic regulome","volume":"167","author":"M\u00fclleder","year":"2016","journal-title":"Cell"},{"key":"2024020805385482600_btae050-B31","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1007\/s10994-011-5259-2","article-title":"ILP turns 20","volume":"86","author":"Muggleton","year":"2012","journal-title":"Mach Learn"},{"key":"2024020805385482600_btae050-B32","doi-asserted-by":"crossref","first-page":"10294","DOI":"10.1073\/pnas.1919250117","article-title":"Quantitative analysis of amino acid metabolism in liver cancer links glutamate excretion to nucleotide synthesis","volume":"117","author":"Nilsson","year":"2020","journal-title":"Proc Natl Acad Sci USA"},{"key":"2024020805385482600_btae050-B33","first-page":"374","volume-title":"Proceedings of the 23rd International Conference on Discovery Science","author":"Orhobor","year":"2020"},{"key":"2024020805385482600_btae050-B34","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1002\/pro.3978","article-title":"The BioGRID database: a comprehensive biomedical resource of curated protein, genetic, and chemical interactions","volume":"30","author":"Oughtred","year":"2021","journal-title":"Protein Sci"},{"key":"2024020805385482600_btae050-B35","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-540-68856-3","volume-title":"Logical and Relational Learning","author":"Raedt","year":"2008"},{"key":"2024020805385482600_btae050-B36","doi-asserted-by":"crossref","first-page":"1695","DOI":"10.1007\/s10994-021-06017-3","article-title":"Beyond graph neural networks with lifted relational neural networks","volume":"110","author":"\u0160ourek","year":"2021","journal-title":"Mach Learn"},{"key":"2024020805385482600_btae050-B37","author":"Srinivasan","year":"2001"},{"key":"2024020805385482600_btae050-B38","doi-asserted-by":"crossref","first-page":"293","DOI":"10.15698\/mic2018.06.637","article-title":"Valine biosynthesis in Saccharomyces cerevisiae is regulated by the mitochondrial branched-chain amino acid aminotransferase Bat1","volume":"5","author":"Takpho","year":"2018","journal-title":"Microb Cell"},{"key":"2024020805385482600_btae050-B39","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1038\/nrg3185","article-title":"Insights into the regulation of protein abundance from proteomic and transcriptomic analyses","volume":"13","author":"Vogel","year":"2012","journal-title":"Nat Rev Genet"},{"key":"2024020805385482600_btae050-B40","doi-asserted-by":"crossref","first-page":"M111.009217","DOI":"10.1074\/mcp.M111.009217","article-title":"Protein expression regulation under oxidative stress","volume":"10","author":"Vogel","year":"2011","journal-title":"Mol Cell Proteomics"},{"key":"2024020805385482600_btae050-B41","doi-asserted-by":"crossref","first-page":"giz137","DOI":"10.1093\/gigascience\/giz137","article-title":"Compartment and hub definitions tune metabolic networks for metabolomic interpretations","volume":"9","author":"Waller","year":"2020","journal-title":"Gigascience"},{"key":"2024020805385482600_btae050-B42","doi-asserted-by":"crossref","first-page":"e2102344118","DOI":"10.1073\/pnas.2102344118","article-title":"Genome-scale metabolic network reconstruction of model animals as a platform for translational research","volume":"118","author":"Wang","year":"2021","journal-title":"Proc Natl Acad Sci USA"},{"key":"2024020805385482600_btae050-B43","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1017\/S1471068411000494","article-title":"SWI-Prolog","volume":"12","author":"Wielemaker","year":"2012","journal-title":"Theory Pract Logic Program"},{"key":"2024020805385482600_btae050-B44","doi-asserted-by":"crossref","first-page":"180241","DOI":"10.1098\/rsob.180241","article-title":"Hidden in plain sight: what remains to be discovered in the eukaryotic proteome?","volume":"9","author":"Wood","year":"2019","journal-title":"Open Biol"},{"key":"2024020805385482600_btae050-B45","first-page":"100141","article-title":"clusterProfiler 4.0: a universal enrichment tool for interpreting omics data","volume":"2","author":"Wu","year":"2021","journal-title":"Innovation (Cambridge (Mass.))"},{"key":"2024020805385482600_btae050-B46","doi-asserted-by":"crossref","first-page":"984","DOI":"10.1139\/g88-156","article-title":"The yeast ILV2 gene is under general amino acid control","volume":"30","author":"Xiao","year":"1988","journal-title":"Genome"},{"key":"2024020805385482600_btae050-B47","doi-asserted-by":"crossref","first-page":"616","DOI":"10.1111\/j.1474-9726.2010.00590.x","article-title":"Metabolomics-based systematic prediction of yeast lifespan and its application for semi-rational screening of ageing-related mutants","volume":"9","author":"Yoshida","year":"2010","journal-title":"Aging Cell"},{"key":"2024020805385482600_btae050-B48","doi-asserted-by":"crossref","first-page":"726","DOI":"10.1109\/TETCI.2021.3100641","article-title":"A survey on neural network interpretability","volume":"5","author":"Zhang","year":"2021","journal-title":"IEEE Trans Emerg Top Comput Intell"},{"key":"2024020805385482600_btae050-B49","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1016\/j.aiopen.2021.01.001","article-title":"Graph neural networks: a review of methods and applications","volume":"1","author":"Zhou","year":"2020","journal-title":"AI Open"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btae050\/56423152\/btae050.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/2\/btae050\/56619452\/btae050.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/2\/btae050\/56619452\/btae050.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,8]],"date-time":"2024-02-08T06:03:59Z","timestamp":1707372239000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btae050\/7589924"}},"subtitle":[],"editor":[{"given":"Jonathan","family":"Wren","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2024,1,25]]},"references-count":49,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2024,2,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btae050","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,2,1]]},"published":{"date-parts":[[2024,1,25]]},"article-number":"btae050"}}