{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T11:44:12Z","timestamp":1753875852135,"version":"3.41.2"},"reference-count":35,"publisher":"Oxford University Press (OUP)","issue":"11","license":[{"start":{"date-parts":[[2024,10,21]],"date-time":"2024-10-21T00:00:00Z","timestamp":1729468800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001321","name":"National Research Foundation","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100001321","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,11,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Transcriptome data from gene knock-out (KO) experiments in mice provide crucial insights into the intricate interactions between genotype and phenotype. Differentially expressed gene (DEG) analysis and network propagation (NP) are well-established methods for analysing transcriptome data. To determine genes related to phenotype changes from a KO experiment, we need to choose a cutoff value for the corresponding criterion based on the specific method. Using a rigorous cutoff value for DEG analysis and NP is likely to select mostly positive genes related to the phenotype, but many will be rejected as false negatives. On the other hand, using a loose cutoff value for either method is prone to include a number of genes that are not phenotype-related, which are false positives. Thus, the research problem at hand is how to deal with the trade-off between false negatives and false positives.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We propose a novel framework called PONYTA for gene prioritization via positive-unlabeled (PU) learning on biological networks. Beginning with the selection of true phenotype-related genes using a rigorous cutoff value for DEG analysis and NP, we address the issue of handling false negatives by rescuing them through PU learning. Evaluations on transcriptome data from multiple studies show that our approach has superior gene prioritization ability compared to benchmark models. Therefore, PONYTA effectively prioritizes genes related to phenotypes derived from gene KO events and guides in vitro and in vivo gene KO experiments for increased efficiency.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The source code of PONYTA is available at https:\/\/github.com\/Jun-Hyeong-Kim\/PONYTA.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btae634","type":"journal-article","created":{"date-parts":[[2024,10,21]],"date-time":"2024-10-21T17:59:48Z","timestamp":1729533588000},"source":"Crossref","is-referenced-by-count":0,"title":["PONYTA: prioritization of phenotype-related genes from mouse KO events using PU learning on a biological network"],"prefix":"10.1093","volume":"40","author":[{"given":"Jun Hyeong","family":"Kim","sequence":"first","affiliation":[{"name":"Interdisciplinary Program in Artificial Intelligence, Seoul National University , Seoul 08826,","place":["Republic of Korea"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4357-1850","authenticated-orcid":false,"given":"Bonil","family":"Koo","sequence":"additional","affiliation":[{"name":"Interdisciplinary Program in Bioinformatics, Seoul National University , Seoul 08826,","place":["Republic of Korea"]},{"name":"AIGENDRUG Co., Ltd. , Seoul 08758,","place":["Republic of Korea"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5385-9546","authenticated-orcid":false,"given":"Sun","family":"Kim","sequence":"additional","affiliation":[{"name":"Interdisciplinary Program in Artificial Intelligence, Seoul National University , Seoul 08826,","place":["Republic of Korea"]},{"name":"Interdisciplinary Program in Bioinformatics, Seoul National University , Seoul 08826,","place":["Republic of Korea"]},{"name":"AIGENDRUG Co., Ltd. , Seoul 08758,","place":["Republic of Korea"]},{"name":"Department of Computer Science and Engineering, Seoul National University , Seoul 08826,","place":["Republic of Korea"]}]}],"member":"286","published-online":{"date-parts":[[2024,10,21]]},"reference":[{"key":"2024111406110631400_btae634-B1","doi-asserted-by":"crossref","first-page":"101319","DOI":"10.1016\/j.softx.2023.101319","article-title":"Flagr: a flexible high-performance library for rank aggregation","volume":"21","author":"Akritidis","year":"2023","journal-title":"SoftwareX"},{"key":"2024111406110631400_btae634-B2","doi-asserted-by":"crossref","first-page":"117435","DOI":"10.1016\/j.eswa.2022.117435","article-title":"An unsupervised distance-based model for weighted rank aggregation with list pruning","volume":"202","author":"Akritidis","year":"2022","journal-title":"Expert Syst Appl"},{"key":"2024111406110631400_btae634-B3","doi-asserted-by":"crossref","DOI":"10.1186\/gb-2010-11-10-r106","article-title":"Differential expression analysis for sequence count data","volume":"11","author":"Anders","year":"2010","journal-title":"Genome Biol"},{"key":"2024111406110631400_btae634-B4","doi-asserted-by":"crossref","first-page":"719","DOI":"10.1007\/s10994-020-05877-5","article-title":"Learning from positive and unlabeled data: a survey","volume":"109","author":"Bekker","year":"2020","journal-title":"Mach Learn"},{"key":"2024111406110631400_btae634-B5","doi-asserted-by":"crossref","first-page":"551","DOI":"10.1038\/nrg.2017.38","article-title":"Network propagation: a universal amplifier of genetic associations","volume":"18","author":"Cowen","year":"2017","journal-title":"Nat Rev Genet"},{"key":"2024111406110631400_btae634-B6","doi-asserted-by":"crossref","first-page":"2441","DOI":"10.1093\/bioinformatics\/bty148","article-title":"Network-based integration of multi-omics data for prioritizing cancer genes","volume":"34","author":"Dimitrakopoulos","year":"2018","journal-title":"Bioinformatics"},{"key":"2024111406110631400_btae634-B7","doi-asserted-by":"crossref","first-page":"101417","DOI":"10.1016\/j.jbc.2021.101417","article-title":"CRISPR\/Cas9-mediated SARM1 knockout and epitope-tagged mice reveal that SARM1 does not regulate nuclear transcription, but is expressed in macrophages","volume":"297","author":"Doran","year":"2021","journal-title":"J Biol Chem"},{"first-page":"301","year":"2003","author":"Fagin","key":"2024111406110631400_btae634-B8"},{"first-page":"591","year":"2007","author":"Farah","key":"2024111406110631400_btae634-B9"},{"key":"2024111406110631400_btae634-B10","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1016\/j.conb.2020.02.012","article-title":"The sarm1 axon degeneration pathway: control of the NAD+ metabolome regulates axon survival in health and disease","volume":"63","author":"Figley","year":"2020","journal-title":"Curr Opin Neurobiol"},{"key":"2024111406110631400_btae634-B11","doi-asserted-by":"crossref","first-page":"e1004120","DOI":"10.1371\/journal.pcbi.1004120","article-title":"A disease module detection (diamond) algorithm derived from a systematic analysis of connectivity patterns of disease proteins in the human interactome","volume":"11","author":"Ghiassian","year":"2015","journal-title":"PLoS Comput Biol"},{"key":"2024111406110631400_btae634-B12","first-page":"855","article-title":"node2vec: scalable feature learning for networks","volume":"2016","author":"Grover","year":"2016","journal-title":"KDD"},{"key":"2024111406110631400_btae634-B13","doi-asserted-by":"crossref","first-page":"293","DOI":"10.1038\/s41587-023-01767-y","article-title":"Dictionary learning for integrative, multimodal and scalable single-cell analysis","volume":"42","author":"Hao","year":"2024","journal-title":"Nat Biotechnol"},{"issue":"Suppl 2","key":"2024111406110631400_btae634-B14","doi-asserted-by":"crossref","first-page":"S10","DOI":"10.1186\/1755-8794-8-S2-S10","article-title":"Combined analysis of gene regulatory network and SNV information enhances identification of potential gene markers in mouse knockout studies with small number of samples","volume":"8","author":"Hur","year":"2015","journal-title":"BMC Med Genomics"},{"volume-title":"IEEE Trans Pattern Anal Mach Intell","year":"2023","author":"Jiang Y, Xu Q, Zhao Y","key":"2024111406110631400_btae634-B35"},{"key":"2024111406110631400_btae634-B15","doi-asserted-by":"crossref","first-page":"949","DOI":"10.1016\/j.ajhg.2008.02.013","article-title":"Walking the interactome for prioritization of candidate disease genes","volume":"82","author":"K\u00f6hler","year":"2008","journal-title":"Am J Hum Genet"},{"key":"2024111406110631400_btae634-B16","doi-asserted-by":"crossref","first-page":"702","DOI":"10.1016\/j.cmet.2022.03.010","article-title":"SREBP1c-PARP1 axis tunes anti-senescence activity of adipocytes and ameliorates metabolic imbalance in obesity","volume":"34","author":"Lee","year":"2022","journal-title":"Cell Metab"},{"key":"2024111406110631400_btae634-B17","doi-asserted-by":"crossref","first-page":"1235","DOI":"10.1038\/s41380-019-0353-0","article-title":"Disruption of TCF4 regulatory networks leads to abnormal cortical development and mental disabilities","volume":"24","author":"Li","year":"2019","journal-title":"Mol Psychiatry"},{"first-page":"951","year":"2016","author":"Liang","key":"2024111406110631400_btae634-B18"},{"key":"2024111406110631400_btae634-B19","doi-asserted-by":"crossref","first-page":"20545","DOI":"10.1073\/pnas.1906663116","article-title":"Transcriptional control of lung alveolar type 1 cell development and maintenance by NK homeobox 2-1","volume":"116","author":"Little","year":"2019","journal-title":"Proc Natl Acad Sci U S A"},{"key":"2024111406110631400_btae634-B20","first-page":"10","article-title":"Differential analysis of count data\u2013the deseq2 package","volume":"15","author":"Love","year":"2014","journal-title":"Genome Biol"},{"key":"2024111406110631400_btae634-B21","doi-asserted-by":"crossref","first-page":"8892","DOI":"10.1074\/jbc.M117.784629","article-title":"The transcription factor Pax6 is required for pancreatic \u03b2 cell identity, glucose-regulated ATP synthesis, and Ca2+ dynamics in adult mice","volume":"292","author":"Mitchell","year":"2017","journal-title":"J Biol Chem"},{"first-page":"841","year":"2003","author":"Renda","key":"2024111406110631400_btae634-B22"},{"first-page":"501","year":"2020","author":"Saito","key":"2024111406110631400_btae634-B23"},{"key":"2024111406110631400_btae634-B24","doi-asserted-by":"crossref","first-page":"839","DOI":"10.1261\/rna.053959.115","article-title":"How many biological replicates are needed in an RNA-seq experiment and which differential expression tool should you use?","volume":"22","author":"Schurch","year":"2016","journal-title":"RNA"},{"key":"2024111406110631400_btae634-B25","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2105-8-239","article-title":"Probabilistic prediction and ranking of human protein\u2013protein interactions","volume":"8","author":"Scott","year":"2007","journal-title":"BMC Bioinformatics"},{"volume-title":"Development","year":"2023","author":"Souali-Crespo","key":"2024111406110631400_btae634-B26"},{"key":"2024111406110631400_btae634-B27","doi-asserted-by":"crossref","first-page":"D638","DOI":"10.1093\/nar\/gkac1000","article-title":"The string database in 2023: protein\u2013protein association networks and functional enrichment analyses for any sequenced genome of interest","volume":"51","author":"Szklarczyk","year":"2023","journal-title":"Nucleic Acids Res"},{"author":"Veli\u010dkovi\u0107","key":"2024111406110631400_btae634-B28"},{"issue":"Suppl 1","key":"2024111406110631400_btae634-B29","doi-asserted-by":"crossref","first-page":"S6","DOI":"10.1186\/1471-2105-16-S1-S6","article-title":"Network tuned multiple rank aggregation and applications to gene ranking","volume":"16","author":"Wang","year":"2015","journal-title":"BMC Bioinformatics"},{"key":"2024111406110631400_btae634-B30","doi-asserted-by":"crossref","first-page":"482","DOI":"10.1016\/j.copbio.2008.07.011","article-title":"Gene module level analysis: identification to networks and dynamics","volume":"19","author":"Wang","year":"2008","journal-title":"Curr Opin Biotechnol"},{"key":"2024111406110631400_btae634-B31","doi-asserted-by":"crossref","first-page":"dev196022","DOI":"10.1242\/dev.196022","article-title":"scRNA sequencing uncovers a TCF4-dependent transcription factor network regulating commissure development in mouse","volume":"148","author":"Wittmann","year":"2021","journal-title":"Development"},{"key":"2024111406110631400_btae634-B32","doi-asserted-by":"crossref","first-page":"43258","DOI":"10.1038\/srep43258","article-title":"GenePANDA\u2014a novel network-based gene prioritizing tool for complex diseases","volume":"7","author":"Yin","year":"2017","journal-title":"Sci Rep"},{"key":"2024111406110631400_btae634-B33","doi-asserted-by":"crossref","first-page":"827","DOI":"10.1109\/ICDM51629.2021.00094","volume-title":"2021 IEEE international conference on data mining (ICDM), Auckland, New Zealand, December 7\u201310","author":"Yoo","year":"2021"},{"key":"2024111406110631400_btae634-B34","doi-asserted-by":"crossref","first-page":"bbab006","DOI":"10.1093\/bib\/bbab006","article-title":"Benchmarking network-based gene prioritization methods for cerebral small vessel disease","volume":"22","author":"Zhang","year":"2021","journal-title":"Brief Bioinform"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btae634\/59933657\/btae634.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/11\/btae634\/60658171\/btae634.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/11\/btae634\/60658171\/btae634.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,11,14]],"date-time":"2024-11-14T06:11:29Z","timestamp":1731564689000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btae634\/7829145"}},"subtitle":[],"editor":[{"given":"Pier Luigi","family":"Martelli","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2024,10,21]]},"references-count":35,"journal-issue":{"issue":"11","published-print":{"date-parts":[[2024,11,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btae634","relation":{},"ISSN":["1367-4811"],"issn-type":[{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2024,11]]},"published":{"date-parts":[[2024,10,21]]},"article-number":"btae634"}}