{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,20]],"date-time":"2026-03-20T12:15:14Z","timestamp":1774008914131,"version":"3.50.1"},"reference-count":24,"publisher":"Oxford University Press (OUP)","issue":"Supplement_1","license":[{"start":{"date-parts":[[2025,7,15]],"date-time":"2025-07-15T00:00:00Z","timestamp":1752537600000},"content-version":"vor","delay-in-days":14,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100002341","name":"Research Council of Finland","doi-asserted-by":"publisher","award":["359135"],"award-info":[{"award-number":["359135"]}],"id":[{"id":"10.13039\/501100002341","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Cancer Foundation of Finland"},{"DOI":"10.13039\/501100006306","name":"Sigrid Jus\u00e9lius Foundation","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100006306","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Multiple instance learning (MIL) provides a structured approach to patient phenotype prediction with single-cell RNA-sequencing (scRNA-seq) data. However, existing MIL methods tend to overlook the hierarchical structure inherent in scRNA-seq data, especially the biological groupings of cells or cell types. This limitation may lead to suboptimal performance and poor interpretability at higher levels of cellular division.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>To address this gap, we present a novel approach to incorporate hierarchical information into the attention-based MIL framework. Specifically, our model applies the attention-based aggregation mechanism over both cells and cell types, thus enforcing a hierarchical structure on the flow of information throughout the model. Across extensive experiments, our proposed approach demonstrates highly competitive performance and shows robustness against limited sample sizes. Moreover, ablation test results show that simply applying the attention mechanism on cell types instead of cells leads to improved performance, underscoring the benefits of incorporating the hierarchical groupings. By identifying the critical cell types that are most relevant for prediction, we show that our model is capable of capturing biologically meaningful associations, suggesting its potential to facilitate biological discoveries.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>Our source code is available at https:\/\/github.com\/minhchaudo\/hier-mil. All datasets used in this study are publicly available online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaf241","type":"journal-article","created":{"date-parts":[[2025,7,15]],"date-time":"2025-07-15T13:03:41Z","timestamp":1752584621000},"page":"i96-i104","source":"Crossref","is-referenced-by-count":3,"title":["Incorporating hierarchical information into multiple instance learning for patient phenotype prediction with single-cell RNA-sequencing data"],"prefix":"10.1093","volume":"41","author":[{"given":"Chau","family":"Do","sequence":"first","affiliation":[{"name":"Department of Computer Science, Aalto University , Espoo 11000,","place":["Finland"]}]},{"given":"Harri","family":"L\u00e4hdesm\u00e4ki","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Aalto University , Espoo 11000,","place":["Finland"]}]}],"member":"286","published-online":{"date-parts":[[2025,7,15]]},"reference":[{"key":"2025071509033795000_btaf241-B1","doi-asserted-by":"publisher","first-page":"e148517","DOI":"10.1172\/JCI148517","article-title":"Nasal ciliated cells are primary targets for SARS-CoV-2 replication in the early stage of COVID-19","volume":"131","author":"Ahn","year":"2021","journal-title":"J Clin Invest"},{"key":"2025071509033795000_btaf241-B2","doi-asserted-by":"publisher","first-page":"109177","DOI":"10.1016\/j.clim.2022.109177","article-title":"Exhaustion and over-activation of immune cells in covid-19: challenges and therapeutic opportunities","volume":"245","author":"Alahdal","year":"2022","journal-title":"Clin Immunol"},{"key":"2025071509033795000_btaf241-B3","doi-asserted-by":"publisher","first-page":"996","DOI":"10.1158\/2326-6066.CIR-21-0870","article-title":"Microenvironmental landscape of human melanoma brain metastases in response to immune checkpoint inhibition","volume":"10","author":"Alvarez-Breckenridge","year":"2022","journal-title":"Cancer Immunol Res"},{"key":"2025071509033795000_btaf241-B4","doi-asserted-by":"publisher","first-page":"163","DOI":"10.1038\/s41590-018-0276-y","article-title":"Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage","volume":"20","author":"Aran","year":"2019","journal-title":"Nat Immunol"},{"key":"2025071509033795000_btaf241-B5","doi-asserted-by":"publisher","first-page":"820","DOI":"10.1038\/s41591-021-01323-8","article-title":"A single-cell map of intratumoral changes during anti-pd1 treatment of patients with breast cancer","volume":"27","author":"Bassez","year":"2021","journal-title":"Nat Med"},{"key":"2025071509033795000_btaf241-B6","doi-asserted-by":"publisher","first-page":"174","DOI":"10.1038\/s41586-022-04817-8","article-title":"Single-nucleus profiling of human dilated and hypertrophic cardiomyopathy","volume":"608","author":"Chaffin","year":"2022","journal-title":"Nature"},{"key":"2025071509033795000_btaf241-B7","doi-asserted-by":"publisher","first-page":"1470","DOI":"10.1038\/s41592-024-02201-0","article-title":"Scgpt: toward building a foundation model for single-cell multi-omics using generative ai","volume":"21","author":"Cui","year":"2024","journal-title":"Nat Methods"},{"key":"2025071509033795000_btaf241-B8","author":"Engelmann","year":"2024"},{"key":"2025071509033795000_btaf241-B9","doi-asserted-by":"publisher","author":"Gondal","year":"2024","DOI":"10.1101\/2024.01.17.576110"},{"key":"2025071509033795000_btaf241-B10","author":"Hajj","year":"2024"},{"key":"2025071509033795000_btaf241-B11","doi-asserted-by":"publisher","first-page":"337","DOI":"10.1142\/9789811250477_0031","article-title":"Cloudpred: predicting patient phenotypes from single-cell RNA-seq","volume":"27","author":"He","year":"2022","journal-title":"Pac Symp Biocomput"},{"key":"2025071509033795000_btaf241-B12","author":"Ilse","year":"2018"},{"key":"2025071509033795000_btaf241-B13","doi-asserted-by":"publisher","first-page":"632","DOI":"10.1186\/1471-2164-14-632","article-title":"An expression atlas of human primary cells: inference of gene function from coexpression networks","volume":"14","author":"Mabbott","year":"2013","journal-title":"BMC Genomics"},{"key":"2025071509033795000_btaf241-B14","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btae067","article-title":"Phenotype prediction from single-cell RNA-seq data using attention-based neural networks","volume":"40","author":"Mao","year":"2024","journal-title":"Bioinformatics"},{"key":"2025071509033795000_btaf241-B15","doi-asserted-by":"publisher","first-page":"1487","DOI":"10.3324\/haematol.2013.094243","article-title":"Blueprint: mapping human blood cell epigenomes","volume":"98","author":"Martens","year":"2013","journal-title":"Haematologica"},{"key":"2025071509033795000_btaf241-B16","doi-asserted-by":"publisher","first-page":"1008","DOI":"10.1038\/s41423-024-01167-5","article-title":"The role of plasmacytoid dendritic cells (PDCs) in immunity during viral infections and beyond","volume":"21","author":"Ngo","year":"2024","journal-title":"Cell Mol Immunol"},{"key":"2025071509033795000_btaf241-B17","doi-asserted-by":"publisher","first-page":"203","DOI":"10.1186\/s13059-019-1790-4","article-title":"Identifying significantly impacted pathways: a comprehensive review and assessment","volume":"20","author":"Nguyen","year":"2019","journal-title":"Genome Biol"},{"key":"2025071509033795000_btaf241-B18","doi-asserted-by":"publisher","first-page":"166","DOI":"10.1016\/j.cell.2023.11.037","article-title":"A tcf4-dependent gene regulatory network confers resistance to immunotherapy in melanoma","volume":"187","author":"Pozniak","year":"2024","journal-title":"Cell"},{"key":"2025071509033795000_btaf241-B19","doi-asserted-by":"publisher","first-page":"4354","DOI":"10.1038\/s41467-021-24521-x","article-title":"SARS-CoV-2 infection induces the dedifferentiation of multiciliated cells and impairs mucociliary clearance","volume":"12","author":"Robinot","year":"2021","journal-title":"Nat Commun"},{"key":"2025071509033795000_btaf241-B20","doi-asserted-by":"publisher","first-page":"15545","DOI":"10.1073\/pnas.0506580102","article-title":"Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles","volume":"102","author":"Subramanian","year":"2005","journal-title":"Proc Natl Acad Sci USA"},{"key":"2025071509033795000_btaf241-B21","doi-asserted-by":"publisher","first-page":"57","DOI":"10.1038\/nature11247","article-title":"An integrated encyclopedia of DNA elements in the human genome","volume":"489","author":"The ENCODE Project Consortium","year":"2012","journal-title":"Nature"},{"key":"2025071509033795000_btaf241-B22","doi-asserted-by":"publisher","first-page":"btad493","DOI":"10.1093\/bioinformatics\/btad493","article-title":"Protocell4p: an explainable prototype-based neural network for patient classification using single-cell RNA-seq","volume":"39","author":"Xiong","year":"2023","journal-title":"Bioinformatics"},{"key":"2025071509033795000_btaf241-B23","doi-asserted-by":"publisher","first-page":"3910","DOI":"10.1038\/s41467-020-17796-z","article-title":"Morphogenesis and cytopathic effect of SARS-CoV-2 infection in human airway epithelial cells","volume":"11","author":"Zhu","year":"2020","journal-title":"Nat Commun"},{"key":"2025071509033795000_btaf241-B24","doi-asserted-by":"publisher","first-page":"4713","DOI":"10.1016\/j.cell.2021.07.023","article-title":"Impaired local intrinsic immunity to SARS-CoV-2 infection in severe covid-19","volume":"184","author":"Ziegler","year":"2021","journal-title":"Cell"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/Supplement_1\/i96\/63745319\/btaf241.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/Supplement_1\/i96\/63745319\/btaf241.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,15]],"date-time":"2025-07-15T13:03:42Z","timestamp":1752584622000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/41\/Supplement_1\/i96\/8199355"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7,1]]},"references-count":24,"journal-issue":{"issue":"Supplement_1","published-print":{"date-parts":[[2025,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaf241","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,7]]},"published":{"date-parts":[[2025,7,1]]}}}