{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,14]],"date-time":"2026-01-14T22:50:28Z","timestamp":1768431028613,"version":"3.49.0"},"reference-count":25,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2024,5,22]],"date-time":"2024-05-22T00:00:00Z","timestamp":1716336000000},"content-version":"vor","delay-in-days":21,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100005010","name":"Associazione Italiana per la Ricerca sul Cancro","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100005010","id-type":"DOI","asserted-by":"publisher"}]},{"name":"NHS","award":["J93C22002250006"],"award-info":[{"award-number":["J93C22002250006"]}]},{"name":"NHS","award":["PNRR-MAD-2022-12375871"],"award-info":[{"award-number":["PNRR-MAD-2022-12375871"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,5,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Cytometry comprises powerful techniques for analyzing the cell heterogeneity of a biological sample by examining the expression of protein markers. These technologies impact especially the field of oncoimmunology, where cell identification is essential to analyze the tumor microenvironment. Several classification tools have been developed for the annotation of cytometry datasets, which include supervised tools that require a training set as a reference (i.e. reference-based) and semisupervised tools based on the manual definition of a marker table. The latter is closer to the traditional annotation of cytometry data based on manual gating. However, they require the manual definition of a marker table that cannot be extracted automatically in a reference-based fashion. Therefore, we are lacking methods that allow both classification approaches while maintaining the high biological interpretability given by the marker table.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We present a new tool called GateMeClass (Gate Mining and Classification) which overcomes the limitation of the current methods of classification of cytometry data allowing both semisupervised and supervised annotation based on a marker table that can be defined manually or extracted from an external annotated dataset. We measured the accuracy of GateMeClass for annotating three well-established benchmark mass cytometry datasets and one flow cytometry dataset. The performance of GateMeClass is comparable to reference-based methods and marker table-based techniques, offering greater flexibility and rapid execution times.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>GateMeClass is implemented in R language and is publicly available at https:\/\/github.com\/simo1c\/GateMeClass<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btae322","type":"journal-article","created":{"date-parts":[[2024,5,22]],"date-time":"2024-05-22T16:30:44Z","timestamp":1716395444000},"source":"Crossref","is-referenced-by-count":2,"title":["GateMeClass: Gate Mining and Classification of cytometry data"],"prefix":"10.1093","volume":"40","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9829-9527","authenticated-orcid":false,"given":"Simone","family":"Caligola","sequence":"first","affiliation":[{"name":"Veneto Institute of Oncology IOV-IRCCS , Padova, Italy"}]},{"given":"Luca","family":"Giacobazzi","sequence":"additional","affiliation":[{"name":"Section of Immunology, Department of Medicine, University of Verona , Verona, Italy"}]},{"given":"Stefania","family":"Can\u00e8","sequence":"additional","affiliation":[{"name":"Veneto Institute of Oncology IOV-IRCCS , Padova, Italy"}]},{"given":"Antonio","family":"Vella","sequence":"additional","affiliation":[{"name":"Section of Immunology, Azienda Ospedaliera Universitaria Integrata (AOUI) , Verona, Italy"}]},{"given":"Annalisa","family":"Adamo","sequence":"additional","affiliation":[{"name":"Section of Immunology, Department of Medicine, University of Verona , Verona, Italy"}]},{"given":"Stefano","family":"Ugel","sequence":"additional","affiliation":[{"name":"Section of Immunology, Department of Medicine, University of Verona , Verona, Italy"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9843-7638","authenticated-orcid":false,"given":"Rosalba","family":"Giugno","sequence":"additional","affiliation":[{"name":"Department of Computer Science, University of Verona , Verona, Italy"}]},{"given":"Vincenzo","family":"Bronte","sequence":"additional","affiliation":[{"name":"Veneto Institute of Oncology IOV-IRCCS , Padova, Italy"}]}],"member":"286","published-online":{"date-parts":[[2024,5,22]]},"reference":[{"key":"2024052919154552900_btae322-B1","doi-asserted-by":"crossref","first-page":"769","DOI":"10.1002\/cyto.a.23738","article-title":"Predicting cell populations in single cell mass cytometry data","volume":"95","author":"Abdelaal","year":"2019","journal-title":"Cytometry A"},{"key":"2024052919154552900_btae322-B2","doi-asserted-by":"crossref","first-page":"6813","DOI":"10.1021\/ac901049w","article-title":"Mass cytometry: technique for real time single cell multitarget immunoassay based on inductively coupled plasma time-of-flight mass spectrometry","volume":"81","author":"Bandura","year":"2009","journal-title":"Anal Chem"},{"key":"2024052919154552900_btae322-B3","doi-asserted-by":"crossref","first-page":"1181","DOI":"10.1038\/ni.3006","article-title":"High-dimensional analysis of the murine myeloid cell system","volume":"15","author":"Becher","year":"2014","journal-title":"Nat Immunol"},{"key":"2024052919154552900_btae322-B4","doi-asserted-by":"crossref","first-page":"1428","DOI":"10.1038\/s41467-021-21702-6","article-title":"Deciphering the state of immune silence in fatal COVID-19 patients","volume":"12","author":"Bost","year":"2021","journal-title":"Nat Commun"},{"key":"2024052919154552900_btae322-B5","author":"Breiman","year":"1984"},{"key":"2024052919154552900_btae322-B6","doi-asserted-by":"crossref","first-page":"6409","DOI":"10.1172\/JCI141772","article-title":"Baricitinib restrains the immune dysregulation in patients with severe COVID-19","volume":"130","author":"Bronte","year":"2020","journal-title":"J Clin Invest"},{"key":"2024052919154552900_btae322-B7","doi-asserted-by":"crossref","first-page":"693","DOI":"10.1002\/cyto.a.20583","article-title":"Statistical mixture modeling for cell subtype identification in flow cytometry","volume":"73A","author":"Chan","year":"2008","journal-title":"Cytometry Pt A"},{"key":"2024052919154552900_btae322-B8","doi-asserted-by":"crossref","first-page":"e1008885","DOI":"10.1371\/journal.pcbi.1008885","article-title":"DGCyTOF: deep learning with graphic cluster visualization to predict cell types of single cell mass cytometry data","volume":"18","author":"Cheng","year":"2022","journal-title":"PLoS Comput Biol"},{"key":"2024052919154552900_btae322-B9","doi-asserted-by":"crossref","first-page":"247646","DOI":"10.1155\/2009\/247646","article-title":"Merging mixture components for cell population identification in flow cytometry","volume":"2009","author":"Finak","year":"2009","journal-title":"Adv Bioinformatics"},{"key":"2024052919154552900_btae322-B10","first-page":"1","article-title":"Bayesian trees for automated cytometry data analysis","volume":"85","author":"Ji","year":"2018","journal-title":"Proc Mach Learn Res"},{"key":"2024052919154552900_btae322-B11","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v028.i05","article-title":"Building predictive models in R using the caret package","volume":"28","author":"Kuhn","year":"2008","journal-title":"J Stat Soft"},{"key":"2024052919154552900_btae322-B12","doi-asserted-by":"crossref","first-page":"1689","DOI":"10.1093\/bioinformatics\/btx054","article-title":"Automated cell type discovery and classification through knowledge transfer","volume":"33","author":"Lee","year":"2017","journal-title":"Bioinformatics"},{"key":"2024052919154552900_btae322-B13","doi-asserted-by":"crossref","first-page":"184","DOI":"10.1016\/j.cell.2015.05.047","article-title":"Data-driven phenotypic dissection of AML reveals progenitor-like cells that correlate with prognosis","volume":"162","author":"Levine","year":"2015","journal-title":"Cell"},{"key":"2024052919154552900_btae322-B14","doi-asserted-by":"crossref","first-page":"3423","DOI":"10.1093\/bioinformatics\/btx448","article-title":"Gating mass cytometry data by deep learning","volume":"33","author":"Li","year":"2017","journal-title":"Bioinformatics"},{"key":"2024052919154552900_btae322-B15","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1002\/cyto.a.20531","article-title":"Automated gating of flow cytometry data via robust model-based clustering","volume":"73A","author":"Lo","year":"2008","journal-title":"Cytometry Pt A"},{"key":"2024052919154552900_btae322-B16","author":"Naim","year":"2010"},{"key":"2024052919154552900_btae322-B17","doi-asserted-by":"crossref","first-page":"541","DOI":"10.1002\/cyto.a.20258","article-title":"A new \u2018logicle\u2019 display method avoids deceptive effects of logarithmic scaling for low signals and compensated data","volume":"69","author":"Parks","year":"2006","journal-title":"Cytometry A"},{"key":"2024052919154552900_btae322-B18","doi-asserted-by":"crossref","first-page":"8519","DOI":"10.1073\/pnas.0903028106","article-title":"Automated high-dimensional flow cytometric data analysis","volume":"106","author":"Pyne","year":"2009","journal-title":"Proc Natl Acad Sci USA"},{"key":"2024052919154552900_btae322-B19","doi-asserted-by":"crossref","first-page":"886","DOI":"10.1038\/nbt.1991","article-title":"Extracting a cellular hierarchy from high-dimensional cytometry data with SPADE","volume":"29","author":"Qiu","year":"2011","journal-title":"Nat Biotechnol"},{"key":"2024052919154552900_btae322-B20","doi-asserted-by":"crossref","first-page":"493","DOI":"10.1038\/nmeth.3863","article-title":"Automated mapping of phenotype space with single-cell data","volume":"13","author":"Samusik","year":"2016","journal-title":"Nat Methods"},{"key":"2024052919154552900_btae322-B21","doi-asserted-by":"crossref","first-page":"289","DOI":"10.32614\/RJ-2016-021","article-title":"mclust 5: clustering, classification and density estimation using Gaussian finite mixture models","volume":"8","author":"Scrucca","year":"2016","journal-title":"R Journal"},{"key":"2024052919154552900_btae322-B22","doi-asserted-by":"crossref","first-page":"202","DOI":"10.1073\/pnas.1321405111","article-title":"Automatic classification of cellular expression by nonlinear stochastic embedding (ACCENSE)","volume":"111","author":"Shekhar","year":"2014","journal-title":"Proc Natl Acad Sci USA"},{"key":"2024052919154552900_btae322-B23","doi-asserted-by":"crossref","first-page":"636","DOI":"10.1002\/cyto.a.22625","article-title":"FlowSOM: using self-organizing maps for visualization and interpretation of cytometry data","volume":"87","author":"Van Gassen","year":"2015","journal-title":"Cytometry Pt A"},{"key":"2024052919154552900_btae322-B24","doi-asserted-by":"crossref","first-page":"1459","DOI":"10.12688\/f1000research.20210.2","article-title":"HDCytoData: collection of high-dimensional cytometry benchmark datasets in bioconductor object formats","volume":"8","author":"Weber","year":"2019","journal-title":"F1000Res"},{"key":"2024052919154552900_btae322-B25","doi-asserted-by":"crossref","first-page":"1","DOI":"10.5402\/2012\/568385","article-title":"Ranked set sampling: its relevance and impact on statistical inference","volume":"2012","author":"Wolfe","year":"2012","journal-title":"ISRN Prob Stat"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btae322\/57825837\/btae322.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/5\/btae322\/57986651\/btae322.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/5\/btae322\/57986651\/btae322.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,5,29]],"date-time":"2024-05-29T19:16:12Z","timestamp":1717010172000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btae322\/7679648"}},"subtitle":[],"editor":[{"given":"Arne","family":"Elofsson","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2024,5,1]]},"references-count":25,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2024,5,2]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btae322","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,5,1]]},"published":{"date-parts":[[2024,5,1]]},"article-number":"btae322"}}