{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T20:34:51Z","timestamp":1772138091502,"version":"3.50.1"},"reference-count":42,"publisher":"Oxford University Press (OUP)","issue":"12","license":[{"start":{"date-parts":[[2018,11,8]],"date-time":"2018-11-08T00:00:00Z","timestamp":1541635200000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"name":"Portuguese Funda\u00e7\u00e3o para a Ci\u00eancia e Tecnologia"},{"DOI":"10.13039\/100006129","name":"FCT","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100006129","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100005032","name":"Bial Foundation","doi-asserted-by":"crossref","award":["185\/12"],"award-info":[{"award-number":["185\/12"]}],"id":[{"id":"10.13039\/501100005032","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100000654","name":"Marie Curie","doi-asserted-by":"crossref","award":["FP7-PEOPLE-2011-CIG"],"award-info":[{"award-number":["FP7-PEOPLE-2011-CIG"]}],"id":[{"id":"10.13039\/501100000654","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100006129","name":"FCT","doi-asserted-by":"publisher","award":["PTDC\/NEU-NMC\/1276\/2012"],"award-info":[{"award-number":["PTDC\/NEU-NMC\/1276\/2012"]}],"id":[{"id":"10.13039\/100006129","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100010663","name":"European Research Council","doi-asserted-by":"publisher","award":["ERC-2017-COG-773012"],"award-info":[{"award-number":["ERC-2017-COG-773012"]}],"id":[{"id":"10.13039\/100010663","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,6,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>How to partition a dataset into a set of distinct clusters is a ubiquitous and challenging problem. The fact that data vary widely in features such as cluster shape, cluster number, density distribution, background noise, outliers and degree of overlap, makes it difficult to find a single algorithm that can be broadly applied. One recent method, clusterdp, based on search of density peaks, can be applied successfully to cluster many kinds of data, but it is not fully automatic, and fails on some simple data distributions.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We propose an alternative approach, clusterdv, which estimates density dips between points, and allows robust determination of cluster number and distribution across a wide range of data, without any manual parameter adjustment. We show that this method is able to solve a range of synthetic and experimental datasets, where the underlying structure is known, and identifies consistent and meaningful clusters in new behavioral data.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>The clusterdv is implemented in Matlab. Its source code, together with example datasets are available on: https:\/\/github.com\/jcbmarques\/clusterdv.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/bty932","type":"journal-article","created":{"date-parts":[[2018,11,7]],"date-time":"2018-11-07T07:09:36Z","timestamp":1541574576000},"page":"2125-2132","source":"Crossref","is-referenced-by-count":27,"title":["Clusterdv: a simple density-based clustering method that is robust, general and automatic"],"prefix":"10.1093","volume":"35","author":[{"given":"Jo\u00e3o C","family":"Marques","sequence":"first","affiliation":[{"name":"Champalimaud Research, Champalimaud Centre for the Unknown, Avenida Bras\u00edlia, Doca de Pedrou\u00e7os, Lisboa, Portugal"},{"name":"Rowland Institute at Harvard, 100 Edwin H. Land Boulevard, Cambridge, MA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9763-8902","authenticated-orcid":false,"given":"Michael B","family":"Orger","sequence":"additional","affiliation":[{"name":"Champalimaud Research, Champalimaud Centre for the Unknown, Avenida Bras\u00edlia, Doca de Pedrou\u00e7os, Lisboa, Portugal"}]}],"member":"286","published-online":{"date-parts":[[2018,11,8]]},"reference":[{"key":"2023012713222679300_bty932-B1","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1145\/304181.304187","article-title":"OPTICS: ordering points to identify the clustering structure","volume":"28","author":"Ankerst","year":"1999","journal-title":"ACM Sigmod Record"},{"key":"2023012713222679300_bty932-B2","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1080\/00401706.1977.10489521","article-title":"Variable kernel estimates of multivariate densities","volume":"19","author":"Breiman","year":"1977","journal-title":"Technometrics"},{"key":"2023012713222679300_bty932-B3","doi-asserted-by":"crossref","first-page":"4984","DOI":"10.1523\/JNEUROSCI.0615-07.2007","article-title":"Sensorimotor gating in larval zebrafish","volume":"27","author":"Burgess","year":"2007","journal-title":"J. Neurosci"},{"key":"2023012713222679300_bty932-B4","doi-asserted-by":"crossref","first-page":"191","DOI":"10.1016\/j.patcog.2007.04.010","article-title":"Robust path-based spectral clustering","volume":"41","author":"Chang","year":"2008","journal-title":"Pattern Recogn"},{"key":"2023012713222679300_bty932-B5","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1007\/978-3-642-13105-9_2","article-title":"Complete gradient clustering algorithm for features analysis of x-ray images","volume":"69","author":"Charytanowicz","year":"2010","journal-title":"Inform. Technol. Biomed"},{"key":"2023012713222679300_bty932-B6","author":"Chen","year":"2016"},{"key":"2023012713222679300_bty932-B7","first-page":"1","author":"Courjault-Rad\u00e9","year":"2016"},{"key":"2023012713222679300_bty932-B8","doi-asserted-by":"crossref","first-page":"420","DOI":"10.1147\/rd.175.0420","article-title":"Lower bounds for the partitioning of graphs","volume":"17","author":"Donath","year":"1973","journal-title":"IBM J. Res. Dev"},{"key":"2023012713222679300_bty932-B9","first-page":"226","author":"Ester","year":"1996"},{"key":"2023012713222679300_bty932-B10","doi-asserted-by":"crossref","first-page":"761","DOI":"10.1016\/j.patcog.2005.09.012","article-title":"Iterative shrinking method for clustering problems","volume":"39","author":"Fr\u00e4nti","year":"2006","journal-title":"Pattern Recogn"},{"key":"2023012713222679300_bty932-B11","doi-asserted-by":"crossref","first-page":"972","DOI":"10.1126\/science.1136800","article-title":"Clustering by passing messages between data points","volume":"315","author":"Frey","year":"2007","journal-title":"Science"},{"key":"2023012713222679300_bty932-B12","doi-asserted-by":"crossref","first-page":"3.","DOI":"10.1186\/1471-2105-8-3","article-title":"FLAME, a novel fuzzy clustering method for the analysis of DNA microarray data. A novel fuzzy clustering method for the analysis of DNA microarray data","volume":"8","author":"Fu","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023012713222679300_bty932-B13","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1145\/1217299.1217303","article-title":"Clustering aggregation","volume":"1","author":"Gionis","year":"2007","journal-title":"ACM Trans. Knowl. Discov. Data"},{"key":"2023012713222679300_bty932-B14","doi-asserted-by":"crossref","first-page":"651","DOI":"10.1016\/j.patrec.2009.09.011","article-title":"Data clustering: 50 years beyond K-means","volume":"31","author":"Jain","year":"2010","journal-title":"Pattern Recogn. Lett"},{"key":"2023012713222679300_bty932-B15","first-page":"264","author":"Jain","year":"1999"},{"key":"2023012713222679300_bty932-B16","author":"Jain","year":"2005"},{"key":"2023012713222679300_bty932-B17","author":"Karkkainen","year":"2002"},{"key":"2023012713222679300_bty932-B18","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1109\/2.781637","article-title":"Chameleon: hierarchical clustering using dynamic modeling","volume":"32","author":"Karypis","year":"1999","journal-title":"IEEE Comput"},{"key":"2023012713222679300_bty932-B19","first-page":"405","volume-title":"Statistical Data Analysis Based on the L1\u2013Norm and Related Methods","author":"Kaufmann","year":"1987"},{"key":"2023012713222679300_bty932-B20","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1090\/S0002-9939-1956-0078686-7","article-title":"On the shortest spanning subtree of a graph and the traveling salesman problem","volume":"7","author":"Kruskal","year":"1956","journal-title":"Proc. Am. Math. Soc"},{"key":"2023012713222679300_bty932-B21","doi-asserted-by":"crossref","first-page":"2278","DOI":"10.1109\/5.726791","article-title":"Gradient-based learning applied to document recognition","volume":"86","author":"LeCun","year":"1998","journal-title":"Proc. IEEE"},{"key":"2023012713222679300_bty932-B22","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1109\/TIT.1982.1056489","article-title":"Least squares quantization in PCM","volume":"28","author":"Lloyd","year":"1982","journal-title":"IEEE Trans. Inform. Theory"},{"key":"2023012713222679300_bty932-B23","first-page":"2579","article-title":"Visualizing data using t-SNE","volume":"9","author":"Maaten","year":"2008","journal-title":"J. Mach. Learn. Res"},{"key":"2023012713222679300_bty932-B24","doi-asserted-by":"crossref","first-page":"181","DOI":"10.1016\/j.cub.2017.12.002","article-title":"Structure of the zebrafish locomotor repertoire revealed with unsupervised behavioral clustering","volume":"28","author":"Marques","year":"2018","journal-title":"Curr. Biol"},{"key":"2023012713222679300_bty932-B25","article-title":"UMAP: uniform manifold approximation and projection for dimension reduction","author":"McInnes","year":"2018"},{"key":"2023012713222679300_bty932-B26","doi-asserted-by":"crossref","first-page":"45602","DOI":"10.1038\/srep45602","article-title":"Clustering by fast search and merge of local density peaks for gene expression microarray data","volume":"7","author":"Mehmood","year":"2017","journal-title":"Sci. Rep"},{"key":"2023012713222679300_bty932-B27","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/A:1023949509487","article-title":"Consensus clustering: a resampling-based method for class discovery and visualization of gene expression microarray data","volume":"52","author":"Monti","year":"2003","journal-title":"Mach. Learn"},{"key":"2023012713222679300_bty932-B29","doi-asserted-by":"crossref","first-page":"1492","DOI":"10.1126\/science.1242072","article-title":"Clustering by fast search and find of density peaks","volume":"344","author":"Rodriguez","year":"2014","journal-title":"Science"},{"key":"2023012713222679300_bty932-B30","author":"Samaria","year":"1994"},{"key":"2023012713222679300_bty932-B31","doi-asserted-by":"crossref","first-page":"2385","DOI":"10.1109\/TIP.2009.2025923","article-title":"Complex wavelet structural similarity: a new image similarity index","volume":"18","author":"Sampat","year":"2009","journal-title":"IEEE Trans. Image Process"},{"key":"2023012713222679300_bty932-B32","first-page":"888","article-title":"Normalized cuts and image segmentation","volume":"22","author":"Shi","year":"1997","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell"},{"key":"2023012713222679300_bty932-B33","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1093\/comjnl\/16.1.30","article-title":"SLINK: an optimally efficient algorithm for the single-link cluster method","volume":"16","author":"Sibson","year":"1973","journal-title":"Comput. J"},{"key":"2023012713222679300_bty932-B34","first-page":"1409","article-title":"A statistical method for evaluating systematic relationships","volume":"28","author":"Sokal","year":"1958","journal-title":"Univ. Kansas Sci. Bull"},{"key":"2023012713222679300_bty932-B35","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1111\/1467-9868.00293","article-title":"Estimating the number of clusters in a data set via the gap statistic","volume":"63","author":"Tibshirani","year":"2001","journal-title":"J. Roy. Stat. Soc. B"},{"key":"2023012713222679300_bty932-B36","doi-asserted-by":"crossref","first-page":"1273","DOI":"10.1109\/TPAMI.2002.1033218","article-title":"A maximum variance cluster algorithm","volume":"24","author":"Veenman","year":"2002","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell"},{"key":"2023012713222679300_bty932-B37","doi-asserted-by":"crossref","first-page":"2800","DOI":"10.1177\/0962280215609948","article-title":"Fast clustering using adaptive density peak detection","volume":"26","author":"Wang","year":"2017","journal-title":"Stat. Methods Med. Res"},{"key":"2023012713222679300_bty932-B38","doi-asserted-by":"crossref","first-page":"1033","DOI":"10.1038\/nmeth.3583","article-title":"Comparing the performance of biomedical clustering methods","volume":"12","author":"Wiwie","year":"2015","journal-title":"Nat. Methods"},{"key":"2023012713222679300_bty932-B39","doi-asserted-by":"crossref","first-page":"645","DOI":"10.1109\/TNN.2005.845141","article-title":"Survey of clustering algorithms","volume":"16","author":"Xu","year":"2005","journal-title":"IEEE Trans. Neural Netw"},{"key":"2023012713222679300_bty932-B40","doi-asserted-by":"crossref","first-page":"452","DOI":"10.1086\/jar.33.4.3629752","article-title":"An information flow model for conflict and fission in small groups","volume":"33","author":"Zachary","year":"1977","journal-title":"J. Anthropol. Res"},{"key":"2023012713222679300_bty932-B28","article-title":"Self-tuning spectral clustering","author":"Zelnik-Manor","year":"2004","journal-title":"Proceedings of the 17th International Conference on Neural Information Processing Systems (NIPS'04)"},{"key":"2023012713222679300_bty932-B41","first-page":"832","author":"Zhang","year":"2016"},{"key":"2023012713222679300_bty932-B42","first-page":"68","article-title":"Graph-theoretical methods for detecting and describing gestalt clusters","volume":"100","author":"Zhang","year":"1971","journal-title":"IEEE Trans. Comput"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/12\/2125\/48934669\/bioinformatics_35_12_2125.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/12\/2125\/48934669\/bioinformatics_35_12_2125.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,27]],"date-time":"2023-01-27T09:16:26Z","timestamp":1674810986000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/35\/12\/2125\/5165379"}},"subtitle":[],"editor":[{"given":"Jonathan","family":"Wren","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2018,11,8]]},"references-count":42,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2019,6,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bty932","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/224840","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2019,6]]},"published":{"date-parts":[[2018,11,8]]}}}