{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,28]],"date-time":"2025-10-28T10:52:48Z","timestamp":1761648768406,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":45,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,6,8]],"date-time":"2020-06-08T00:00:00Z","timestamp":1591574400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,6,8]]},"DOI":"10.1145\/3372278.3390691","type":"proceedings-article","created":{"date-parts":[[2020,6,2]],"date-time":"2020-06-02T04:35:27Z","timestamp":1591072527000},"page":"207-214","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Search Result Clustering in Collaborative Sound Collections"],"prefix":"10.1145","author":[{"given":"Xavier","family":"Favory","sequence":"first","affiliation":[{"name":"Universitat Pompeu Fabra, Barcelona, Spain"}]},{"given":"Frederic","family":"Font","sequence":"additional","affiliation":[{"name":"Universitat Pompeu Fabra, Barcelona, Spain"}]},{"given":"Xavier","family":"Serra","sequence":"additional","affiliation":[{"name":"Universitat Pompeu Fabra, Barcelona, Spain"}]}],"member":"320","published-online":{"date-parts":[[2020,6,8]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.is.2019.02.006"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"crossref","unstructured":"Alan W Black and Paul A Taylor. 1997. Automatically clustering similar units for unit selection in speech synthesis. (1997).  Alan W Black and Paul A Taylor. 1997. Automatically clustering similar units for unit selection in speech synthesis. (1997).","DOI":"10.21437\/Eurospeech.1997-219"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1088\/1742-5468\/2008\/10\/P10008"},{"key":"e_1_3_2_1_4_1","first-page":"493","volume-title":"14th Conference of the International Society for Music Information Retrieval","author":"Bogdanov Dmitry","year":"2013","unstructured":"Dmitry Bogdanov , Nicolas Wack , Emilia G\u00f3mez Guti\u00e9rrez , Sankalp Gulati , Perfecto Herrera Boyer , Oscar Mayor , Gerard Roma Trepat , Justin Salamon , Jos\u00e9 Ricardo Zapata Gonz\u00e1lez , and Xavier Serra . 2013 . Essentia: An audio analysis library for music information retrieval . In 14th Conference of the International Society for Music Information Retrieval , 2013. p. 493 -- 498 . Dmitry Bogdanov, Nicolas Wack, Emilia G\u00f3mez Guti\u00e9rrez, Sankalp Gulati, Perfecto Herrera Boyer, Oscar Mayor, Gerard Roma Trepat, Justin Salamon, Jos\u00e9 Ricardo Zapata Gonz\u00e1lez, and Xavier Serra. 2013. Essentia: An audio analysis library for music information retrieval. In 14th Conference of the International Society for Music Information Retrieval, 2013. p. 493--8."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1080\/03610927408827101"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1541880.1541884"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1390156.1390171"},{"key":"e_1_3_2_1_8_1","volume-title":"Transfer learning for music classification and regression tasks. arXiv preprint arXiv:1703.09179","author":"Choi Keunwoo","year":"2017","unstructured":"Keunwoo Choi , Gy\u00f6rgy Fazekas , Mark Sandler , and Kyunghyun Cho . 2017. Transfer learning for music classification and regression tasks. arXiv preprint arXiv:1703.09179 ( 2017 ). Keunwoo Choi, Gy\u00f6rgy Fazekas, Mark Sandler, and Kyunghyun Cho. 2017. Transfer learning for music classification and regression tasks. arXiv preprint arXiv:1703.09179 (2017)."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1963405.1963487"},{"key":"e_1_3_2_1_11_1","volume-title":"ICASSP'00","volume":"2","author":"Eronen Antti","year":"2000","unstructured":"Antti Eronen and Anssi Klapuri . 2000 . Musical instrument recognition using cepstral coefficients and temporal features. In Acoustics, Speech, and Signal Processing, 2000 . ICASSP'00 . Proceedings. 2000 IEEE International Conference on, Vol. 2 . IEEE, II753--II756. Antti Eronen and Anssi Klapuri. 2000. Musical instrument recognition using cepstral coefficients and temporal features. In Acoustics, Speech, and Signal Processing, 2000. ICASSP'00. Proceedings. 2000 IEEE International Conference on, Vol. 2. IEEE, II753--II756."},{"key":"e_1_3_2_1_12_1","volume-title":"A survey of clustering algorithms for big data: Taxonomy and empirical analysis","author":"Fahad Adil","year":"2014","unstructured":"Adil Fahad , Najlaa Alshatri , Zahir Tari , Abdullah Alamri , Ibrahim Khalil , Albert Y Zomaya , Sebti Foufou , and Abdelaziz Bouras . 2014. A survey of clustering algorithms for big data: Taxonomy and empirical analysis . IEEE transactions on emerging topics in computing, Vol. 2 , 3 ( 2014 ), 267--279. Adil Fahad, Najlaa Alshatri, Zahir Tari, Abdullah Alamri, Ibrahim Khalil, Albert Y Zomaya, Sebti Foufou, and Abdelaziz Bouras. 2014. A survey of clustering algorithms for big data: Taxonomy and empirical analysis. IEEE transactions on emerging topics in computing, Vol. 2, 3 (2014), 267--279."},{"key":"e_1_3_2_1_13_1","unstructured":"Per Fallgren Zofia Malisz and Jens Edlund. 2018. A Tool for Exploring Large Amounts of Found Audio Data.. In DHN. 499--503.  Per Fallgren Zofia Malisz and Jens Edlund. 2018. A Tool for Exploring Large Amounts of Found Audio Data.. In DHN. 499--503."},{"key":"e_1_3_2_1_14_1","first-page":"486","volume-title":"Proceedings of the 18th ISMIR Conference; 2017 oct 23--27; Suzhou, China.[Canada]: International Society for Music Information Retrieval;","author":"Fonseca Eduardo","year":"2017","unstructured":"Eduardo Fonseca , Jordi Pons Puig , Xavier Favory , Frederic Font Corbera , Dmitry Bogdanov , Andres Ferraro , Sergio Oramas , Alastair Porter , and Xavier Serra . 2017 . Freesound datasets: a platform for the creation of open audio datasets. In Hu X, Cunningham SJ, Turnbull D, Duan Z, editors . Proceedings of the 18th ISMIR Conference; 2017 oct 23--27; Suzhou, China.[Canada]: International Society for Music Information Retrieval; 2017. p. 486 -- 493 . International Society for Music Information Retrieval (ISMIR). Eduardo Fonseca, Jordi Pons Puig, Xavier Favory, Frederic Font Corbera, Dmitry Bogdanov, Andres Ferraro, Sergio Oramas, Alastair Porter, and Xavier Serra. 2017. Freesound datasets: a platform for the creation of open audio datasets. In Hu X, Cunningham SJ, Turnbull D, Duan Z, editors. Proceedings of the 18th ISMIR Conference; 2017 oct 23--27; Suzhou, China.[Canada]: International Society for Music Information Retrieval; 2017. p. 486--93. International Society for Music Information Retrieval (ISMIR)."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2502081.2502245"},{"volume-title":"Computational Analysis of Sound Scenes and Events","author":"Font Frederic","key":"e_1_3_2_1_16_1","unstructured":"Frederic Font , Gerard Roma , and Xavier Serra . 2018. Sound sharing and retrieval . In Computational Analysis of Sound Scenes and Events . Springer , 279--301. Frederic Font, Gerard Roma, and Xavier Serra. 2018. Sound sharing and retrieval. In Computational Analysis of Sound Scenes and Events. Springer, 279--301."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2017.7952261"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1076\/jnmr.32.1.3.16798"},{"key":"e_1_3_2_1_19_1","volume-title":"Jort F Gemmeke, Aren Jansen, R Channing Moore, Manoj Plakal, Devin Platt, Rif A Saurous, Bryan Seybold, et al.","author":"Hershey Shawn","year":"2017","unstructured":"Shawn Hershey , Sourish Chaudhuri , Daniel PW Ellis , Jort F Gemmeke, Aren Jansen, R Channing Moore, Manoj Plakal, Devin Platt, Rif A Saurous, Bryan Seybold, et al. 2017 . CNN architectures for large-scale audio classification. In 2017 ieee international conference on acoustics, speech and signal processing (icassp). IEEE , 131--135. Shawn Hershey, Sourish Chaudhuri, Daniel PW Ellis, Jort F Gemmeke, Aren Jansen, R Channing Moore, Manoj Plakal, Devin Platt, Rif A Saurous, Bryan Seybold, et al. 2017. CNN architectures for large-scale audio classification. In 2017 ieee international conference on acoustics, speech and signal processing (icassp). IEEE, 131--135."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/642611.642616"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/276698.276876"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2017.7952263"},{"key":"e_1_3_2_1_23_1","volume-title":"Proc. ISMIR","volume":"86","author":"Kim Youngmoo E","year":"2010","unstructured":"Youngmoo E Kim , Erik M Schmidt , Raymond Migneco , Brandon G Morton , Patrick Richardson , Jeffrey Scott , Jacquelin A Speck , and Douglas Turnbull . 2010 . Music emotion recognition: A state of the art review . In Proc. ISMIR , Vol. 86 . Citeseer, 937--952. Youngmoo E Kim, Erik M Schmidt, Raymond Migneco, Brandon G Morton, Patrick Richardson, Jeffrey Scott, Jacquelin A Speck, and Douglas Turnbull. 2010. Music emotion recognition: A state of the art review. In Proc. ISMIR, Vol. 86. Citeseer, 937--952."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2010.35"},{"key":"e_1_3_2_1_25_1","first-page":"100","article-title":"Introduction to information retrieval","volume":"16","author":"Manning Christopher","year":"2010","unstructured":"Christopher Manning , Prabhakar Raghavan , and Hinrich Sch\u00fctze . 2010 . Introduction to information retrieval . Natural Language Engineering , Vol. 16 , 1 (2010), 100 -- 103 . Christopher Manning, Prabhakar Raghavan, and Hinrich Sch\u00fctze. 2010. Introduction to information retrieval. Natural Language Engineering, Vol. 16, 1 (2010), 100--103.","journal-title":"Natural Language Engineering"},{"key":"e_1_3_2_1_26_1","volume-title":"George Tzanetakis, and Mathieu Lagrange.","author":"Martins Luis Gustavo","year":"2007","unstructured":"Luis Gustavo Martins , Juan Jos\u00e9 Burred , George Tzanetakis, and Mathieu Lagrange. 2007 . Polyphonic instrument recognition using spectral clustering.. In ISMIR. 213--218. Luis Gustavo Martins, Juan Jos\u00e9 Burred, George Tzanetakis, and Mathieu Lagrange. 2007. Polyphonic instrument recognition using spectral clustering.. In ISMIR. 213--218."},{"key":"e_1_3_2_1_27_1","unstructured":"Robert Neumayer Thomas Lidy and Andreas Rauber. 2005. Content-based organization of digital audio collections .na.  Robert Neumayer Thomas Lidy and Andreas Rauber. 2005. Content-based organization of digital audio collections .na."},{"key":"e_1_3_2_1_28_1","volume-title":"Finding and evaluating community structure in networks. Physical review E","author":"Newman Mark EJ","year":"2004","unstructured":"Mark EJ Newman and Michelle Girvan . 2004. Finding and evaluating community structure in networks. Physical review E , Vol. 69 , 2 ( 2004 ), 026113. Mark EJ Newman and Michelle Girvan. 2004. Finding and evaluating community structure in networks. Physical review E, Vol. 69, 2 (2004), 026113."},{"key":"e_1_3_2_1_29_1","first-page":"1","article-title":"Why You Only Need to Test with 5 Users","volume":"19","author":"Nielsen Jakob","year":"2000","unstructured":"Jakob Nielsen . 2000 . Why You Only Need to Test with 5 Users . Jakob Nielsens Alertbox , Vol. 19 , September 23 (2000), 1 -- 4 . https:\/\/www.nngroup.com\/articles\/why-you-only-need-to-test-with-5-users\/ http:\/\/www.useit.com\/alertbox\/20000319.html Jakob Nielsen. 2000. Why You Only Need to Test with 5 Users. Jakob Nielsens Alertbox, Vol. 19, September 23 (2000), 1--4. https:\/\/www.nngroup.com\/articles\/why-you-only-need-to-test-with-5-users\/ http:\/\/www.useit.com\/alertbox\/20000319.html","journal-title":"Jakob Nielsens Alertbox"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/WASPAA.2013.6701862"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1121\/1.3642604"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-016-3378-2"},{"key":"e_1_3_2_1_33_1","volume-title":"End-to-end learning for music audio tagging at scale. arXiv preprint arXiv:1711.02520","author":"Pons Jordi","year":"2017","unstructured":"Jordi Pons , Oriol Nieto , Matthew Prockup , Erik Schmidt , Andreas Ehmann , and Xavier Serra . 2017. End-to-end learning for music audio tagging at scale. arXiv preprint arXiv:1711.02520 ( 2017 ). Jordi Pons, Oriol Nieto, Matthew Prockup, Erik Schmidt, Andreas Ehmann, and Xavier Serra. 2017. End-to-end learning for music audio tagging at scale. arXiv preprint arXiv:1711.02520 (2017)."},{"key":"e_1_3_2_1_34_1","unstructured":"Gerard Roma Anna Xamb\u00f3 Perfecto Herrera and Robin Laney. 2012. Factors in human recognition of timbre lexicons generated by data clustering. (2012).  Gerard Roma Anna Xamb\u00f3 Perfecto Herrera and Robin Laney. 2012. Factors in human recognition of timbre lexicons generated by data clustering. (2012)."},{"key":"e_1_3_2_1_35_1","unstructured":"Gerard Roma Trepat etal 2015. Algorithms and representations for supporting online music creation with large-scale audio databases. (2015).  Gerard Roma Trepat et al. 2015. Algorithms and representations for supporting online music creation with large-scale audio databases. (2015)."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.5555\/2946645.3007087"},{"key":"e_1_3_2_1_37_1","volume-title":"Reading: Addison-Wesley","volume":"169","author":"Salton Gerard","year":"1989","unstructured":"Gerard Salton . 1989 . Automatic text processing: The transformation, analysis, and retrieval of . Reading: Addison-Wesley , Vol. 169 (1989). Gerard Salton. 1989. Automatic text processing: The transformation, analysis, and retrieval of. Reading: Addison-Wesley, Vol. 169 (1989)."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1177\/0165551506078083"},{"key":"e_1_3_2_1_39_1","volume-title":"Faceted search. Synthesis lectures on information concepts, retrieval, and services","author":"Tunkelang Daniel","year":"2009","unstructured":"Daniel Tunkelang . 2009. Faceted search. Synthesis lectures on information concepts, retrieval, and services , Vol. 1 , 1 ( 2009 ), 1--80. Daniel Tunkelang. 2009. Faceted search. Synthesis lectures on information concepts, retrieval, and services, Vol. 1, 1 (2009), 1--80."},{"volume-title":"Marsyas3D: a prototype audio browser-editor","author":"Tzanetakis George","key":"e_1_3_2_1_40_1","unstructured":"George Tzanetakis and Perry Cook . 2001. Marsyas3D: a prototype audio browser-editor using a large scale immersive visual and audio display. Georgia Institute of Technology . George Tzanetakis and Perry Cook. 2001. Marsyas3D: a prototype audio browser-editor using a large scale immersive visual and audio display. Georgia Institute of Technology."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSA.2002.800560"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.5555\/1756006.1953024"},{"key":"e_1_3_2_1_43_1","volume-title":"Voyager: Exploratory analysis via faceted browsing of visualization recommendations","author":"Wongsuphasawat Kanit","year":"2015","unstructured":"Kanit Wongsuphasawat , Dominik Moritz , Anushka Anand , Jock Mackinlay , Bill Howe , and Jeffrey Heer . 2015 . Voyager: Exploratory analysis via faceted browsing of visualization recommendations . IEEE transactions on visualization and computer graphics, Vol. 22 , 1 (2015), 649--658. Kanit Wongsuphasawat, Dominik Moritz, Anushka Anand, Jock Mackinlay, Bill Howe, and Jeffrey Heer. 2015. Voyager: Exploratory analysis via faceted browsing of visualization recommendations. IEEE transactions on visualization and computer graphics, Vol. 22, 1 (2015), 649--658."},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1007\/s40745-015-0040-1"},{"key":"e_1_3_2_1_45_1","unstructured":"Jason Yosinski Jeff Clune Yoshua Bengio and Hod Lipson. 2014. How transferable are features in deep neural networks?. In Advances in neural information processing systems. 3320--3328.  Jason Yosinski Jeff Clune Yoshua Bengio and Hod Lipson. 2014. How transferable are features in deep neural networks?. In Advances in neural information processing systems. 3320--3328."}],"event":{"name":"ICMR '20: International Conference on Multimedia Retrieval","sponsor":["SIGMM ACM Special Interest Group on Multimedia"],"location":"Dublin Ireland","acronym":"ICMR '20"},"container-title":["Proceedings of the 2020 International Conference on Multimedia Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3372278.3390691","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3372278.3390691","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:32:10Z","timestamp":1750195930000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3372278.3390691"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6,8]]},"references-count":45,"alternative-id":["10.1145\/3372278.3390691","10.1145\/3372278"],"URL":"https:\/\/doi.org\/10.1145\/3372278.3390691","relation":{},"subject":[],"published":{"date-parts":[[2020,6,8]]},"assertion":[{"value":"2020-06-08","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}