{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:25:31Z","timestamp":1750220731150,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":31,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,3,30]],"date-time":"2020-03-30T00:00:00Z","timestamp":1585526400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"European Unions Horizon 2020","award":["SEC-740754"],"award-info":[{"award-number":["SEC-740754"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,3,30]]},"DOI":"10.1145\/3341105.3374114","type":"proceedings-article","created":{"date-parts":[[2020,3,29]],"date-time":"2020-03-29T12:13:52Z","timestamp":1585484032000},"page":"706-713","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Unsupervised cross-modal audio representation learning from unstructured multilingual text"],"prefix":"10.1145","author":[{"given":"Alexander","family":"Schindler","sequence":"first","affiliation":[{"name":"Austrian Institute of Technology, Vienna, Austria"}]},{"given":"Sergiu","family":"Gordea","sequence":"additional","affiliation":[{"name":"Austrian Institute of Technology, Vienna, Austria"}]},{"given":"Peter","family":"Knees","sequence":"additional","affiliation":[{"name":"TU Wien, Vienna, Austria"}]}],"member":"320","published-online":{"date-parts":[[2020,3,30]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1162\/014892604323112257"},{"key":"e_1_3_2_1_2_1","volume-title":"Proceedings of the International Conference on Music Information Retrieval (ISMIR2011)","volume":"2","author":"Bertin-Mahieux Thierry","year":"2011","unstructured":"Thierry Bertin-Mahieux , Daniel PW Ellis , Brian Whitman , and Paul Lamere . 2011 . The Million Song Dataset . In Proceedings of the International Conference on Music Information Retrieval (ISMIR2011) , Vol. 2 . 10. Thierry Bertin-Mahieux, Daniel PW Ellis, Brian Whitman, and Paul Lamere. 2011. The Million Song Dataset. In Proceedings of the International Conference on Music Information Retrieval (ISMIR2011), Vol. 2. 10."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2017.7952585"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9"},{"key":"e_1_3_2_1_5_1","volume-title":"Evaluation of the audio beat tracking system beatroot. Journal of New Music Research","author":"Dixon Simon","year":"2007","unstructured":"Simon Dixon . 2007. Evaluation of the audio beat tracking system beatroot. Journal of New Music Research ( 2007 ). Simon Dixon. 2007. Evaluation of the audio beat tracking system beatroot. Journal of New Music Research (2007)."},{"key":"e_1_3_2_1_6_1","volume-title":"2013 IEEE Workshop on. IEEE, 1--4.","author":"Giannoulis Dimitrios","year":"2013","unstructured":"Dimitrios Giannoulis , Emmanouil Benetos , Dan Stowell , Mathias Rossignol , Mathieu Lagrange , and Mark D Plumbley . 2013 . Detection and classification of acoustic scenes and events: An IEEE AASP challenge. In Applications of Signal Processing to Audio and Acoustics (WASPAA) , 2013 IEEE Workshop on. IEEE, 1--4. Dimitrios Giannoulis, Emmanouil Benetos, Dan Stowell, Mathias Rossignol, Mathieu Lagrange, and Mark D Plumbley. 2013. Detection and classification of acoustic scenes and events: An IEEE AASP challenge. In Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE Workshop on. IEEE, 1--4."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1178723.1178727"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-24261-3_7"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10844-013-0248-5"},{"key":"e_1_3_2_1_10_1","volume-title":"A survey of music similarity and recommendation from music context data. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)","author":"Knees Peter","year":"2013","unstructured":"Peter Knees and Markus Schedl . 2013. A survey of music similarity and recommendation from music context data. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) ( 2013 ). Peter Knees and Markus Schedl. 2013. A survey of music similarity and recommendation from music context data. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) (2013)."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2766462.2767880"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-662-49722-7_6"},{"volume-title":"Music similarity and retrieval: An introduction to audio-and web-based strategies","author":"Knees Peter","key":"e_1_3_2_1_13_1","unstructured":"Peter Knees and Markus Schedl . 2016. Music similarity and retrieval: An introduction to audio-and web-based strategies . Vol. 36 . Springer . Peter Knees and Markus Schedl. 2016. Music similarity and retrieval: An introduction to audio-and web-based strategies. Vol. 36. Springer."},{"key":"e_1_3_2_1_14_1","unstructured":"Thomas Lidy and Andreas Rauber. 2005. Evaluation of feature extractors and psycho-acoustic transformations for music genre classification. In ISMIR.  Thomas Lidy and Andreas Rauber. 2005. Evaluation of feature extractors and psycho-acoustic transformations for music genre classification. In ISMIR."},{"key":"e_1_3_2_1_15_1","volume-title":"Proc. Int. Conf. Music Information Retrieval.","author":"Lidy Thomas","year":"2007","unstructured":"Thomas Lidy , Andreas Rauber , Antonio Pertusa , and Jos\u00e9 Manuel I\u00f1esta Quereda . 2007 . Improving Genre Classification by Combination of Audio and Symbolic Descriptors Using a Transcription Systems .. In Proc. Int. Conf. Music Information Retrieval. Thomas Lidy, Andreas Rauber, Antonio Pertusa, and Jos\u00e9 Manuel I\u00f1esta Quereda. 2007. Improving Genre Classification by Combination of Audio and Symbolic Descriptors Using a Transcription Systems.. In Proc. Int. Conf. Music Information Retrieval."},{"key":"e_1_3_2_1_16_1","volume-title":"Proceedings of the Detection and Classification of Acoustic Scenes and Events 2016 Workshop (DCASE2016)","author":"Lidy Thomas","year":"2016","unstructured":"Thomas Lidy and Alexander Schindler . 2016 . CQT-based Convolutional Neural Networks for Audio Scene Classification . In Proceedings of the Detection and Classification of Acoustic Scenes and Events 2016 Workshop (DCASE2016) . 60--64. Thomas Lidy and Alexander Schindler. 2016. CQT-based Convolutional Neural Networks for Audio Scene Classification. In Proceedings of the Detection and Classification of Acoustic Scenes and Events 2016 Workshop (DCASE2016). 60--64."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"crossref","unstructured":"Beth Logan and Ariel Salomon. 2001. A Music Similarity Function Based on Signal Analysis.. In ICME. 22--25.  Beth Logan and Ariel Salomon. 2001. A Music Similarity Function Based on Signal Analysis.. In ICME. 22--25.","DOI":"10.1109\/ICME.2001.1237829"},{"volume-title":"Fundamentals of music processing: Audio, analysis, algorithms, applications","author":"M\u00fcller Meinard","key":"e_1_3_2_1_18_1","unstructured":"Meinard M\u00fcller . 2015. Fundamentals of music processing: Audio, analysis, algorithms, applications . Springer . Meinard M\u00fcller. 2015. Fundamentals of music processing: Audio, analysis, algorithms, applications. Springer."},{"key":"e_1_3_2_1_19_1","volume-title":"19th International Society for Music Information Retrieval Conference (ISMIR","author":"Park Jiyoung","year":"2018","unstructured":"Jiyoung Park , Jongpil Lee , Jangyeon Park , Jung-Woo Ha , and Juhan Nam . 2018 . Representation learning of music using artist labels . In 19th International Society for Music Information Retrieval Conference (ISMIR 2018). Jiyoung Park, Jongpil Lee, Jangyeon Park, Jung-Woo Ha, and Juhan Nam. 2018. Representation learning of music using artist labels. In 19th International Society for Music Information Retrieval Conference (ISMIR 2018)."},{"key":"e_1_3_2_1_20_1","volume-title":"ISMIR","volume":"2007","author":"Raimond Yves","year":"2007","unstructured":"Yves Raimond , Samer A Abdallah , Mark B Sandler , and Frederick Giasson . 2007 . The Music Ontology .. In ISMIR , Vol. 2007 . Citeseer, 8th. Yves Raimond, Samer A Abdallah, Mark B Sandler, and Frederick Giasson. 2007. The Music Ontology.. In ISMIR, Vol. 2007. Citeseer, 8th."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/2953886"},{"key":"e_1_3_2_1_22_1","volume-title":"Large Scale Audio-Visual Video Analytics Platform for Forensic Investigations of Terroristic Attacks. In International Conference on Multimedia Modeling. Springer, 106--119","author":"Schindler Alexander","year":"2019","unstructured":"Alexander Schindler , Martin Boyer , Andrew Lindley , David Schreiber , and Thomas Philipp . 2019 . Large Scale Audio-Visual Video Analytics Platform for Forensic Investigations of Terroristic Attacks. In International Conference on Multimedia Modeling. Springer, 106--119 . Alexander Schindler, Martin Boyer, Andrew Lindley, David Schreiber, and Thomas Philipp. 2019. Large Scale Audio-Visual Video Analytics Platform for Forensic Investigations of Terroristic Attacks. In International Conference on Multimedia Modeling. Springer, 106--119."},{"key":"e_1_3_2_1_23_1","volume-title":"Euro-Mediterranean Conference. Springer, 109--117","author":"Schindler Alexander","year":"2016","unstructured":"Alexander Schindler , Sergiu Gordea , and Harry van Biessum . 2016 . The euro-peana sounds music information retrieval pilot . In Euro-Mediterranean Conference. Springer, 109--117 . Alexander Schindler, Sergiu Gordea, and Harry van Biessum. 2016. The euro-peana sounds music information retrieval pilot. In Euro-Mediterranean Conference. Springer, 109--117."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CBMI.2019.8877462"},{"key":"e_1_3_2_1_25_1","volume-title":"9th Forum Media Technology (FMT2016)","volume":"1734","author":"Schindler Alexander","year":"2016","unstructured":"Alexander Schindler , Thomas Lidy , and Andreas Rauber . 2016 . Comparing shallow versus deep neural network architectures for automatic music genre classification . In 9th Forum Media Technology (FMT2016) , Vol. 1734 . 17--21. Alexander Schindler, Thomas Lidy, and Andreas Rauber. 2016. Comparing shallow versus deep neural network architectures for automatic music genre classification. In 9th Forum Media Technology (FMT2016), Vol. 1734. 17--21."},{"key":"e_1_3_2_1_26_1","volume-title":"Proceedings of the 13th International Conference on Music Information Retrieval (ISMIR","author":"Schindler Alexander","year":"2012","unstructured":"Alexander Schindler , Rudolf Mayer , and Andreas Rauber . 2012 . Facilitating comprehensive benchmarking experiments on the million song dataset . In Proceedings of the 13th International Conference on Music Information Retrieval (ISMIR 2012). Alexander Schindler, Rudolf Mayer, and Andreas Rauber. 2012. Facilitating comprehensive benchmarking experiments on the million song dataset. In Proceedings of the 13th International Conference on Music Information Retrieval (ISMIR 2012)."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298682"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2014.6854949"},{"key":"e_1_3_2_1_29_1","volume-title":"Marsyas: A framework for audio analysis. Organised sound 4, 3","author":"Tzanetakis George","year":"2000","unstructured":"George Tzanetakis and Perry Cook . 2000 . Marsyas: A framework for audio analysis. Organised sound 4, 3 (2000), 169--175. George Tzanetakis and Perry Cook. 2000. Marsyas: A framework for audio analysis. Organised sound 4, 3 (2000), 169--175."},{"key":"e_1_3_2_1_30_1","volume-title":"Proceedings of the Workshop on Advanced Technologies for Digital Libraries","author":"Walter Koch Henning Scholz","year":"2009","unstructured":"Henning Scholz Walter Koch . 2009 . DISMARC and BHL-Europe: multilingual access to two aggregation platforms for Europeana . In Proceedings of the Workshop on Advanced Technologies for Digital Libraries 2009. Trento, Italy, 25--29. Henning Scholz Walter Koch. 2009. DISMARC and BHL-Europe: multilingual access to two aggregation platforms for Europeana. In Proceedings of the Workshop on Advanced Technologies for Digital Libraries 2009. Trento, Italy, 25--29."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/2168752.2168754"}],"event":{"name":"SAC '20: The 35th ACM\/SIGAPP Symposium on Applied Computing","sponsor":["SIGAPP ACM Special Interest Group on Applied Computing"],"location":"Brno Czech Republic","acronym":"SAC '20"},"container-title":["Proceedings of the 35th Annual ACM Symposium on Applied Computing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3341105.3374114","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3341105.3374114","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:38:24Z","timestamp":1750199904000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3341105.3374114"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,3,30]]},"references-count":31,"alternative-id":["10.1145\/3341105.3374114","10.1145\/3341105"],"URL":"https:\/\/doi.org\/10.1145\/3341105.3374114","relation":{},"subject":[],"published":{"date-parts":[[2020,3,30]]},"assertion":[{"value":"2020-03-30","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}