{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T06:07:23Z","timestamp":1775282843036,"version":"3.50.1"},"reference-count":40,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2011,10,1]],"date-time":"2011-10-01T00:00:00Z","timestamp":1317427200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000145","name":"Division of Information and Intelligent Systems","doi-asserted-by":"publisher","award":["IIS-0811994"],"award-info":[{"award-number":["IIS-0811994"]}],"id":[{"id":"10.13039\/100000145","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2011,10]]},"abstract":"<jats:p>This article examines the use of two kinds of context to improve the results of content-based music taggers: the relationships between tags and between the clips of songs that are tagged. We show that users agree more on tags applied to clips temporally \u201ccloser\u201d to one another; that conditional restricted Boltzmann machine models of tags can more accurately predict related tags when they take context into account; and that when training data is \u201csmoothed\u201d using context, support vector machines can better rank these clips according to the original, unsmoothed tags and do this more accurately than three standard multi-label classifiers.<\/jats:p>","DOI":"10.1145\/2037676.2037689","type":"journal-article","created":{"date-parts":[[2011,11,8]],"date-time":"2011-11-08T08:32:01Z","timestamp":1320741121000},"page":"1-18","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":14,"title":["Contextual tag inference"],"prefix":"10.1145","volume":"7S","author":[{"given":"Michael I.","family":"Mandel","sequence":"first","affiliation":[{"name":"Universit\u00e9 de Montr\u00e9al, Qu\u00e9bec, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Razvan","family":"Pascanu","sequence":"additional","affiliation":[{"name":"Universit\u00e9 de Montr\u00e9al, Qu\u00e9bec, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Douglas","family":"Eck","sequence":"additional","affiliation":[{"name":"Universit\u00e9 de Montr\u00e9al, Qu\u00e9bec, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yoshua","family":"Bengio","sequence":"additional","affiliation":[{"name":"Universit\u00e9 de Montr\u00e9al, Qu\u00e9bec, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Luca M.","family":"Aiello","sequence":"additional","affiliation":[{"name":"Universit\u00e0 di Torino"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rossano","family":"Schifanella","sequence":"additional","affiliation":[{"name":"Universit\u00e0 di Torino"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Filippo","family":"Menczer","sequence":"additional","affiliation":[{"name":"Indiana University"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2011,11,4]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Proceedings of the International Symposium on Music Information Retrieval. 425--430","author":"Aucouturier J.","unstructured":"Aucouturier, J., Pachet, F., Roy, P., and Beuriv, A. 2007. Signal + context = better classification. In Proceedings of the International Symposium on Music Information Retrieval. 425--430."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1080\/09298210802479250"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.2307\/2987782"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2004.03.009"},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of the 22nd IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 3440--3446","author":"Chen L.","unstructured":"Chen, L., Xu, D., Tsang, I. W., and Luo, J. 2010. Tag-based web photo retrieval improved by batch mode re-tagging. In Proceedings of the 22nd IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 3440--3446."},{"key":"e_1_2_1_6_1","volume-title":"Proceedings of the Conference on Advances in Neural Information Processing Systems. S. Thrun, L. Saul, and B. Sch\u00f6lkopf, Eds., MIT Press","author":"Cortes C.","unstructured":"Cortes, C. and Mohri, M. 2004. Auc optimization vs. error rate minimization. In Proceedings of the Conference on Advances in Neural Information Processing Systems. S. Thrun, L. Saul, and B. Sch\u00f6lkopf, Eds., MIT Press, Cambridge, MA."},{"key":"e_1_2_1_7_1","volume-title":"Proceedings of the Conference on Advances in Neural Information Processing Systems. J. Platt, D. Koller, Y Singer, and S. Roweis, Eds., MIT Press","author":"Eck D.","unstructured":"Eck, D., Lamere, P., Bertin-Mahieux, T., and Green, S. 2008. Automatic generation of social tags for music recommendation. In Proceedings of the Conference on Advances in Neural Information Processing Systems. J. Platt, D. Koller, Y Singer, and S. Roweis, Eds., MIT Press, Cambridge, MA, 385--392."},{"key":"e_1_2_1_8_1","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence. 469--474","author":"Han Y.","unstructured":"Han, Y., Wu, F., Jia, J., Zhuang, Y., and Yu, B. 2010. Multi-task sparse discriminant analysis (MtSDA) with overlapping categories. In Proceedings of the AAAI Conference on Artificial Intelligence. 469--474."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10994-009-5119-5"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","unstructured":"Heitz G. and Koller D. 2008. Learning spatial context: Using stuff to find things. In Proceedings of the European Conference on Computer Vision. D. Forsyth P. Torr and A. Zisserman Eds. Lecture Notes in Computer Science Series vol. 5302 Springer 30--43. 10.1007\/978-3-540-88682-2_4","DOI":"10.1007\/978-3-540-88682-2_4"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1162\/089976602760128018"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-008-0137-5"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2006.90"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1390156.1390224"},{"key":"e_1_2_1_15_1","volume-title":"Proceedings of the International Symposium on Music Information Retrieval. 183--188","author":"Lee J. H.","year":"2010","unstructured":"Lee, J. H. 2010. Crowdsourcing music similarity judgments using mechanical turk. In Proceedings of the International Symposium on Music Information Retrieval. 183--188."},{"key":"e_1_2_1_16_1","unstructured":"Mandel M. Pascanu R. Larochelle H. and Bengio Y. 2011. Autotagging music with conditional restricted boltzmann machines. http:\/\/arxiv.org\/abs\/1103.2832."},{"key":"e_1_2_1_17_1","volume-title":"Proceedings of the International Symposium on Music Information Retrieval. 399--404","author":"Mandel M. I.","unstructured":"Mandel, M. I., Eck, D., and Bengio, Y. 2010. Learning tags that vary within a song. In Proceedings of the International Symposium on Music Information Retrieval. 399--404."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1080\/09298210802479300"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","unstructured":"Manning C. Raghavan P. and Sch\u00fctze H. 2008. Introduction to Information Retrieval. Cambridge University Press.","DOI":"10.5555\/1394399"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/1526709.1526796"},{"key":"e_1_2_1_21_1","volume-title":"Proceedings of the International Symposium on Music Information Retrieval. 297--302","author":"Miotto R.","unstructured":"Miotto, R., Barrington, L., and Lanckriet, G. 2010. Improving auto-tagging by modeling semantic co-occurrences. In Proceedings of the International Symposium on Music Information Retrieval. 297--302."},{"key":"e_1_2_1_22_1","volume-title":"Proceedings of the Conference on Advances in Neural Information Processing Systems. S. Thrun, L. Saul, and B. Sch\u00f6lkopf, Eds., MIT Press","author":"Murphy K.","unstructured":"Murphy, K., Torralba, A., and Freeman, W. T. 2004. Using the forest to see the trees: A graphical model relating features, objects, and scenes. In Proceedings of the Conference on Advances in Neural Information Processing Systems. S. Thrun, L. Saul, and B. Sch\u00f6lkopf, Eds., MIT Press, Cambridge, MA."},{"key":"e_1_2_1_23_1","volume-title":"Proceedings of the International Conference on Computer Vision. IEEE, 1--8.","author":"Rabinovich A.","unstructured":"Rabinovich, A., Vedaldi, A., Galleguillos, C., Wiewiora, E., and Belongie, S. 2007. Objects in context. In Proceedings of the International Conference on Computer Vision. IEEE, 1--8."},{"key":"e_1_2_1_24_1","volume-title":"Proceedings of the International Conference on Computer Vision and Pattern Recognition. IEEE","author":"Rasiwasia N.","year":"1889","unstructured":"Rasiwasia, N. and Vasconcelos, N. 2009. Holistic context modeling using semantic co-occurrences. In Proceedings of the International Conference on Computer Vision and Pattern Recognition. IEEE, Los Alamitos, CA, 1889--1895."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/1273496.1273596"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/1718487.1718521"},{"key":"e_1_2_1_27_1","volume-title":"InProceedings of the International Conference on Acoustics, Speech, and Signal Processing.","author":"Slaney M.","year":"2002","unstructured":"Slaney, M. 2002. Semantic-audio retrieval. InProceedings of the International Conference on Acoustics, Speech, and Signal Processing."},{"key":"e_1_2_1_28_1","volume-title":"Information Processing in Dynamical Systems: Foundations of Harmony Theory","author":"Smolensky P.","unstructured":"Smolensky, P. 1986. Information Processing in Dynamical Systems: Foundations of Harmony Theory. MIT Press."},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.5555\/1613715.1613751"},{"key":"e_1_2_1_30_1","volume-title":"Proceedings of the Workshop on Internet Vision at the IEEE Conference on Computer Vision and Pattern Recognition. 1--8.","author":"Sorokin A.","unstructured":"Sorokin, A. and Forsyth, D. 2008. Utility data annotation with amazon mechanical turk. In Proceedings of the Workshop on Internet Vision at the IEEE Conference on Computer Vision and Pattern Recognition. 1--8."},{"key":"e_1_2_1_31_1","volume-title":"Proceedings of the Conference on Advances in Neural Information Processing Systems. B. Schiilkopf, J. Platt, and T. Hoffman, Eds., MIT Press","author":"Taylor G.","unstructured":"Taylor, G., Hinton, G. E., and Roweis, S. 2007. Modeling human motion using binary latent variables. In Proceedings of the Conference on Advances in Neural Information Processing Systems. B. Schiilkopf, J. Platt, and T. Hoffman, Eds., MIT Press, Cambridge, MA, 1345--1352."},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/1743384.1743400"},{"key":"e_1_2_1_33_1","volume-title":"Proceedings of the International Symposium on Music Information Retrieval.","author":"Trohidis K.","unstructured":"Trohidis, K., Tsoumakas, G., Kalliris, G., and Vlahavas, I. 2008. Multilabel classification of music into emotions. In Proceedings of the International Symposium on Music Information Retrieval."},{"key":"e_1_2_1_34_1","first-page":"667","article-title":"Mining multi-label data. In Data Mining and Knowledge Discovery Handbook, O. Maimon and L. Rokach, Eds","volume":"34","author":"Tsoumakas G.","year":"2010","unstructured":"Tsoumakas, G., Katakis, I., and Vlahavas, I. 2010. Mining multi-label data. In Data Mining and Knowledge Discovery Handbook, O. Maimon and L. Rokach, Eds., Chapter 34, 667--685.","journal-title":"Chapter"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.5555\/2021026.2021078"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-74958-5_38"},{"key":"e_1_2_1_37_1","volume-title":"Proceedings of the Conference on Advances in Neural Information Processing Systems. Y. Bengio, D. Schuurmans, C. Williams, J. Lafferty, and A. Culotta, Eds.","author":"Whitehill J.","unstructured":"Whitehill, J., Ruvolo, P., Wu, T., Bergsma, J., and Movellan, J. 2009. Whose vote should count more: Optimal integration of labels from labelers of unknown expertise. In Proceedings of the Conference on Advances in Neural Information Processing Systems. Y. Bengio, D. Schuurmans, C. Williams, J. Lafferty, and A. Culotta, Eds., 2035--2043."},{"key":"e_1_2_1_38_1","volume-title":"Proceedings of the IEEE Workshop on Multimedia Signal Processing. 153--156","author":"Whitman B.","unstructured":"Whitman, B. and Rifkin, R. 2002. Musical query-by-description as a multiclass learning problem. In Proceedings of the IEEE Workshop on Multimedia Signal Processing. 153--156."},{"key":"e_1_2_1_39_1","volume-title":"Proceedings of the International Conference on Computer Vision and Pattern Recognition. IEEE, 17--24","author":"Yao B.","unstructured":"Yao, B. and Fei-Fei, L. 2010. Modeling mutual context of object and human pose in human-object interaction activities. In Proceedings of the International Conference on Computer Vision and Pattern Recognition. IEEE, 17--24."},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2006.12.019"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2037676.2037689","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2037676.2037689","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2037676.2037689","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,18]],"date-time":"2025-11-18T09:33:27Z","timestamp":1763458407000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2037676.2037689"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,10]]},"references-count":40,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2011,10]]}},"alternative-id":["10.1145\/2037676.2037689"],"URL":"https:\/\/doi.org\/10.1145\/2037676.2037689","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,10]]},"assertion":[{"value":"2010-10-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2011-08-01","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2011-11-04","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}