{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,1,2]],"date-time":"2024-01-02T11:21:35Z","timestamp":1704194495034},"reference-count":34,"publisher":"MIT Press - Journals","issue":"1","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Computer Music Journal"],"published-print":{"date-parts":[[2015,3]]},"abstract":"<jats:p> This article presents an offline method for aligning an audio signal to individual instrumental parts constituting a musical score. The proposed method is based on fitting multiple hidden semi-Markov models (HSMMs) to the observed audio signal. The emission probability of each state of the HSMM is described using latent harmonic allocation (LHA), a Bayesian model of a harmonic sound mixture. Each HSMM corresponds to one musical instrument\u2019s part, and the state duration probability is conditioned on a linear dynamics system (LDS) tempo model. Variational Bayesian inference is used to jointly infer LHA, HSMM, and the LDS. We evaluate the capability of the method to align musical audio to its score, under reverberation, structural variations, and fluctuations in onset timing among different parts. <\/jats:p>","DOI":"10.1162\/comj_a_00286","type":"journal-article","created":{"date-parts":[[2015,3,24]],"date-time":"2015-03-24T17:07:22Z","timestamp":1427216842000},"page":"74-87","source":"Crossref","is-referenced-by-count":5,"title":["Bayesian Audio-to-Score Alignment Based on Joint Inference of Timbre, Volume, Tempo, and Note Onset Timings"],"prefix":"10.1162","volume":"39","author":[{"given":"Akira","family":"Maezawa","sequence":"first","affiliation":[{"name":"Graduate School of Informatics Department of Intelligence Science and Technology Kyoto University Yoshida-Honmachi Sakyo, Kyoto 606-8501, Japan"}]},{"given":"Hiroshi G.","family":"Okuno","sequence":"additional","affiliation":[{"name":"Graduate School of Informatics Department of Intelligence Science and Technology Kyoto University Yoshida-Honmachi Sakyo, Kyoto 606-8501, Japan"}]}],"member":"281","reference":[{"key":"p_1","first-page":"241","author":"Arzt A.","year":"2008","journal-title":"Proceedings of the European Conference on Artificial Intelligence"},{"key":"p_2","first-page":"208","author":"Bryan N. J.","year":"2013","journal-title":"Proceedings of the International Conference on Machine Learning"},{"key":"p_3","first-page":"1","author":"Cho T.","year":"2010","journal-title":"Proceedings of the Sound and Music Computing Conference"},{"key":"p_4","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2009.106"},{"key":"p_5","first-page":"29","author":"Devaney J.","year":"2009","journal-title":"Proceedings of the International Computer Music Conference"},{"key":"p_6","doi-asserted-by":"publisher","DOI":"10.1109\/JSTSP.2011.2159701"},{"key":"p_7","first-page":"197","author":"Duan Z.","year":"2011","journal-title":"Proceedings of the International Conference on Acoustics, Speech, and Signal Processing"},{"key":"p_8","first-page":"245","author":"Ewert S.","year":"2011","journal-title":"Proceedings of the International Conference on Music Information Retrieval"},{"key":"p_9","first-page":"129","author":"Ewert S.","year":"2012","journal-title":"Proceedings of the International Conference on Acoustics, Speech, and Signal Processing"},{"key":"p_10","first-page":"1869","author":"Ewert S.","year":"2009","journal-title":"Proceedings of the International Conference on Acoustics, Speech, and Signal Processing"},{"key":"p_11","first-page":"131","author":"Fremerey C.","year":"2007","journal-title":"Proceedings of the International Conference on Music Information Retrieval"},{"key":"p_12","first-page":"645","author":"Fremerey C.","year":"2009","journal-title":"Proceedings of the International Conference on Music Information Retrieval"},{"key":"p_13","first-page":"464","author":"Fujishima T.","year":"1999","journal-title":"Proceedings of the International Computer Music Conference"},{"key":"p_14","first-page":"553","author":"Goto M.","year":"2004","journal-title":"Proceedings of the International Congress on Acoustics"},{"key":"p_15","first-page":"145","author":"Han Y.","year":"2007","journal-title":"Proceedings of the International Conference on Music Information Retrieval"},{"key":"p_16","first-page":"185","author":"Hu N.","year":"2003","journal-title":"Proceedings of the Workshop on Applications of Signal Processing to Audio and Acoustics"},{"key":"p_17","first-page":"57","volume":"1","author":"Itoyama K.","year":"2007","journal-title":"Proceedings of the International Conference on Acoustics, Speech, and Signal Processing"},{"key":"p_18","first-page":"39","author":"Joder C.","year":"2010","journal-title":"Proceedings of the International Conference on Music Information Retrieval"},{"key":"p_19","first-page":"423","author":"Macrae R.","year":"2010","journal-title":"Proceedings of the International Conference on Music Information Retrieval"},{"key":"p_20","first-page":"477","author":"Maezawa A.","year":"2010","journal-title":"Proceedings of the International Conference on Music Information Retrieval"},{"key":"p_21","first-page":"185","author":"Maezawa A.","year":"2011","journal-title":"Proceedings of the International Conference on Acoustics, Speech, and Signal Processing"},{"key":"p_22","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2014.2355772"},{"key":"p_24","first-page":"225","author":"Molina-Solana M.","year":"2010","journal-title":"Proceedings of the International Conference on Music Information Retrieval"},{"key":"p_25","first-page":"193","author":"Montecchio N.","year":"2011","journal-title":"Proceedings of the International Conference on Acoustics, Speech, and Signal Processing"},{"key":"p_26","first-page":"389","author":"M\u00fcller M.","year":"2008","journal-title":"Proceedings of the International Conference on Music Information Retrieval"},{"key":"p_27","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2010.2041394"},{"key":"p_28","first-page":"9","author":"M\u00fcller M.","year":"2006","journal-title":"Proceedings of the International Conference on Acoustics, Speech, and Signal Processing"},{"key":"p_29","first-page":"417","author":"Niedermayer B.","year":"2010","journal-title":"Proceedings of the International Conference on Music Information Retrieval"},{"key":"p_30","first-page":"36","author":"Orio N.","year":"2003","journal-title":"Proceedings of the of the International Conference on New Interfaces for Music Expression"},{"key":"p_31","doi-asserted-by":"publisher","DOI":"10.1155\/2011\/384651"},{"key":"p_32","first-page":"267","author":"Peeling P.","year":"2007","journal-title":"Proceedings of the International Conference on Music Information Retrieval"},{"key":"p_33","first-page":"387","author":"Raphael C.","year":"2004","journal-title":"Proceedings of the International Conference on Music Information Retrieval"},{"key":"p_34","first-page":"2","author":"Sapp C. S.","year":"2007","journal-title":"Proceedings of the International Conference on Music Information Retrieval"},{"key":"p_36","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2011.2164530"}],"container-title":["Computer Music Journal"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mitpressjournals.org\/doi\/pdf\/10.1162\/COMJ_a_00286","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,3,12]],"date-time":"2021-03-12T21:33:47Z","timestamp":1615584827000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/comj\/article\/39\/1\/74-87\/94499"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,3]]},"references-count":34,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2015,3]]}},"alternative-id":["10.1162\/COMJ_a_00286"],"URL":"https:\/\/doi.org\/10.1162\/comj_a_00286","relation":{},"ISSN":["0148-9267","1531-5169"],"issn-type":[{"value":"0148-9267","type":"print"},{"value":"1531-5169","type":"electronic"}],"subject":[],"published":{"date-parts":[[2015,3]]}}}