{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T23:45:56Z","timestamp":1740181556755,"version":"3.37.3"},"reference-count":38,"publisher":"Springer Science and Business Media LLC","issue":"6","license":[{"start":{"date-parts":[[2022,10,6]],"date-time":"2022-10-06T00:00:00Z","timestamp":1665014400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,10,6]],"date-time":"2022-10-06T00:00:00Z","timestamp":1665014400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001691","name":"japan society for the promotion of science","doi-asserted-by":"crossref","award":["16H01744"],"award-info":[{"award-number":["16H01744"]}],"id":[{"id":"10.13039\/501100001691","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["SN COMPUT. SCI."],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>This paper proposes an unsupervised learning of chord classification aiming at an autonomous recognition of chord functions. In this research, we employ hidden semi-Markov model to incorporate music metrical structure, and in addition, we combine the model with neural network components to embed context information such as beat positions and preceding chord sequences. Experimental results show that the added contexts considerably improve the perplexity. With the help of these neural networks, the proposed model automatically learns hidden states that appropriately represent chord categories. To this purpose, we pre-process the dataset minimally; that is, we only transpose pieces so as not to possess key signatures and ignore octave positions in pitch events. We observe the chord categories effectively cover chords that appeared in the corpus. We further show that the transitions between chord categories reflect the difference of tonalities with a tendency consistent with known chord functions.<\/jats:p>","DOI":"10.1007\/s42979-022-01395-4","type":"journal-article","created":{"date-parts":[[2022,10,6]],"date-time":"2022-10-06T10:02:21Z","timestamp":1665050541000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Chord Function Recognition as Latent State Transition"],"prefix":"10.1007","volume":"3","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0028-2549","authenticated-orcid":false,"given":"Yui","family":"Uehara","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Satoshi","family":"Tojo","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2022,10,6]]},"reference":[{"key":"1395_CR1","volume-title":"Treatise on Harmony","author":"JP Rameau","year":"1971","unstructured":"Rameau JP. Treatise on Harmony. New York: Dover Publications; 1971."},{"key":"1395_CR2","volume-title":"Harmony Simplified: Or the Theory of the Tonal Functions of Chords","author":"H Riemann","year":"1896","unstructured":"Riemann H. Harmony Simplified: Or the Theory of the Tonal Functions of Chords. London: Augener; 1896."},{"key":"1395_CR3","unstructured":"Schoenberg A. Structural Functions of Harmony (revised edition), W. W. Norton & Company 1969"},{"key":"1395_CR4","unstructured":"Mulholland J, Hojnacki T. The Berklee Book of Jazz Harmony, Berklee Press; 2013"},{"key":"1395_CR5","unstructured":"Granroth\u2013Wilding M, Steedman M. Statistical parsing for harmonic analysis of jazz chord sequences, In: International Computer Music Conference, 2012; pp.478\u2013485"},{"issue":"1","key":"1395_CR6","doi-asserted-by":"publisher","first-page":"35","DOI":"10.1080\/17459737.2011.573676","volume":"5","author":"M Rohrmeier","year":"2011","unstructured":"Rohrmeier M. Towards a generative syntax of tonal harmony. J Math Music. 2011;5(1):35\u201353.","journal-title":"J Math Music"},{"key":"1395_CR7","unstructured":"Kuo F, Shan M. A personalized music filtering system based on melody style classification, In: Proceedings of the 2002 IEEE International Conference on Data Mining, 2002;pp.649\u2013652"},{"key":"1395_CR8","doi-asserted-by":"crossref","unstructured":"Groves R. Automatic Harmonization Using a Hidden Semi-Markov Model, In: AIIDE Workshop,2013; pp.48\u201354","DOI":"10.1609\/aiide.v9i5.12654"},{"key":"1395_CR9","unstructured":"Anders T, Eduardo RM. A computational model that generalises schoenberg\u2019s guidelines for favourable chord progressions, In: 6th Sound and Music Computing Conference, 2009;pp.48\u201352"},{"key":"1395_CR10","doi-asserted-by":"crossref","unstructured":"Navarro M, Caetano M, Bernardes G, Nunes de Castro L, Corchado JM. In: International Conference on Evolutionary and Biologically Inspired Music and Art, 2015;pp.175\u2013186","DOI":"10.1007\/978-3-319-16498-4_16"},{"key":"1395_CR11","unstructured":"Hadjeres G, Pachet F, Nielsen F. Deepbach: a steerable model for bach chorales generation, In: International Conference on Machine Learning, 2017;pp.1362\u20131371"},{"key":"1395_CR12","unstructured":"Huang C-Z A, Vaswani A, Uszkoreit J, Shazeer N, Hawthorne C, Dai A M, Hoffman M D, Eck D. Music Transformer: Generating Music with Long-Term Structure,;2018 arXiv preprint arXiv:1809.04281"},{"key":"1395_CR13","unstructured":"Rohrmeier M, Cross I. Statistical properties of tonal harmony in Bach\u2019s chorales, In: 10th International Conference on Music Perception and Cognition, 2008; pp. 619\u2013627"},{"issue":"3","key":"1395_CR14","doi-asserted-by":"publisher","first-page":"219","DOI":"10.1080\/09298215.2015.1036888","volume":"44","author":"N Jacoby","year":"2015","unstructured":"Jacoby N, Tishby N, Tymoczko D. An information theoretic approach to chord categorization and functional harmony. J New Music Res. 2015;44(3):219\u201344.","journal-title":"J New Music Res."},{"issue":"3","key":"1395_CR15","doi-asserted-by":"publisher","first-page":"226","DOI":"10.1080\/09298215.2018.1447584","volume":"47","author":"H Tsushima","year":"2018","unstructured":"Tsushima H, Nakamura E, Itoyama K, Yoshii K. Generative statistical models with self-emergent grammar of chord sequences. J New Music Res. 2018;47(3):226\u201348.","journal-title":"J New Music Res."},{"issue":"2","key":"1395_CR16","doi-asserted-by":"publisher","first-page":"314","DOI":"10.1093\/mts\/mty021","volume":"40","author":"CW White","year":"2018","unstructured":"White CW, Quinn I. Chord context and harmonic function in tonal music. Music Theory Spectr. 2018;40(2):314\u201335.","journal-title":"Music Theory Spectr."},{"key":"1395_CR17","unstructured":"Uehara Y, Nakamura E, Tojo S. Chord function identification with modulation detection based on HMM, In: Proceedings of 14th International Symposium on Computer Music Multidisciplinary Research, 2019; pp. 59\u201370"},{"issue":"3","key":"1395_CR18","doi-asserted-by":"publisher","first-page":"232","DOI":"10.1080\/09298215.2019.1613436","volume":"48","author":"HV Koops","year":"2019","unstructured":"Koops HV, de Haas WB, Burgoyne JA, Bransen J, Kent-Muller A, Volk A. Annotator subjectivity in harmony annotations of popular music. J New Music Res. 2019;48(3):232\u201352.","journal-title":"J New Music Res."},{"key":"1395_CR19","first-page":"24","volume":"15","author":"D Temperley","year":"2007","unstructured":"Temperley D. The tonal properties of pitch-class sets: Tonal implication, tonal ambiguity, and tonalness. Comput Musicol. 2007;15:24\u201338.","journal-title":"Comput Musicol."},{"key":"1395_CR20","doi-asserted-by":"crossref","unstructured":"Tran K, Bisk Y, Vaswani A, Marcu D, Knight K. Unsupervised neural hidden markov models, In: Proceedings of the Workshop on Structured Prediction for NLP, 2016; pp. 63\u201371","DOI":"10.18653\/v1\/W16-5907"},{"key":"1395_CR21","unstructured":"Harasim D, O\u2019Donnell TJ, Rohrmeier M. Harmonic syntax in time rhythm improves grammatical models of harm, In: Proceedings of the 20th International Conference on Music Information Retrieval, 2019; pp. 335\u2013342"},{"key":"1395_CR22","unstructured":"Christodoulopoulos C, Goldwater S, Steedman M. A Bayesian mixture model for PoS induction using multiple features, In: Proceedings of the 2011 conference on empirical methods in Natural Language Processing, 2011; pp. 638\u2013647"},{"key":"1395_CR23","unstructured":"Blunsom P, Cohn T. A hierarchical Pitman-Yor Process HMM for unsupervised part of speech induction, In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011; pp. 865\u2013874"},{"key":"1395_CR24","unstructured":"Yu D, Deng L. Dahl GE, Roles of pre-training and fine-tuning in context-dependent DBN-HMMs for real-world speech recognition, In: NIPS 2010 Workshop on Deep Learning and Unsupervised Feature Learning; 2010"},{"key":"1395_CR25","doi-asserted-by":"crossref","unstructured":"Hermansky H, Ellis DPW, Sharma S. Tandem connectionist feature extraction for conventional HMM systems, In: 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing, Proceedings (Cat. No.00CH37100), 2000; vol. 3, pp. 1635\u20131638","DOI":"10.1109\/ICASSP.2000.862024"},{"key":"1395_CR26","unstructured":"Dai H, Dai B, Zhang YM, Li S, Song L. Recurrent hidden semi-markov model, In: Proceedings of the International Conference on Learning Representations; 2017"},{"issue":"8","key":"1395_CR27","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S Hochreiter","year":"1997","unstructured":"Hochreiter S, Schmidhuber J. Long Short-Term Memory. Neural Comput. 1997;9(8):1735\u201380.","journal-title":"Neural Comput"},{"issue":"2","key":"1395_CR28","doi-asserted-by":"publisher","first-page":"257","DOI":"10.1109\/5.18626","volume":"77","author":"LR Rabiner","year":"1989","unstructured":"Rabiner LR. A tutorial on hidden markov models and selected applications in speech recognition. Proc IEEE. 1989;77(2):257\u201386.","journal-title":"Proc IEEE"},{"key":"1395_CR29","unstructured":"Kim Y, Wiseman S. Rush AM, A tutorial on deep latent variable models of natural language, 2018 arXiv preprint, arXiv:1812.06834"},{"issue":"4","key":"1395_CR30","doi-asserted-by":"publisher","first-page":"1035","DOI":"10.1002\/j.1538-7305.1983.tb03114.x","volume":"62","author":"SE Levinson","year":"1983","unstructured":"Levinson SE, Rabiner LR, Sondhi MM. An introduction to the application of the theory of probabilistic functions of a markov process to automatic speech recognition. Bell Syst Tech J. 1983;62(4):1035\u201374.","journal-title":"Bell Syst Tech J"},{"key":"1395_CR31","unstructured":"Ferguson JD. Variable duration models for speech, In Proc. Symposium on the Application of Hidden Markov Models to Text and Speech, 1980;143\u2013179"},{"issue":"1","key":"1395_CR32","doi-asserted-by":"publisher","first-page":"29","DOI":"10.1016\/S0885-2308(86)80009-2","volume":"1","author":"SE Levinson","year":"1986","unstructured":"Levinson SE. Continuously variable duration hidden Markov models for automatic speech recognition. Comput Speech Lang. 1986;1(1):29\u201345.","journal-title":"Comput Speech Lang."},{"issue":"1","key":"1395_CR33","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1109\/LSP.2002.806705","volume":"10","author":"SZ Yu","year":"2003","unstructured":"Yu SZ, Kobayashi H. An efficient forward-backward algorithm for an explicit-duration hidden Markov model. IEEE Signal Process Lett. 2003;10(1):11\u20134.","journal-title":"IEEE Signal Process Lett"},{"issue":"2","key":"1395_CR34","doi-asserted-by":"publisher","first-page":"215","DOI":"10.1016\/j.artint.2009.11.011","volume":"174","author":"SZ Yu","year":"2010","unstructured":"Yu SZ. Hidden semi-Markov models. Artif Intell. 2010;174(2):215\u201343.","journal-title":"Artif Intell"},{"key":"1395_CR35","unstructured":"Liu L, Jiang H, He P, Chen W, Liu X, Gao J, Han J. On the variance of the adaptive learning rate and beyond, In: International Conference on Learning Representations; 2020"},{"key":"1395_CR36","unstructured":"Hinton GE, Srivastava N, Krizhevsky A, Sutskever I, Salakhutdinov RR. Improving neural networks by preventing co-adaptation of feature detectors, CoRR abs\/1207.0580; 2012"},{"key":"1395_CR37","unstructured":"Cuthbert MS, Ariza C. music21: a toolkit for computer-aided musicology and symbolic music data, In: Proceedings of the 11th International Society for Music Information Retrieval Conference;2010"},{"key":"1395_CR38","unstructured":"Dahn L. So how many Bach four-part chorales are there?, http:\/\/www.bach-chorales.com\/HowManyChorales.htm,; 2018"}],"container-title":["SN Computer Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s42979-022-01395-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s42979-022-01395-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s42979-022-01395-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,10,5]],"date-time":"2024-10-05T08:09:30Z","timestamp":1728115770000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s42979-022-01395-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,6]]},"references-count":38,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2022,11]]}},"alternative-id":["1395"],"URL":"https:\/\/doi.org\/10.1007\/s42979-022-01395-4","relation":{},"ISSN":["2661-8907"],"issn-type":[{"type":"electronic","value":"2661-8907"}],"subject":[],"published":{"date-parts":[[2022,10,6]]},"assertion":[{"value":"22 July 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"29 August 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 October 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"On behalf of all authors, the corresponding author states that there is no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}],"article-number":"508"}}