{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,20]],"date-time":"2026-03-20T08:11:57Z","timestamp":1773994317127,"version":"3.50.1"},"reference-count":35,"publisher":"Elsevier BV","license":[{"start":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T00:00:00Z","timestamp":1777593600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.elsevier.com\/tdm\/userlicense\/1.0\/"},{"start":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T00:00:00Z","timestamp":1777593600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.elsevier.com\/legal\/tdmrep-license"},{"start":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T00:00:00Z","timestamp":1777593600000},"content-version":"stm-asf","delay-in-days":0,"URL":"https:\/\/doi.org\/10.15223\/policy-017"},{"start":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T00:00:00Z","timestamp":1777593600000},"content-version":"stm-asf","delay-in-days":0,"URL":"https:\/\/doi.org\/10.15223\/policy-037"},{"start":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T00:00:00Z","timestamp":1777593600000},"content-version":"stm-asf","delay-in-days":0,"URL":"https:\/\/doi.org\/10.15223\/policy-012"},{"start":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T00:00:00Z","timestamp":1777593600000},"content-version":"stm-asf","delay-in-days":0,"URL":"https:\/\/doi.org\/10.15223\/policy-029"},{"start":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T00:00:00Z","timestamp":1777593600000},"content-version":"stm-asf","delay-in-days":0,"URL":"https:\/\/doi.org\/10.15223\/policy-004"}],"content-domain":{"domain":["elsevier.com","sciencedirect.com"],"crossmark-restriction":true},"short-container-title":["Expert Systems with Applications"],"published-print":{"date-parts":[[2026,5]]},"DOI":"10.1016\/j.eswa.2026.131173","type":"journal-article","created":{"date-parts":[[2026,1,10]],"date-time":"2026-01-10T16:05:19Z","timestamp":1768061119000},"page":"131173","update-policy":"https:\/\/doi.org\/10.1016\/elsevier_cm_policy","source":"Crossref","is-referenced-by-count":0,"special_numbering":"C","title":["Fusing memory and attention: A study on LSTM, transformer and hybrid architectures for symbolic music generation"],"prefix":"10.1016","volume":"308","author":[{"ORCID":"https:\/\/orcid.org\/0009-0005-1433-7063","authenticated-orcid":false,"given":"Soudeep","family":"Ghoshal","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0009-0006-5956-6692","authenticated-orcid":false,"given":"Sandipan","family":"Chakraborty","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0009-0006-2856-2365","authenticated-orcid":false,"given":"Pradipto","family":"Chowdhury","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3679-3498","authenticated-orcid":false,"given":"Himanshu","family":"Buckchash","sequence":"additional","affiliation":[]}],"member":"78","reference":[{"key":"10.1016\/j.eswa.2026.131173_bib0001","series-title":"Classical form: A theory of formal functions for the instrumental music of Haydn, Mozart, and Beethoven","author":"Caplin","year":"1998"},{"key":"10.1016\/j.eswa.2026.131173_bib0002","unstructured":"Cuthbert, M. S., & Ariza, C. (2010). music21: A toolkit for computer-aided musicology and the generation of musical examples. https:\/\/www.music21.org\/music21docs\/. Python-based toolkit for computer-aided musicology."},{"key":"10.1016\/j.eswa.2026.131173_bib0003","unstructured":"Esling, P. et al. (2022). Challenges in creative generative models for music: a divergence maximization perspective. arXiv: 2211.08856."},{"key":"10.1016\/j.eswa.2026.131173_bib0004","series-title":"Study of Counterpoint: From Johann Joseph Fux\u2019s Gradus Ad Parnassum","author":"Fux","year":"1965"},{"key":"10.1016\/j.eswa.2026.131173_bib0005","unstructured":"Hawthorne, C., Simon, I., Roberts, A., Zeghidour, N., Gardner, J., Manilow, E., & Engel, J. (2022). Multi-instrument music synthesis with spectrogram diffusion. https:\/\/arxiv.org\/abs\/2206.05408."},{"key":"10.1016\/j.eswa.2026.131173_bib0006","unstructured":"Huang, C.-Z. A., Vaswani, A., Uszkoreit, J., Shazeer, N., Simon, I., Hawthorne, C., Dai, A. M., Hoffman, M. D., Dinculescu, M., & Eck, D. (2018). Music transformer. https:\/\/arxiv.org\/abs\/1809.04281."},{"key":"10.1016\/j.eswa.2026.131173_bib0007","series-title":"Sweet anticipation: Music and the psychology of expectation","author":"Huron","year":"2008"},{"key":"10.1016\/j.eswa.2026.131173_bib0008","article-title":"A comprehensive survey for evaluation methodologies of AI-generated music","author":"Ji","year":"2023","journal-title":"ResearchGate"},{"key":"10.1016\/j.eswa.2026.131173_bib0009","unstructured":"Kingma, D. P., & Ba, J. (2014). Adam: A method for stochastic optimization. arXiv: 1412.6980."},{"issue":"2","key":"10.1016\/j.eswa.2026.131173_bib0010","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1016\/j.jcm.2016.02.012","article-title":"A guideline of selecting and reporting intraclass correlation coefficients for reliability research","volume":"15","author":"Koo","year":"2016","journal-title":"Journal of Chiropractic Medicine"},{"key":"10.1016\/j.eswa.2026.131173_bib0011","series-title":"Cognitive foundations of musical pitch","author":"Krumhansl","year":"2001"},{"key":"10.1016\/j.eswa.2026.131173_bib0012","series-title":"Tonal pitch space","author":"Lerdahl","year":"2001"},{"key":"10.1016\/j.eswa.2026.131173_bib0013","unstructured":"Li, M., Soltanolkotabi, M., & Oymak, S. (2019). Gradient descent with early stopping is provably robust to label noise for overparameterized neural networks. https:\/\/arxiv.org\/abs\/1903.11680."},{"issue":"8","key":"10.1016\/j.eswa.2026.131173_bib0014","doi-asserted-by":"crossref","DOI":"10.3390\/math11081915","article-title":"Melodydiffusion: Chord-conditioned melody generation using a transformer-based diffusion model","volume":"11","author":"Li","year":"2023","journal-title":"Mathematics"},{"key":"10.1016\/j.eswa.2026.131173_bib0015","series-title":"Hearing in time: Psychological aspects of musical meter","author":"London","year":"2012"},{"issue":"2","key":"10.1016\/j.eswa.2026.131173_bib0016","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1068\/p110115","article-title":"The perception of musical rhythms","volume":"11","author":"Longuet-Higgins","year":"1982","journal-title":"Perception"},{"key":"10.1016\/j.eswa.2026.131173_bib0017","unstructured":"Mariani, G., Tallini, I., Postolache, E., Mancusi, M., Cosmo, L., & Rodol\u00e0, E. (2023). Multi-source diffusion models for simultaneous music generation and separation. arXiv: 2302.02257."},{"issue":"2","key":"10.1016\/j.eswa.2026.131173_bib0018","doi-asserted-by":"crossref","first-page":"205","DOI":"10.2307\/745814","article-title":"New directions in the theory and analysis of musical contour","volume":"15","author":"Morris","year":"1993","journal-title":"Music Theory Spectrum"},{"key":"10.1016\/j.eswa.2026.131173_bib0019","series-title":"Proceedings of the AAAI conference on artificial intelligence","first-page":"408","article-title":"Symbolic music generation with transformer-GANs","volume":"vol. 35","author":"Muhamed","year":"2021"},{"key":"10.1016\/j.eswa.2026.131173_bib0020","doi-asserted-by":"crossref","first-page":"114","DOI":"10.54254\/2755-2721\/21\/20231129","article-title":"Investigating midi data simplification by ai models","volume":"21","author":"Ou","year":"2023","journal-title":"Applied and Computational Engineering"},{"issue":"5","key":"10.1016\/j.eswa.2026.131173_bib0021","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1525\/mp.2006.23.5.377","article-title":"Expectation in melody: The influence of context and learning","volume":"23","author":"Pearce","year":"2006","journal-title":"Music Perception"},{"key":"10.1016\/j.eswa.2026.131173_bib0022","series-title":"Harmony","author":"Piston","year":"1959"},{"key":"10.1016\/j.eswa.2026.131173_bib0023","unstructured":"Schaffrath, H. (1987). The essen folksong collection. http:\/\/essen.themefinder.org\/. Collection of 10,000 folksongs from around the world, particularly from areas around Germany."},{"key":"10.1016\/j.eswa.2026.131173_bib0024","series-title":"The musical idea and the logic, technique, and art of its presentation","author":"Schoenberg","year":"2006"},{"key":"10.1016\/j.eswa.2026.131173_bib0025","first-page":"1","article-title":"Method for the subjective assessment of intermediate quality level of audio systems","volume":"2","author":"Series","year":"2014","journal-title":"International Telecommunication Union Radiocommunication Assembly"},{"key":"10.1016\/j.eswa.2026.131173_bib0026","unstructured":"Shahid, A. R. et al. (2022). Music generation using an LSTM. https:\/\/arxiv.org\/abs\/2203.12105."},{"key":"10.1016\/j.eswa.2026.131173_bib0027","article-title":"Theme transformer: Symbolic music generation with theme-conditioned transformer","author":"Shih","year":"2022","journal-title":"IEEE Transactions on Multimedia"},{"key":"10.1016\/j.eswa.2026.131173_bib0028","article-title":"Performance RNN: Generating music with expressive timing and dynamics","author":"Simon","year":"2017","journal-title":"Magenta Blog"},{"key":"10.1016\/j.eswa.2026.131173_bib0029","unstructured":"Singh, P., & Arora, V. (2024). Explainable deep learning analysis for raga identification in indian art music. https:\/\/arxiv.org\/abs\/2406.02443."},{"key":"10.1016\/j.eswa.2026.131173_bib0030","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, I., & Polosukhin, I. (2017). Attention is all you need. In Advances in neural information processing systems. (vol. 30). https:\/\/papers.nips.cc\/paper_files\/paper\/2017\/hash\/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html."},{"key":"10.1016\/j.eswa.2026.131173_bib0031","unstructured":"Wang, P.-H., Hsieh, S.-I., Chang, S.-C., Chen, Y.-T., Pan, J.-Y., Wei, W., & Juan, D.-C. (2020). Contextual temperature for language modeling. arXiv: 2012.13575."},{"issue":"4","key":"10.1016\/j.eswa.2026.131173_bib0032","first-page":"1","article-title":"Conditional LSTM-GAN for melody generation from lyrics","volume":"16","author":"Yu","year":"2020","journal-title":"ACM Transactions on Multimedia Computing, Communications, and Applications"},{"key":"10.1016\/j.eswa.2026.131173_bib0033","series-title":"2024\u202fIEEE 34th international workshop on machine learning for signal processing (MLSP)","first-page":"1","article-title":"Composer style-specific symbolic music generation using vector quantized discrete diffusion models","author":"Zhang","year":"2024"},{"issue":"1","key":"10.1016\/j.eswa.2026.131173_bib0034","first-page":"1","article-title":"A comparative study of LSTM and transformer models in music melody generation","volume":"1","author":"Zheng","year":"2023","journal-title":"Journal of Global Arts Studies (JGAS)"},{"key":"10.1016\/j.eswa.2026.131173_bib0035","unstructured":"Zou, Y., Zou, P., Zhao, Y., Zhang, K., Zhang, R., & Wang, X. (2021). Melons: generating melody with long-term structure using transformers and structure graph. In Proceedings of the 22nd international society for music information retrieval conference (ISMIR). https:\/\/arxiv.org\/abs\/2110.05020."}],"container-title":["Expert Systems with Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/api.elsevier.com\/content\/article\/PII:S0957417426000874?httpAccept=text\/xml","content-type":"text\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/api.elsevier.com\/content\/article\/PII:S0957417426000874?httpAccept=text\/plain","content-type":"text\/plain","content-version":"vor","intended-application":"text-mining"}],"deposited":{"date-parts":[[2026,3,20]],"date-time":"2026-03-20T05:18:20Z","timestamp":1773983900000},"score":1,"resource":{"primary":{"URL":"https:\/\/linkinghub.elsevier.com\/retrieve\/pii\/S0957417426000874"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,5]]},"references-count":35,"alternative-id":["S0957417426000874"],"URL":"https:\/\/doi.org\/10.1016\/j.eswa.2026.131173","relation":{},"ISSN":["0957-4174"],"issn-type":[{"value":"0957-4174","type":"print"}],"subject":[],"published":{"date-parts":[[2026,5]]},"assertion":[{"value":"Elsevier","name":"publisher","label":"This article is maintained by"},{"value":"Fusing memory and attention: A study on LSTM, transformer and hybrid architectures for symbolic music generation","name":"articletitle","label":"Article Title"},{"value":"Expert Systems with Applications","name":"journaltitle","label":"Journal Title"},{"value":"https:\/\/doi.org\/10.1016\/j.eswa.2026.131173","name":"articlelink","label":"CrossRef DOI link to publisher maintained version"},{"value":"article","name":"content_type","label":"Content Type"},{"value":"\u00a9 2026 Elsevier Ltd. All rights are reserved, including those for text and data mining, AI training, and similar technologies.","name":"copyright","label":"Copyright"}],"article-number":"131173"}}