{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,2]],"date-time":"2025-08-02T17:47:09Z","timestamp":1754156829216,"version":"3.41.2"},"reference-count":40,"publisher":"Emerald","issue":"5","license":[{"start":{"date-parts":[[2023,3,17]],"date-time":"2023-03-17T00:00:00Z","timestamp":1679011200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["DTA"],"published-print":{"date-parts":[[2023,11,15]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title><jats:p>Music sentiment analysis helps to promote the diversification of music information retrieval methods. Traditional music emotion classification tasks suffer from high manual workload and low classification accuracy caused by difficulty in feature extraction and inaccurate manual determination of hyperparameter. In this paper, the authors propose an optimized convolution neural network-random forest (CNN-RF) model for music sentiment classification which is capable of optimizing the manually selected hyperparameters to improve the accuracy of music sentiment classification and reduce labor costs and human classification errors.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title><jats:p>A CNN-RF music sentiment classification model is designed based on quantum particle swarm optimization (QPSO). First, the audio data are transformed into a Mel spectrogram, and feature extraction is conducted by a CNN. Second, the music features extracted are processed by RF algorithm to complete a preliminary emotion classification. Finally, to select the suitable hyperparameters for a CNN, the QPSO algorithm is adopted to extract the best hyperparameters and obtain the final classification results.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Findings<\/jats:title><jats:p>The model has gone through experimental validations and achieved a classification accuracy of 97 per cent for different sentiment categories with shortened training time. The proposed method with QPSO achieved 1.2 and 1.6 per cent higher accuracy than that with particle swarm optimization and genetic algorithm, respectively. The proposed model had great potential for music sentiment classification.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title><jats:p>The dual contribution of this work comprises the proposed model which integrated two deep learning models and the introduction of a QPSO into model optimization. With these two innovations, the efficiency and accuracy of music emotion recognition and classification have been significantly improved.<\/jats:p><\/jats:sec>","DOI":"10.1108\/dta-07-2022-0267","type":"journal-article","created":{"date-parts":[[2023,3,17]],"date-time":"2023-03-17T08:14:39Z","timestamp":1679040879000},"page":"719-733","source":"Crossref","is-referenced-by-count":4,"title":["Music sentiment classification based on an optimized CNN-RF-QPSO model"],"prefix":"10.1108","volume":"57","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9845-9351","authenticated-orcid":false,"given":"Rui","family":"Tian","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ruheng","family":"Yin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Feng","family":"Gan","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"140","published-online":{"date-parts":[[2023,3,17]]},"reference":[{"key":"key2023111509325981800_ref001","doi-asserted-by":"publisher","DOI":"10.9781\/IJIMAI.2021.01.003","article-title":"Motivic pattern classification of music audio signals combining residual and LSTM networks","year":"2021","journal-title":"The International Journal of Interactive Multimedia and Artificial Intelligence"},{"issue":"2","key":"key2023111509325981800_ref002","first-page":"719","article-title":"Automatic genre classification using fractional Fourier transform based Mel frequency cepstral coefficient and timbral features","volume":"42","year":"2017","journal-title":"Archives of Acoustics"},{"issue":"3","key":"key2023111509325981800_ref003","first-page":"342","article-title":"Decision tree-based classification in coastal area integrating polarimetric SAR and optical data","volume":"56","year":"2021","journal-title":"Data Technologies and Applications"},{"journal-title":"18th International Society for Music Information Retrieval Conference, 2017, arXiv preprint arXiv:1612.01840","article-title":"FMA: a dataset for music analysis","year":"2016","key":"key2023111509325981800_ref004"},{"issue":"12","key":"key2023111509325981800_ref005","doi-asserted-by":"crossref","first-page":"627","DOI":"10.1049\/el.2019.4202","article-title":"Music genre classification and music recommendation by using deep learning","volume":"56","year":"2020","journal-title":"Electronics Letters"},{"key":"key2023111509325981800_ref006","first-page":"1","article-title":"SUBiNN: a stacked uni- and bivariate kNN sparse ensemble","year":"2021","journal-title":"Advances in Data Analysis and Classification"},{"issue":"2","key":"key2023111509325981800_ref007","first-page":"247","article-title":"Cancer data classification by quantum-inspired immune clone optimization-based optimal feature selection using gene expression data: deep learning approach","volume":"56","year":"2021","journal-title":"Data Technologies and Applications"},{"issue":"15","key":"key2023111509325981800_ref008","doi-asserted-by":"crossref","first-page":"11695","DOI":"10.1007\/s00500-019-04631-x","article-title":"Multivector particle swarm optimization algorithm","volume":"24","year":"2020","journal-title":"Soft Computing"},{"key":"key2023111509325981800_ref009","first-page":"1","article-title":"Application of music industry based on the deep neural network","volume":"2022","year":"2022","journal-title":"Scientific Programming"},{"key":"key2023111509325981800_ref010","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.apacoust.2016.10.014","article-title":"On sound signal processing in the image to sound mapping technique","volume":"117","year":"2017","journal-title":"Applied Acoustics"},{"key":"key2023111509325981800_ref011","doi-asserted-by":"crossref","first-page":"109088","DOI":"10.1016\/j.petrol.2021.109088","article-title":"Efficient and robust optimization for good patterns using a PSO algorithm with a CNN-based proxy model","volume":"207","year":"2021","journal-title":"Journal of Petroleum Science and Engineering"},{"key":"key2023111509325981800_ref012","doi-asserted-by":"crossref","first-page":"113507","DOI":"10.1016\/j.eswa.2020.113507","article-title":"Classification of EEG signals produced by musical notes as stimuli","volume":"159","year":"2020","journal-title":"Expert Systems with Applications"},{"issue":"9","key":"key2023111509325981800_ref013","doi-asserted-by":"crossref","first-page":"11563","DOI":"10.1007\/s11042-018-6637-6","article-title":"Regional classification of Chinese folk songs based on CRF model","volume":"78","year":"2019","journal-title":"Multimedia Tools and Applications"},{"key":"key2023111509325981800_ref014","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.patcog.2016.09.013","article-title":"Quantum-behaved discrete multi-objective particle swarm optimization for complex network clustering","volume":"63","year":"2017","journal-title":"Pattern Recognition"},{"issue":"5","key":"key2023111509325981800_ref015","doi-asserted-by":"crossref","first-page":"7313","DOI":"10.1007\/s11042-020-09643-6","article-title":"Bottom-up broadcast neural network for music genre classification","volume":"80","year":"2021","journal-title":"Multimedia Tools and Applications"},{"key":"key2023111509325981800_ref016","first-page":"108868","article-title":"Bearing performance degradation assessment based on optimized EWT and CNN","volume":"172","year":"2020","journal-title":"Measurement"},{"volume-title":"Speech Communication: Human and Machine","year":"1987","key":"key2023111509325981800_ref017"},{"key":"key2023111509325981800_ref018","doi-asserted-by":"publisher","first-page":"10","DOI":"10.1155\/2022\/8752217","article-title":"Automatic classification method of music genres based on deep belief network and sparse representation","volume":"2022","year":"2022","journal-title":"Journal of Mathematics"},{"journal-title":"Data Technologies and Applications","article-title":"Classification of electrocardiogram signal using an ensemble of deep learning models","year":"2021","key":"key2023111509325981800_ref019"},{"key":"key2023111509325981800_ref020","doi-asserted-by":"crossref","first-page":"107627","DOI":"10.1016\/j.knosys.2021.107627","article-title":"Intrinsic dimension estimation method based on correlation dimension and kNN method","volume":"235","year":"2022","journal-title":"Knowledge-Based Systems"},{"issue":"1","key":"key2023111509325981800_ref021","first-page":"1","article-title":"Deep CNN with hybrid binary local search and particle swarm optimizer for exudates classification from fundus images","volume":"35","year":"2022","journal-title":"Journal of Digital Imaging"},{"issue":"3","key":"key2023111509325981800_ref022","first-page":"655","article-title":"Music style mining and classification by melody","volume":"86","year":"2003","journal-title":"IEICE TRANSACTIONS on Information and Systems"},{"key":"key2023111509325981800_ref023","doi-asserted-by":"crossref","first-page":"139332","DOI":"10.1109\/ACCESS.2020.3011882","article-title":"Recognizing emotions evoked by music using CNN-LSTM networks on EEG signals","volume":"8","year":"2020","journal-title":"IEEE Access"},{"key":"key2023111509325981800_ref024","doi-asserted-by":"crossref","first-page":"106702","DOI":"10.1016\/j.asoc.2020.106702","article-title":"Music auto-tagging using scattering transform and convolutional neural network with self-attention","volume":"96","year":"2020","journal-title":"Applied Soft Computing"},{"issue":"2","key":"key2023111509325981800_ref025","first-page":"4257","article-title":"SVM and KNN based CNN architectures for plant classification","volume":"71","year":"2022","journal-title":"Computers, Materials & Continua"},{"first-page":"000181","article-title":"Style-Specific Turkish Pop Music Composition with CNN and LSTM Network[C]2021","year":"2021","key":"key2023111509325981800_ref026"},{"issue":"3","key":"key2023111509325981800_ref027","article-title":"Combining CNN and broad learning for music classification","volume":"103","year":"2020","journal-title":"IEICE Transactions on Information and Systems"},{"issue":"2","key":"key2023111509325981800_ref028","doi-asserted-by":"crossref","first-page":"433","DOI":"10.1007\/s11629-021-7022-x","article-title":"Impacts of anthropogenic and biophysical factors on ecological land using logistic regression and random forest: a case study in Mentougou District, Beijing, China","volume":"19","year":"2022","journal-title":"Journal of Mountain Science"},{"key":"key2023111509325981800_ref029","doi-asserted-by":"publisher","first-page":"7","DOI":"10.1155\/2022\/2715765","article-title":"Music emotion classification method using improved deep belief network","volume":"2022","year":"2022","journal-title":"Mobile Information Systems"},{"key":"key2023111509325981800_ref030","doi-asserted-by":"crossref","first-page":"720","DOI":"10.1109\/TASLP.2021.3049337","article-title":"On improved training of CNN for acoustic source localisation","volume":"29","year":"2021","journal-title":"IEEE\/ACM Transactions on Audio, Speech, and Language Processing"},{"key":"key2023111509325981800_ref031","doi-asserted-by":"crossref","first-page":"106859","DOI":"10.1016\/j.knosys.2021.106859","article-title":"Development and application of quantum entanglement inspired particle swarm optimization","volume":"219","year":"2021","journal-title":"Knowledge-Based Systems"},{"key":"key2023111509325981800_ref032","first-page":"2","article-title":"Research on the detection of network intrusion prevention with SVM based optimization algorithm","volume":"44","year":"2020","journal-title":"Informatica"},{"issue":"1","key":"key2023111509325981800_ref033","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1080\/09298215.2021.1873393","article-title":"Experiments and detailed error-analysis of automatic square notation transcription of medieval music manuscripts using CNN\/LSTM-networks and a neume dictionary","volume":"50","year":"2021","journal-title":"Journal of New Music Research"},{"key":"key2023111509325981800_ref034","doi-asserted-by":"crossref","first-page":"4953288","DOI":"10.1155\/2021\/4953288","article-title":"Design of the Piano score recommendation image analysis system based on the big data and convolutional neural network","volume":"2021","year":"2021","journal-title":"Computational Intelligence and Neuroscience"},{"key":"key2023111509325981800_ref035","doi-asserted-by":"crossref","first-page":"645","DOI":"10.1016\/j.amc.2013.07.067","article-title":"An improved monkey algorithm with dynamic adaptation","volume":"222","year":"2013","journal-title":"Applied Mathematics and Computation"},{"issue":"2","key":"key2023111509325981800_ref036","first-page":"1","article-title":"Quantum particle swarm optimization algorithm with the truncated mean stabilization strategy","volume":"21","year":"2022","journal-title":"Quantum Information Processing"},{"issue":"3","key":"key2023111509325981800_ref037","first-page":"760","article-title":"Music emotion recognition using convolutional long short term memory deep neural network","volume":"24","year":"2021","journal-title":"Engineering Science and Technology"},{"article-title":"Music mood classification using audio power and audio harmonicity based on MPEG-7 audio features and Support Vector Machine","year":"2018","key":"key2023111509325981800_ref041","doi-asserted-by":"publisher","DOI":"10.1109\/ICSITech.2017.8257088"},{"issue":"5","key":"key2023111509325981800_ref039","doi-asserted-by":"crossref","first-page":"1850016","DOI":"10.1142\/S0218213018500161","article-title":"Classification of Music Mood Using MPEG-7 audio features and SVM with confidence interval","volume":"27","year":"2018","journal-title":"International Journal on Artificial Intelligence Tools"},{"issue":"4","key":"key2023111509325981800_ref040","doi-asserted-by":"crossref","first-page":"558","DOI":"10.1108\/DTA-12-2020-0298","article-title":"A systematic review of machine learning-based missing value imputation techniques","volume":"55","year":"2021","journal-title":"Data Technologies and Applications"}],"container-title":["Data Technologies and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/DTA-07-2022-0267\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/DTA-07-2022-0267\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T23:15:15Z","timestamp":1753398915000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/dta\/article\/57\/5\/719-733\/27225"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,3,17]]},"references-count":40,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2023,3,17]]},"published-print":{"date-parts":[[2023,11,15]]}},"alternative-id":["10.1108\/DTA-07-2022-0267"],"URL":"https:\/\/doi.org\/10.1108\/dta-07-2022-0267","relation":{},"ISSN":["2514-9288","2514-9288"],"issn-type":[{"type":"print","value":"2514-9288"},{"type":"electronic","value":"2514-9288"}],"subject":[],"published":{"date-parts":[[2023,3,17]]}}}