{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,1]],"date-time":"2026-04-01T03:00:26Z","timestamp":1775012426616,"version":"3.50.1"},"reference-count":121,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2024,1,21]],"date-time":"2024-01-21T00:00:00Z","timestamp":1705795200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"MCIN\/AEI\/10.13039\/501100011033","award":["PID2020-119082RB-{C21,C22}"],"award-info":[{"award-number":["PID2020-119082RB-{C21,C22}"]}]},{"name":"MCIN\/AEI\/10.13039\/501100011033","award":["P18-RT-1994"],"award-info":[{"award-number":["P18-RT-1994"]}]},{"name":"Ministry of Economy, Knowledge and University, Junta de Andaluc\u00eda, Spain","award":["PID2020-119082RB-{C21,C22}"],"award-info":[{"award-number":["PID2020-119082RB-{C21,C22}"]}]},{"name":"Ministry of Economy, Knowledge and University, Junta de Andaluc\u00eda, Spain","award":["P18-RT-1994"],"award-info":[{"award-number":["P18-RT-1994"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Early identification of respiratory irregularities is critical for improving lung health and reducing global mortality rates. The analysis of respiratory sounds plays a significant role in characterizing the respiratory system\u2019s condition and identifying abnormalities. The main contribution of this study is to investigate the performance when the input data, represented by cochleogram, is used to feed the Vision Transformer (ViT) architecture, since this input\u2013classifier combination is the first time it has been applied to adventitious sound classification to our knowledge. Although ViT has shown promising results in audio classification tasks by applying self-attention to spectrogram patches, we extend this approach by applying the cochleogram, which captures specific spectro-temporal features of adventitious sounds. The proposed methodology is evaluated on the ICBHI dataset. We compare the classification performance of ViT with other state-of-the-art CNN approaches using spectrogram, Mel frequency cepstral coefficients, constant-Q transform, and cochleogram as input data. Our results confirm the superior classification performance combining cochleogram and ViT, highlighting the potential of ViT for reliable respiratory sound classification. This study contributes to the ongoing efforts in developing automatic intelligent techniques with the aim to significantly augment the speed and effectiveness of respiratory disease detection, thereby addressing a critical need in the medical field.<\/jats:p>","DOI":"10.3390\/s24020682","type":"journal-article","created":{"date-parts":[[2024,1,22]],"date-time":"2024-01-22T12:01:13Z","timestamp":1705924873000},"page":"682","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":16,"title":["Classification of Adventitious Sounds Combining Cochleogram and Vision Transformers"],"prefix":"10.3390","volume":"24","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1968-082X","authenticated-orcid":false,"given":"Loredana Daria","family":"Mang","sequence":"first","affiliation":[{"name":"Department of Telecommunication Engineering, University of Jaen, 23700 Linares, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-0667-8513","authenticated-orcid":false,"given":"Francisco David","family":"Gonz\u00e1lez Mart\u00ednez","sequence":"additional","affiliation":[{"name":"Department of Telecommunication Engineering, University of Jaen, 23700 Linares, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0953-5947","authenticated-orcid":false,"given":"Damian","family":"Martinez Mu\u00f1oz","sequence":"additional","affiliation":[{"name":"Department of Telecommunication Engineering, University of Jaen, 23700 Linares, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3300-5794","authenticated-orcid":false,"given":"Sebasti\u00e1n","family":"Garc\u00eda Gal\u00e1n","sequence":"additional","affiliation":[{"name":"Department of Telecommunication Engineering, University of Jaen, 23700 Linares, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Raquel","family":"Cortina","sequence":"additional","affiliation":[{"name":"Department of Computer Science, University of Oviedo, 33003 Oviedo, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2024,1,21]]},"reference":[{"key":"ref_1","unstructured":"(2023, November 26). Respiratory Diseases in the World, Realities of Today\u2014Opportunities for Tomorrow, Forum of International Respiratory Societies (FIRS). Available online: https:\/\/www.thoracic.org\/about\/global-public-health\/firs\/resources\/firs-report-for-web.pdf."},{"key":"ref_2","unstructured":"World Health Organization (2023, November 26). Chronic Obstructive Pulmonary Disease (COPD). Available online: https:\/\/www.who.int\/news-room\/fact-sheets\/detail\/chronic-obstructive-pulmonary-disease-(copd)."},{"key":"ref_3","unstructured":"World Health Organization (2023, November 26). Asthma. Available online: https:\/\/www.who.int\/news-room\/fact-sheets\/detail\/asthma."},{"key":"ref_4","unstructured":"World Health Organization (2023, November 26). Pneumonia. Available online: https:\/\/www.who.int\/health-topics\/pneumonia#tab=tab_1."},{"key":"ref_5","unstructured":"World Health Organization (2023, November 26). Tuberculosis. Available online: https:\/\/www.who.int\/news-room\/fact-sheets\/detail\/tuberculosis."},{"key":"ref_6","unstructured":"World Health Organization (2023, November 26). Cancer. Available online: https:\/\/www.who.int\/news-room\/fact-sheets\/detail\/cancer."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"126436","DOI":"10.1016\/j.neucom.2023.126436","article-title":"Ensemble deep learning in speech signal tasks: A review","volume":"550","author":"Tanveer","year":"2023","journal-title":"Neurocomputing"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"109046","DOI":"10.1016\/j.patcog.2022.109046","article-title":"Deep learning for image inpainting: A survey","volume":"134","author":"Xiang","year":"2023","journal-title":"Pattern Recognit."},{"key":"ref_9","first-page":"1","article-title":"Machine learning techniques for the diagnosis of Alzheimer\u2019s disease: A review","volume":"16","author":"Tanveer","year":"2020","journal-title":"ACM Trans. Multimed. Comput. Commun. Appl. (TOMM)"},{"key":"ref_10","first-page":"597","article-title":"Definition of terms for applications of respiratory sounds","volume":"10","author":"Sovijarvi","year":"2000","journal-title":"Eur. Respir. Rev."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"905","DOI":"10.1164\/ajrccm.162.3.9905104","article-title":"The relationship between normal lung sounds, age, and gender","volume":"162","author":"Gross","year":"2000","journal-title":"Am. J. Respir. Crit. Care Med."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"974","DOI":"10.1164\/ajrccm.156.3.9701115","article-title":"Respiratory sounds: Advances beyond the stethoscope","volume":"156","author":"Pasterkamp","year":"1997","journal-title":"Am. J. Respir. Crit. Care Med."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Ulukaya, S., Sen, I., and Kahya, Y.P. (2015, January 25\u201329). Feature extraction using time-frequency analysis for monophonic-polyphonic wheeze discrimination. Proceedings of the 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Milan, Italy.","DOI":"10.1109\/EMBC.2015.7319615"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Pramono, R.X.A., Bowyer, S., and Rodriguez-Villegas, E. (2017). Automatic adventitious respiratory sound analysis: A systematic review. PLoS ONE, 12.","DOI":"10.1371\/journal.pone.0177926"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"S489","DOI":"10.3233\/THC-150986","article-title":"The detection of crackles based on mathematical morphology in spectrogram analysis","volume":"23","author":"Zhang","year":"2015","journal-title":"Technol. Health Care"},{"key":"ref_16","unstructured":"Wisniewski, M., and Zielinski, T.P. (2012, January 11\u201313). Tonality detection methods for wheezes recognition system. Proceedings of the 2012 19th International Conference on Systems, Signals and Image Processing (IWSSIP), Vienna, Austria."},{"key":"ref_17","first-page":"1009","article-title":"Joint application of audio spectral envelope and tonality index in an e-asthma monitoring system","volume":"19","year":"2014","journal-title":"IEEE J. Biomed. Health Inform."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"824","DOI":"10.1016\/j.compbiomed.2009.06.011","article-title":"Pattern recognition methods applied to respiratory sounds classification into normal and wheeze classes","volume":"39","author":"Bahoura","year":"2009","journal-title":"Comput. Biol. Med."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Aras, S., and Gangal, A. (2017, January 5\u20137). Comparison of different features derived from mel frequency cepstrum coefficients for classification of single channel lung sounds. Proceedings of the 2017 40th International Conference on Telecommunications and Signal Processing (TSP), Barcelona, Spain.","DOI":"10.1109\/TSP.2017.8076002"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Okubo, T., Nakamura, N., Yamashita, M., and Matsunaga, S. (2014, January 26\u201330). Classification of healthy subjects and patients with pulmonary emphysema using continuous respiratory sounds. Proceedings of the 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Chicago, IL, USA.","DOI":"10.1109\/EMBC.2014.6943531"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1406","DOI":"10.1109\/JBHI.2017.2781135","article-title":"Asthmatic wheeze detection from compressively sensed respiratory sound spectra","volume":"22","author":"Oletic","year":"2017","journal-title":"IEEE J. Biomed. Health Inform."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Jakovljevi\u0107, N., and Lon\u010dar-Turukalo, T. (2017, January 18\u201321). Hidden markov model based respiratory sound classification. Proceedings of the Precision Medicine Powered by pHealth and Connected Health: ICBHI 2017, Thessaloniki, Greece.","DOI":"10.1007\/978-981-10-7419-6_7"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"52","DOI":"10.1016\/j.compbiomed.2018.10.035","article-title":"Characterization and classification of asthmatic wheeze sounds according to severity level using spectral integrated features","volume":"104","author":"Nabi","year":"2019","journal-title":"Comput. Biol. Med."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"1050","DOI":"10.1109\/TBME.2005.846717","article-title":"Wavelet-based enhancement of lung and bowel sounds using fractal dimension thresholding-Part II: Application results","volume":"52","author":"Hadjileontiadis","year":"2005","journal-title":"IEEE Trans. Biomed. Eng."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"705","DOI":"10.1016\/j.procs.2015.08.592","article-title":"Automatic crackle detection algorithm based on fractal dimension and box filtering","volume":"64","author":"Pinho","year":"2015","journal-title":"Procedia Comput. Sci."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Pal, R., and Barney, A. (2021). Iterative envelope mean fractal dimension filter for the separation of crackles from normal breath sounds. Biomed. Signal Process. Control, 66.","DOI":"10.1016\/j.bspc.2021.102454"},{"key":"ref_27","unstructured":"Hadjileontiadis, L.J. (2007). Empirical mode decomposition and fractal dimension filter. IEEE Eng. Med. Biol. Mag., 26."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Rocha, B.M., Pessoa, D., Marques, A., de Carvalho, P., and Paiva, R.P. (2023). Automatic wheeze segmentation using harmonic-percussive source separation and empirical mode decomposition. IEEE J. Biomed. Health Inform.","DOI":"10.36227\/techrxiv.19753570"},{"key":"ref_29","first-page":"1309","article-title":"Respiratory sounds classification using Gaussian mixture models","volume":"Volume 3","author":"Bahoura","year":"2004","journal-title":"Proceedings of the Canadian Conference on Electrical and Computer Engineering"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Mayorga, P., Druzgalski, C., Morelos, R., Gonzalez, O., and Vidales, J. (September, January 31). Acoustics based assessment of respiratory diseases using GMM classification. Proceedings of the 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology, Buenos Aires, Argentina.","DOI":"10.1109\/IEMBS.2010.5628092"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Maruf, S.O., Azhar, M.U., Khawaja, S.G., and Akram, M.U. (2015, January 18\u201320). Crackle separation and classification from normal Respiratory sounds using Gaussian Mixture Model. Proceedings of the 2015 IEEE 10th International Conference on Industrial and Information Systems (ICIIS), Peradeniya, Sri Lanka.","DOI":"10.1109\/ICIINFS.2015.7399022"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"517","DOI":"10.1007\/BF02442324","article-title":"Validated method for automatic detection of lung sound crackles","volume":"29","author":"Kaisia","year":"1991","journal-title":"Med. Biol. Eng. Comput."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"1073","DOI":"10.1016\/j.compbiomed.2006.09.007","article-title":"Wheeze detection based on time-frequency analysis of breath sounds","volume":"37","author":"Taplidou","year":"2007","journal-title":"Comput. Biol. Med."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Jain, A., and Vepa, J. (2008, January 20\u201325). Lung sound analysis for wheeze episode detection. Proceedings of the 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Vancouver, BC, Canada.","DOI":"10.1109\/IEMBS.2008.4649728"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"3078","DOI":"10.1109\/TBME.2011.2160721","article-title":"Adventitious sounds identification and extraction using temporal\u2013spectral dominance-based features","volume":"58","author":"Jin","year":"2011","journal-title":"IEEE Trans. Biomed. Eng."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Mendes, L., Vogiatzis, I., Perantoni, E., Kaimakamis, E., Chouvarda, I., Maglaveras, N., Tsara, V., Teixeira, C., Carvalho, P., and Henriques, J. (2015, January 25\u201329). Detection of wheezes using their signature in the spectrogram space and musical features. Proceedings of the 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Milan, Italy.","DOI":"10.1109\/EMBC.2015.7319657"},{"key":"ref_37","first-page":"2217","article-title":"Nonlinear separation of crackles and squawks from vesicular sounds using third-order statistics","volume":"Volume 5","author":"Hadjileontiadis","year":"1996","journal-title":"Proceedings of the 18th Annual International Conference of the IEEE Engineering in Medicine and Biology Society"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Cortes, S., Jane, R., Fiz, J., and Morera, J. (2006, January 17\u201318). Monitoring of wheeze duration during spontaneous respiration in asthmatic patients. Proceedings of the 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference, Shanghai, China.","DOI":"10.1109\/IEMBS.2005.1615896"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"473","DOI":"10.1016\/j.compbiomed.2011.04.009","article-title":"Assessment of multichannel lung sounds parameterization for two-class classification in interstitial lung disease patients","volume":"41","author":"Carrillo","year":"2011","journal-title":"Comput. Biol. Med."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Zhang, J., Ser, W., Yu, J., and Zhang, T. (2009, January 15\u201316). A novel wheeze detection method for wearable monitoring systems. Proceedings of the 2009 International Symposium on Intelligent Ubiquitous Computing and Education, Chengdu, China.","DOI":"10.1109\/IUCE.2009.66"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Liu, X., Ser, W., Zhang, J., and Goh, D.Y.T. (2015, January 2\u20134). Detection of adventitious lung sounds using entropy features and a 2-D threshold setting. Proceedings of the 2015 10th International Conference on Information, Communications and Signal Processing (ICICS), Singapore.","DOI":"10.1109\/ICICS.2015.7459851"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Rizal, A., Hidayat, R., and Nugroho, H.A. (2016, January 5\u20136). Pulmonary crackle feature extraction using tsallis entropy for automatic lung sound classification. Proceedings of the 2016 1st International Conference on Biomedical Engineering (IBIOMED), Yogyakarta, Indonesia.","DOI":"10.1109\/IBIOMED.2016.7869823"},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"1269","DOI":"10.1109\/10.649999","article-title":"Separation of discontinuous adventitious sounds from vesicular sounds using a wavelet-based filter","volume":"44","author":"Hadjileontiadis","year":"1997","journal-title":"IEEE Trans. Biomed. Eng."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"244","DOI":"10.1016\/j.bspc.2008.04.003","article-title":"An integrated automated system for crackles extraction and classification","volume":"3","author":"Lu","year":"2008","journal-title":"Biomed. Signal Process. Control"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Le Cam, S., Belghith, A., Collet, C., and Salzenstein, F. (2009, January 19\u201324). Wheezing sounds detection using multivariate generalized Gaussian distributions. Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, Taipei, Taiwan.","DOI":"10.1109\/ICASSP.2009.4959640"},{"key":"ref_46","first-page":"127","article-title":"Classification of wheeze sounds using wavelets and neural networks","volume":"Volume 11","author":"Hashemi","year":"2011","journal-title":"Proceedings of the International Conference on Biomedical Engineering and Technology"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"1012","DOI":"10.1016\/j.dsp.2012.12.009","article-title":"Pulmonary crackle detection using time\u2013frequency and time\u2013scale analysis","volume":"23","author":"Serbes","year":"2013","journal-title":"Digit. Signal Process."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"175","DOI":"10.1016\/j.compbiomed.2018.11.004","article-title":"Wheeze type classification using non-dyadic wavelet transform based optimal energy ratio technique","volume":"104","author":"Ulukaya","year":"2019","journal-title":"Comput. Biol. Med."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Stasiakiewicz, P., Dobrowolski, A.P., Targowski, T., Ga\u0142\u0105zka-\u015awiderek, N., Sadura-Sieklucka, T., Majka, K., Skoczylas, A., Lejkowski, W., and Olszewski, R. (2021). Automatic classification of normal and sick patients with crackles using wavelet packet decomposition and support vector machine. Biomed. Signal Process. Control, 67.","DOI":"10.1016\/j.bspc.2021.102521"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Li, J., and Hong, Y. (2016, January 6\u201310). Crackles detection method based on time-frequency features analysis and SVM. Proceedings of the 2016 IEEE 13th International Conference on Signal Processing (ICSP), Chengdu, China.","DOI":"10.1109\/ICSP.2016.7878059"},{"key":"ref_51","unstructured":"Gr\u00f8nnesby, M., Solis, J.C.A., Holsb\u00f8, E., Melbye, H., and Bongo, L.A. (2017). Feature extraction for machine learning based crackle detection in lung sounds from a health survey. arXiv."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"020014","DOI":"10.1063\/5.0030369","article-title":"Crackle detection in lung sound using statistical feature of variogram","volume":"Volume 2296","author":"Pramudita","year":"2020","journal-title":"AIP Conference Proceedings"},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"1289","DOI":"10.1038\/s41598-023-27399-5","article-title":"A machine learning approach to the development and prospective evaluation of a pediatric lung sound classification model","volume":"13","author":"Park","year":"2023","journal-title":"Sci. Rep."},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"459","DOI":"10.1007\/s12553-019-00365-w","article-title":"Automated extraction of fine and coarse crackles by independent component analysis","volume":"10","author":"Villalobos","year":"2020","journal-title":"Health Technol."},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1016\/j.bspc.2018.07.004","article-title":"A robust audio classification system for detecting pulmonary edema","volume":"46","author":"Hong","year":"2018","journal-title":"Biomed. Signal Process. Control"},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"107188","DOI":"10.1016\/j.apacoust.2019.107188","article-title":"A constrained tonal semi-supervised non-negative matrix factorization to classify presence\/absence of wheezing in respiratory sounds","volume":"161","year":"2020","journal-title":"Appl. Acoust."},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"113212","DOI":"10.1016\/j.eswa.2020.113212","article-title":"Combining a recursive approach via non-negative matrix factorization and Gini index sparsity to improve reliable detection of wheezing sounds","volume":"147","author":"Cruz","year":"2020","journal-title":"Expert Syst. Appl."},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"108229","DOI":"10.1016\/j.apacoust.2021.108229","article-title":"An incremental algorithm based on multichannel non-negative matrix partial co-factorization for ambient denoising in auscultation","volume":"182","author":"Cruz","year":"2021","journal-title":"Appl. Acoust."},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"De La Torre Cruz, J., Ca\u00f1adas Quesada, F.J., Ruiz Reyes, N., Garc\u00eda Gal\u00e1n, S., Carabias Orti, J.J., and Per\u00e9z Chica, G. (2021). Monophonic and polyphonic wheezing classification based on constrained low-rank non-negative matrix factorization. Sensors, 21.","DOI":"10.3390\/s21051661"},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"035001","DOI":"10.1088\/1361-6579\/ab03ea","article-title":"An open access database for the evaluation of respiratory sound classification algorithms","volume":"40","author":"Rocha","year":"2019","journal-title":"Physiol. Meas."},{"key":"ref_61","unstructured":"(2024, January 18). ICBHI 2017 Challenge, Respiratory Sound Database. Available online: https:\/\/bhichallenge.med.auth.gr\/ICBHI_2017_Challenge."},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Messner, E., Fediuk, M., Swatek, P., Scheidl, S., Smolle-Juttner, F.M., Olschewski, H., and Pernkopf, F. (2018, January 18\u201321). Crackle and breathing phase detection in lung sounds with deep bidirectional gated recurrent neural networks. Proceedings of the 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Honolulu, HI, USA.","DOI":"10.1109\/EMBC.2018.8512237"},{"key":"ref_63","doi-asserted-by":"crossref","unstructured":"Messner, E., Fediuk, M., Swatek, P., Scheidl, S., Smolle-J\u00fcttner, F.M., Olschewski, H., and Pernkopf, F. (2020). Multi-channel lung sound classification with convolutional recurrent neural networks. Comput. Biol. Med., 122.","DOI":"10.1016\/j.compbiomed.2020.103831"},{"key":"ref_64","doi-asserted-by":"crossref","first-page":"107367","DOI":"10.1016\/j.compeleceng.2021.107367","article-title":"Classification of respiratory sounds using improved convolutional recurrent neural network","volume":"94","author":"Asatani","year":"2021","journal-title":"Comput. Electr. Eng."},{"key":"ref_65","doi-asserted-by":"crossref","unstructured":"Petmezas, G., Cheimariotis, G.A., Stefanopoulos, L., Rocha, B., Paiva, R.P., Katsaggelos, A.K., and Maglaveras, N. (2022). Automated Lung Sound Classification Using a Hybrid CNN-LSTM Network and Focal Loss Function. Sensors, 22.","DOI":"10.3390\/s22031232"},{"key":"ref_66","doi-asserted-by":"crossref","unstructured":"Wall, C., Zhang, L., Yu, Y., Kumar, A., and Gao, R. (2022). A deep ensemble neural network with attention mechanisms for lung abnormality classification using audio inputs. Sensors, 22.","DOI":"10.3390\/s22155566"},{"key":"ref_67","doi-asserted-by":"crossref","first-page":"13405","DOI":"10.1007\/s00500-022-07499-6","article-title":"Deep learning models for detecting respiratory pathologies from raw lung auscultation sounds","volume":"26","author":"Alqudah","year":"2022","journal-title":"Soft Comput."},{"key":"ref_68","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1186\/s13640-017-0213-2","article-title":"Classification of lung sounds using convolutional neural networks","volume":"2017","author":"Aykanat","year":"2017","journal-title":"EURASIP J. Image Video Process."},{"key":"ref_69","doi-asserted-by":"crossref","unstructured":"Kochetov, K., Putin, E., Balashov, M., Filchenkov, A., and Shalyto, A. (2018, January 4\u20137). Noise masking recurrent neural network for respiratory sound classification. Proceedings of the International Conference on Artificial Neural Networks, Rhodes, Greece.","DOI":"10.1007\/978-3-030-01424-7_21"},{"key":"ref_70","doi-asserted-by":"crossref","first-page":"58","DOI":"10.1016\/j.artmed.2018.04.008","article-title":"Lung sounds classification using convolutional neural networks","volume":"88","author":"Bardou","year":"2018","journal-title":"Artif. Intell. Med."},{"key":"ref_71","doi-asserted-by":"crossref","unstructured":"Liu, R., Cai, S., Zhang, K., and Hu, N. (2019, January 21\u201324). Detection of adventitious respiratory sounds based on convolutional neural network. Proceedings of the 2019 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS), Shanghai, China.","DOI":"10.1109\/ICIIBMS46890.2019.8991459"},{"key":"ref_72","doi-asserted-by":"crossref","unstructured":"Perna, D., and Tagarelli, A. (2019, January 5\u20137). Deep auscultation: Predicting respiratory anomalies and diseases via recurrent neural networks. Proceedings of the 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS), Cordoba, Spain.","DOI":"10.1109\/CBMS.2019.00020"},{"key":"ref_73","doi-asserted-by":"crossref","unstructured":"Minami, K., Lu, H., Kim, H., Mabu, S., Hirano, Y., and Kido, S. (2019, January 15\u201318). Automatic classification of large-scale respiratory sound dataset based on convolutional neural network. Proceedings of the 2019 19th International Conference on Control, Automation and Systems (ICCAS), Jeju, Republic of Korea.","DOI":"10.23919\/ICCAS47443.2019.8971689"},{"key":"ref_74","doi-asserted-by":"crossref","unstructured":"Ma, Y., Xu, X., Yu, Q., Zhang, Y., Li, Y., Zhao, J., and Wang, G. (2019, January 17\u201319). LungBRN: A smart digital stethoscope for detecting respiratory disease using bi-resnet deep learning algorithm. Proceedings of the 2019 IEEE Biomedical Circuits and Systems Conference (BioCAS), Nara, Japan.","DOI":"10.1109\/BIOCAS.2019.8919021"},{"key":"ref_75","doi-asserted-by":"crossref","unstructured":"Ngo, D., Pham, L., Nguyen, A., Phan, B., Tran, K., and Nguyen, T. (2021, January 15-16). Deep learning framework applied for predicting anomaly of respiratory sounds. Proceedings of the 2021 International Symposium on Electrical and Electronics Engineering (ISEE), Ho Chi Minh, Vietnam.","DOI":"10.1109\/ISEE51682.2021.9418742"},{"key":"ref_76","doi-asserted-by":"crossref","unstructured":"Nguyen, T., and Pernkopf, F. (2020, January 20\u201324). Lung sound classification using snapshot ensemble of convolutional neural networks. Proceedings of the 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Montreal, QC, Canada.","DOI":"10.1109\/EMBC44109.2020.9176076"},{"key":"ref_77","first-page":"535","article-title":"Deep neural network for respiratory sound classification in wearable devices enabled by patient specific model tuning","volume":"14","author":"Acharya","year":"2020","journal-title":"IEEE Trans. Biomed. Circuits Syst."},{"key":"ref_78","doi-asserted-by":"crossref","first-page":"105376","DOI":"10.1109\/ACCESS.2020.3000111","article-title":"Classification of lung sounds with CNN model using parallel pooling structure","volume":"8","author":"Demir","year":"2020","journal-title":"IEEE Access"},{"key":"ref_79","doi-asserted-by":"crossref","unstructured":"Saraiva., A., Santos., D., Francisco., A., Sousa., J., Ferreira., N., Soares., S., and Valente., A. (2020, January 24\u201326). Classification of Respiratory Sounds with Convolutional Neural Network. Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies\u2014BIOINFORMATICS, INSTICC, Valletta, Malta.","DOI":"10.5220\/0008965101380144"},{"key":"ref_80","doi-asserted-by":"crossref","unstructured":"Ma, Y., Xu, X., and Li, Y. (2020, January 25\u201329). LungRN+ NL: An Improved Adventitious Lung Sound Classification Using Non-Local Block ResNet Neural Network with Mixup Data Augmentation. Proceedings of the Interspeech, Shanghai, China.","DOI":"10.21437\/Interspeech.2020-2487"},{"key":"ref_81","doi-asserted-by":"crossref","unstructured":"Yang, Z., Liu, S., Song, M., Parada-Cabaleiro, E., and Schuller, B.W. (2020, January 25\u201329). Adventitious respiratory classification using attentive residual neural networks. Proceedings of the Interspeech, Shanghai, China.","DOI":"10.21437\/Interspeech.2020-2790"},{"key":"ref_82","doi-asserted-by":"crossref","first-page":"69","DOI":"10.1007\/s12530-020-09339-0","article-title":"Automatic acoustic identification of respiratory diseases","volume":"12","author":"Ntalampiras","year":"2021","journal-title":"Evol. Syst."},{"key":"ref_83","doi-asserted-by":"crossref","unstructured":"Chanane, H., and Bahoura, M. (2021, January 9\u201311). Convolutional neural network-based model for lung sounds classification. Proceedings of the 2021 IEEE International Midwest Symposium on Circuits and Systems (MWSCAS), Lansing, MI, USA.","DOI":"10.1109\/MWSCAS47672.2021.9531887"},{"key":"ref_84","doi-asserted-by":"crossref","first-page":"714811","DOI":"10.3389\/fmed.2021.714811","article-title":"Abnormal respiratory sounds classification using deep CNN through artificial noise addition","volume":"8","author":"Zulfiqar","year":"2021","journal-title":"Front. Med."},{"key":"ref_85","doi-asserted-by":"crossref","first-page":"585578","DOI":"10.3389\/fmed.2021.585578","article-title":"End-to-end AI-based point-of-care diagnosis system for classifying respiratory illnesses and early detection of COVID-19: A theoretical framework","volume":"8","author":"Belkacem","year":"2021","journal-title":"Front. Med."},{"key":"ref_86","doi-asserted-by":"crossref","first-page":"17186","DOI":"10.1038\/s41598-021-96724-7","article-title":"Respiratory sound classification for crackles, wheezes, and rhonchi in the clinical field using deep learning","volume":"11","author":"Kim","year":"2021","journal-title":"Sci. Rep."},{"key":"ref_87","doi-asserted-by":"crossref","unstructured":"Song, W., Han, J., and Song, H. (2021, January 6\u201311). Contrastive embeddind learning method for respiratory sound classification. Proceedings of the ICASSP 2021\u20132021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada.","DOI":"10.1109\/ICASSP39728.2021.9414385"},{"key":"ref_88","doi-asserted-by":"crossref","unstructured":"Gairola, S., Tom, F., Kwatra, N., and Jain, M. (2021, January 1\u20135). Respirenet: A deep neural network for accurately detecting abnormal lung sounds in limited data setting. Proceedings of the 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Guadalajara, Mexico.","DOI":"10.1109\/EMBC46164.2021.9630091"},{"key":"ref_89","doi-asserted-by":"crossref","first-page":"e369","DOI":"10.7717\/peerj-cs.369","article-title":"Deep learning based respiratory sound analysis for detection of chronic obstructive pulmonary disease","volume":"7","author":"Srivastava","year":"2021","journal-title":"PeerJ Comput. Sci."},{"key":"ref_90","doi-asserted-by":"crossref","unstructured":"Tariq, Z., Shah, S.K., and Lee, Y. (2022). Feature-based fusion using CNN for lung and heart sound classification. Sensors, 22.","DOI":"10.3390\/s22041521"},{"key":"ref_91","doi-asserted-by":"crossref","unstructured":"Choi, Y., Choi, H., Lee, H., Lee, S., and Lee, H. (2022). Lightweight Skip Connections with Efficient Feature Stacking for Respiratory Sound Classification. IEEE Access.","DOI":"10.1109\/ACCESS.2022.3174678"},{"key":"ref_92","doi-asserted-by":"crossref","first-page":"2872","DOI":"10.1109\/TBME.2022.3156293","article-title":"Lung Sound Classification Using Co-tuning and Stochastic Normalization","volume":"69","author":"Nguyen","year":"2022","journal-title":"IEEE Trans. Biomed. Eng."},{"key":"ref_93","doi-asserted-by":"crossref","unstructured":"Zhao, Z., Gong, Z., Niu, M., Ma, J., Wang, H., Zhang, Z., and Li, Y. (2022, January 23\u201327). Automatic Respiratory Sound Classification Via Multi-Branch Temporal Convolutional Network. Proceedings of the ICASSP 2022\u20132022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Singapore.","DOI":"10.1109\/ICASSP43922.2022.9746182"},{"key":"ref_94","doi-asserted-by":"crossref","unstructured":"Saldanha, J., Chakraborty, S., Patil, S., Kotecha, K., Kumar, S., and Nayyar, A. (2022). Data augmentation using Variational Autoencoders for improvement of respiratory disease classification. PLoS ONE, 17.","DOI":"10.1371\/journal.pone.0266467"},{"key":"ref_95","doi-asserted-by":"crossref","first-page":"3201","DOI":"10.1007\/s42835-023-01425-y","article-title":"Ensemble Learning Model for Classification of Respiratory Anomalies","volume":"18","author":"Kim","year":"2023","journal-title":"J. Electr. Eng. Technol."},{"key":"ref_96","doi-asserted-by":"crossref","unstructured":"Alice, R.S., Wendling, L., and Santosh, K. (2022, January 1\u20132). 2D Respiratory Sound Analysis to Detect Lung Abnormalities. Proceedings of the Recent Trends in Image Processing and Pattern Recognition: 5th International Conference, RTIP2R 2022, Kingsville, TX, USA. Revised Selected Papers.","DOI":"10.1007\/978-3-031-23599-3_5"},{"key":"ref_97","doi-asserted-by":"crossref","first-page":"327","DOI":"10.12785\/ijcds\/130126","article-title":"Voice Based Pathology Detection from Respiratory Sounds using Optimized Classifiers","volume":"13","author":"Chudasama","year":"2023","journal-title":"Int. J. Comput. Digit. Syst."},{"key":"ref_98","doi-asserted-by":"crossref","unstructured":"Cinyol, F., Baysal, U., K\u00f6ksal, D., Babao\u011flu, E., and Ula\u015fl\u0131, S.S. (2023). Incorporating support vector machine to the classification of respiratory sounds by Convolutional Neural Network. Biomed. Signal Process. Control, 79.","DOI":"10.1016\/j.bspc.2022.104093"},{"key":"ref_99","doi-asserted-by":"crossref","unstructured":"Dianat, B., La Torraca, P., Manfredi, A., Cassone, G., Vacchi, C., Sebastiani, M., and Pancaldi, F. (2023). Classification of pulmonary sounds through deep learning for the diagnosis of interstitial lung diseases secondary to connective tissue diseases. Comput. Biol. Med., 160.","DOI":"10.1016\/j.compbiomed.2023.106928"},{"key":"ref_100","doi-asserted-by":"crossref","first-page":"2595","DOI":"10.1109\/JBHI.2020.3048006","article-title":"A lightweight cnn model for detecting respiratory diseases from lung auscultation sounds using emd-cwt-based hybrid scalogram","volume":"25","author":"Shuvo","year":"2020","journal-title":"IEEE J. Biomed. Health Inform."},{"key":"ref_101","doi-asserted-by":"crossref","unstructured":"Zhang, Q., and Ma, P. (2023). Classification of pulmonary arterial pressure using photoplethysmography and bi-directional LSTM. Biomed. Signal Process. Control, 86.","DOI":"10.1016\/j.bspc.2023.105071"},{"key":"ref_102","doi-asserted-by":"crossref","unstructured":"Rocha, B.M., Pessoa, D., Marques, A., Carvalho, P., and Paiva, R.P. (2020). Automatic classification of adventitious respiratory sounds: A (un) solved problem?. Sensors, 21.","DOI":"10.3390\/s21010057"},{"key":"ref_103","doi-asserted-by":"crossref","unstructured":"Mang, L., Canadas-Quesada, F., Carabias-Orti, J., Combarro, E., and Ranilla, J. (2023). Cochleogram-based adventitious sounds classification using convolutional neural networks. Biomed. Signal Process. Control, 82.","DOI":"10.1016\/j.bspc.2022.104555"},{"key":"ref_104","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., and Gelly, S. (2020, January 26\u201330). An image is worth 16x16 words: Transformers for image recognition at scale. Proceedings of the International Conference on Learning Representations, Addis Ababa, Ethiopia."},{"key":"ref_105","doi-asserted-by":"crossref","unstructured":"Gemmeke, J.F., Ellis, D.P.W., Freedman, D., Jansen, A., Lawrence, W., Moore, R.C., Plakal, M., and Ritter, M. (2017, January 5\u20139). Audio Set: An ontology and human-labeled dataset for audio events. Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA.","DOI":"10.1109\/ICASSP.2017.7952261"},{"key":"ref_106","unstructured":"Warden, P. (2018). Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition. arXiv."},{"key":"ref_107","doi-asserted-by":"crossref","unstructured":"Neto, J., Arrais, N., Vinuto, T., and Lucena, J. (2022, January 24\u201327). Convolution-Vision Transformer for Automatic Lung Sound Classification. Proceedings of the 2022 35th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), Natal, Brazil.","DOI":"10.1109\/SIBGRAPI55357.2022.9991756"},{"key":"ref_108","doi-asserted-by":"crossref","unstructured":"Das, S., Pal, S., and Mitra, M. (2020). Acoustic feature based unsupervised approach of heart sound event detection. Comput. Biol. Med., 126.","DOI":"10.1016\/j.compbiomed.2020.103990"},{"key":"ref_109","doi-asserted-by":"crossref","first-page":"1171","DOI":"10.1121\/1.4864294","article-title":"Cochleagram-based audio pattern separation using two-dimensional non-negative matrix factorization with automatic sparsity adaptation","volume":"135","author":"Gao","year":"2014","journal-title":"J. Acoust. Soc. Am."},{"key":"ref_110","doi-asserted-by":"crossref","first-page":"1993","DOI":"10.1109\/TASLP.2014.2359159","article-title":"A feature study for classification-based speech separation at low signal-to-noise ratios","volume":"22","author":"Chen","year":"2014","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_111","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141, and Polosukhin, I. (2017, January 4\u20139). Attention is all you need. Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA."},{"key":"ref_112","doi-asserted-by":"crossref","first-page":"276","DOI":"10.1016\/j.apacoust.2018.12.035","article-title":"A novel wheezing detection approach based on constrained non-negative matrix factorization","volume":"148","year":"2019","journal-title":"Appl. Acoust."},{"key":"ref_113","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1145\/3065386","article-title":"Imagenet classification with deep convolutional neural networks","volume":"60","author":"Krizhevsky","year":"2017","journal-title":"Commun. ACM"},{"key":"ref_114","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1007\/s11263-015-0816-y","article-title":"Imagenet large scale visual recognition challenge","volume":"115","author":"Russakovsky","year":"2015","journal-title":"Int. J. Comput. Vis."},{"key":"ref_115","doi-asserted-by":"crossref","first-page":"101809","DOI":"10.1016\/j.artmed.2020.101809","article-title":"Scalogram based prediction model for respiratory disorders using optimized convolutional neural networks","volume":"103","author":"Jayalakshmy","year":"2020","journal-title":"Artif. Intell. Med."},{"key":"ref_116","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_117","doi-asserted-by":"crossref","unstructured":"Zakaria, N., Mohamed, F., Abdelghani, R., and Sundaraj, K. (2021, January 20\u201321). VGG16, ResNet-50, and GoogLeNet Deep Learning Architecture for Breathing Sound Classification: A Comparative Study. Proceedings of the 2021 International Conference on Artificial Intelligence for Cyber Security Systems and Privacy (AI-CSP), El Oued, Algeria.","DOI":"10.1109\/AI-CSP52968.2021.9671124"},{"key":"ref_118","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going deeper with convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_119","unstructured":"Wilcoxon, F. (1992). Breakthroughs in Statistics, Springer."},{"key":"ref_120","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1214\/aoms\/1177730491","article-title":"On a test of whether one of two random variables is stochastically larger than the other","volume":"18","author":"Mann","year":"1947","journal-title":"Ann. Math. Stat."},{"key":"ref_121","doi-asserted-by":"crossref","unstructured":"Chambres, G., Hanna, P., and Desainte-Catherine, M. (2018, January 4\u20136). Automatic detection of patient with respiratory diseases using lung sound analysis. Proceedings of the 2018 International Conference on Content-Based Multimedia Indexing (CBMI), La Rochelle, France.","DOI":"10.1109\/CBMI.2018.8516489"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/24\/2\/682\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T13:46:41Z","timestamp":1760104001000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/24\/2\/682"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,1,21]]},"references-count":121,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2024,1]]}},"alternative-id":["s24020682"],"URL":"https:\/\/doi.org\/10.3390\/s24020682","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,1,21]]}}}