{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,7]],"date-time":"2026-05-07T12:11:32Z","timestamp":1778155892814,"version":"3.51.4"},"reference-count":15,"publisher":"SAGE Publications","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IDT"],"published-print":{"date-parts":[[2022,1,10]]},"abstract":"<jats:p>Reliable and rapid non-invasive testing has become essential for COVID-19 diagnosis and tracking statistics. Recent studies motivate the use of modern machine learning (ML) and deep learning (DL) tools that utilize features of coughing sounds for COVID-19 diagnosis. In this paper, we describe system designs that we developed for COVID-19 cough detection with the long-term objective of embedding them in a testing device. More specifically, we use log-mel spectrogram features extracted from the coughing audio signal and design a series of customized deep learning algorithms to develop fast and automated diagnosis tools for COVID-19 detection. We first explore the use of a deep neural network with fully connected layers. Additionally, we investigate prospects of efficient implementation by examining the impact on the detection performance by pruning the fully connected neural network based on the Lottery Ticket Hypothesis (LTH) optimization process. In general, pruned neural networks have been shown to provide similar performance gains to that of unpruned networks with reduced computational complexity in a variety of signal processing applications. Finally, we investigate the use of convolutional neural network architectures and in particular the VGG-13 architecture which we tune specifically for this application. Our results show that a unique ensembling of the VGG-13 architecture trained using a combination of binary cross entropy and focal losses with data augmentation significantly outperforms the fully connected networks and other recently proposed baselines on the DiCOVA 2021 COVID-19 cough audio dataset. Our customized VGG-13 model achieves an average validation AUROC of 82.23% and a test AUROC of 78.3% at a sensitivity of 80.49%.<\/jats:p>","DOI":"10.3233\/idt-210206","type":"journal-article","created":{"date-parts":[[2021,12,7]],"date-time":"2021-12-07T14:08:17Z","timestamp":1638886097000},"page":"655-665","source":"Crossref","is-referenced-by-count":10,"title":["COVID-19 detection using cough sound analysis and deep learning algorithms"],"prefix":"10.1177","volume":"15","author":[{"given":"Sunil","family":"Rao","sequence":"first","affiliation":[{"name":"School of ECEE, SenSIP Center, Arizona State University, Tempe, AZ, USA"}]},{"given":"Vivek","family":"Narayanaswamy","sequence":"additional","affiliation":[{"name":"School of ECEE, SenSIP Center, Arizona State University, Tempe, AZ, USA"}]},{"given":"Michael","family":"Esposito","sequence":"additional","affiliation":[{"name":"School of ECEE, SenSIP Center, Arizona State University, Tempe, AZ, USA"}]},{"given":"Jayaraman J.","family":"Thiagarajan","sequence":"additional","affiliation":[{"name":"Lawrence Livermore Nat. Labs, Livermore, CA, USA"}]},{"given":"Andreas","family":"Spanias","sequence":"additional","affiliation":[{"name":"School of ECEE, SenSIP Center, Arizona State University, Tempe, AZ, USA"}]}],"member":"179","reference":[{"key":"10.3233\/IDT-210206_ref2","doi-asserted-by":"crossref","first-page":"100378","DOI":"10.1016\/j.imu.2020.100378","article-title":"AI4COVID-19: AI enabled preliminary diagnosis for COVID-19 from cough samples via an app","volume":"20","author":"Imran","year":"2020","journal-title":"Informatics in Medicine Unlocked."},{"issue":"1","key":"10.3233\/IDT-210206_ref4","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41597-021-00937-4","article-title":"The COUGHVID crowdsourcing dataset, a corpus for the study of large-scale cough analysis algorithms","volume":"8","author":"Orlandic","year":"2021","journal-title":"Scientific Data."},{"key":"10.3233\/IDT-210206_ref6","doi-asserted-by":"crossref","unstructured":"Spanias AS. Speech coding: A tutorial review. Proceedings of the IEEE. 1994 Oct; 82(10): 1541-82.","DOI":"10.1109\/5.326413"},{"key":"10.3233\/IDT-210206_ref7","doi-asserted-by":"crossref","unstructured":"Wichern G, Xue J, Thornburg H, Mechtley B, Spanias A. Segmentation, indexing, and retrieval for environmental and natural sounds. IEEE Transactions on Audio, Speech, and Language Processing. 2010 Feb 8; 18(3): 688-707.","DOI":"10.1109\/TASL.2010.2041384"},{"key":"10.3233\/IDT-210206_ref8","doi-asserted-by":"crossref","unstructured":"Loizou PC, Spanias AS. High-performance alphabet recognition. IEEE Transactions on Speech and Audio Processing. 1996 Nov; 4(6): 430-45.","DOI":"10.1109\/89.544528"},{"issue":"3","key":"10.3233\/IDT-210206_ref11","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1109\/89.759042","article-title":"Cepstrum-based pitch detection using a new statistical V\/UV classification algorithm","volume":"7","author":"Ahmadi","year":"1999","journal-title":"IEEE Transactions on Speech and Audio Processing."},{"key":"10.3233\/IDT-210206_ref13","doi-asserted-by":"crossref","unstructured":"Coppock H, Gaskell A, Tzirakis P, Baird A, Jones L, Schuller B. End-to-end convolutional neural network enables COVID-19 detection from breath and cough audio: A pilot study. BMJ Innovations. 2021 Apr 1; 7(2).","DOI":"10.1136\/bmjinnov-2021-000668"},{"issue":"2","key":"10.3233\/IDT-210206_ref15","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1056\/NEJMc2008646","article-title":"Early detection of Covid-19 through a citywide pandemic surveillance platform","volume":"383","author":"Chu","year":"2020","journal-title":"New England Journal of Medicine."},{"key":"10.3233\/IDT-210206_ref20","doi-asserted-by":"crossref","unstructured":"Berisha V, Wisler A, Hero AO, Spanias A. Empirically estimable classification bounds based on a nonparametric divergence measure. IEEE Transactions on Signal Processing. 2015 Sep 10; 64(3): 580-91.","DOI":"10.1109\/TSP.2015.2477805"},{"key":"10.3233\/IDT-210206_ref23","doi-asserted-by":"crossref","first-page":"275","DOI":"10.1109\/OJEMB.2020.3026928","article-title":"COVID-19 artificial intelligence diagnosis using only cough recordings","volume":"1","author":"Laguarta","year":"2020","journal-title":"IEEE Open Journal of Engineering in Medicine and Biology."},{"key":"10.3233\/IDT-210206_ref24","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1109\/OJEMB.2020.2998051","article-title":"A framework for biomarkers of COVID-19 based on coordination of speech-production subsystems","volume":"1","author":"Quatieri","year":"2020","journal-title":"IEEE Open Journal of Engineering in Medicine and Biology."},{"issue":"1","key":"10.3233\/IDT-210206_ref27","doi-asserted-by":"crossref","first-page":"1","DOI":"10.2200\/S01027ED1V01Y202006PEL013","article-title":"Machine learning for solar array monitoring, optimization, and control","volume":"7","author":"Rao","year":"2020","journal-title":"Synthesis Lectures on Power Electronics."},{"issue":"2","key":"10.3233\/IDT-210206_ref32","doi-asserted-by":"crossref","first-page":"338","DOI":"10.2214\/AJR.20.22961","article-title":"Early clinical and CT manifestations of coronavirus disease 2019 (COVID-19) pneumonia","volume":"215","author":"Han","year":"2020","journal-title":"American Journal of Roentgenology."},{"issue":"10","key":"10.3233\/IDT-210206_ref36","doi-asserted-by":"crossref","first-page":"1202","DOI":"10.1089\/tmj.2020.0114","article-title":"Smartphone-based self-testing of COVID-19 using breathing sounds","volume":"26","author":"Faezipour","year":"2020","journal-title":"Telemedicine and e-Health."},{"issue":"4","key":"10.3233\/IDT-210206_ref37","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1109\/TASSP.1980.1163420","article-title":"Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences","volume":"28","author":"Davis","year":"1980","journal-title":"IEEE Transactions on Acoustics, Speech, and Signal Processing."}],"container-title":["Intelligent Decision Technologies"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/IDT-210206","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T09:23:01Z","timestamp":1777454581000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/full\/10.3233\/IDT-210206"}},"subtitle":[],"editor":[{"given":"George A.","family":"Tsihrintzis","sequence":"additional","affiliation":[]},{"given":"Maria","family":"Virvou","sequence":"additional","affiliation":[]},{"given":"Ioannis","family":"Hatzilygeroudis","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,1,10]]},"references-count":15,"journal-issue":{"issue":"4"},"URL":"https:\/\/doi.org\/10.3233\/idt-210206","relation":{},"ISSN":["1872-4981","1875-8843"],"issn-type":[{"value":"1872-4981","type":"print"},{"value":"1875-8843","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,1,10]]}}}