{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,13]],"date-time":"2026-01-13T21:15:47Z","timestamp":1768338947404,"version":"3.49.0"},"reference-count":89,"publisher":"Association for Computing Machinery (ACM)","issue":"1","license":[{"start":{"date-parts":[[2021,3,19]],"date-time":"2021-03-19T00:00:00Z","timestamp":1616112000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. ACM Interact. Mob. Wearable Ubiquitous Technol."],"published-print":{"date-parts":[[2021,3,19]]},"abstract":"<jats:p>The prevalence of ubiquitous computing enables new opportunities for lung health monitoring and assessment. In the past few years, there have been extensive studies on cough detection using passively sensed audio signals. However, the generalizability of a cough detection model when applied to external datasets, especially in real-world implementation, is questionable and not explored adequately. Beyond detecting coughs, researchers have looked into how cough sounds can be used in assessing lung health. However, due to the challenges in collecting both cough sounds and lung health condition ground truth, previous studies have been hindered by the limited datasets. In this paper, we propose Listen2Cough to address these gaps. We first build an end-to-end deep learning architecture using public cough sound datasets to detect coughs within raw audio recordings. We employ a pre-trained MobileNet and integrate a number of augmentation techniques to improve the generalizability of our model. Without additional fine-tuning, our model is able to achieve an F1 score of 0.948 when tested against a new clean dataset, and 0.884 on another in-the-wild noisy dataset, leading to an advantage of 5.8% and 8.4% on average over the best baseline model, respectively. Then, to mitigate the issue of limited lung health data, we propose to transform the cough detection task to lung health assessment tasks so that the rich cough data can be leveraged. Our hypothesis is that these tasks extract and utilize similar effective representation from cough sounds. We embed the cough detection model into a multi-instance learning framework with the attention mechanism and further tune the model for lung health assessment tasks. Our final model achieves an F1-score of 0.912 on healthy v.s. unhealthy, 0.870 on obstructive v.s. non-obstructive, and 0.813 on COPD v.s. asthma classification, outperforming the baseline by 10.7%, 6.3%, and 3.7%, respectively. Moreover, the weight value in the attention layer can be used to identify important coughs highly correlated with lung health, which can potentially provide interpretability for expert diagnosis in the future.<\/jats:p>","DOI":"10.1145\/3448124","type":"journal-article","created":{"date-parts":[[2021,3,30]],"date-time":"2021-03-30T18:56:41Z","timestamp":1617130601000},"page":"1-22","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":37,"title":["Listen2Cough"],"prefix":"10.1145","volume":"5","author":[{"given":"Xuhai","family":"Xu","sequence":"first","affiliation":[{"name":"University of Washington, Parkway, Seattle, WA"}]},{"given":"Ebrahim","family":"Nemati","sequence":"additional","affiliation":[{"name":"Digital Health Lab, Samsung Research America, Mountain View, CA"}]},{"given":"Korosh","family":"Vatanparvar","sequence":"additional","affiliation":[{"name":"Digital Health Lab, Samsung Research America, Mountain View, CA"}]},{"given":"Viswam","family":"Nathan","sequence":"additional","affiliation":[{"name":"Digital Health Lab, Samsung Research America, Mountain View, CA"}]},{"given":"Tousif","family":"Ahmed","sequence":"additional","affiliation":[{"name":"Digital Health Lab, Samsung Research America, Mountain View, CA"}]},{"given":"Md Mahbubur","family":"Rahman","sequence":"additional","affiliation":[{"name":"Digital Health Lab, Samsung Research America, Mountain View, CA"}]},{"given":"Daniel","family":"McCaffrey","sequence":"additional","affiliation":[{"name":"Digital Health Lab, Samsung Research America, Mountain View, CA"}]},{"given":"Jilong","family":"Kuang","sequence":"additional","affiliation":[{"name":"Digital Health Lab, Samsung Research America, Mountain View, CA"}]},{"given":"Jun Alex","family":"Gao","sequence":"additional","affiliation":[{"name":"Digital Health Lab, Samsung Research America, Mountain View, CA"}]}],"member":"320","published-online":{"date-parts":[[2021,3,30]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"The Cost of Lung Disease","unstructured":"2014. The Cost of Lung Disease . Lung Health Institute . https:\/\/lunginstitute.com\/blog\/the-cost-of-lung-disease\/ 2014. The Cost of Lung Disease. Lung Health Institute. https:\/\/lunginstitute.com\/blog\/the-cost-of-lung-disease\/"},{"key":"e_1_2_1_2_1","unstructured":"2020. Covid-19 Sounds App. https:\/\/www.covid-19-sounds.org\/en\/  2020. Covid-19 Sounds App. https:\/\/www.covid-19-sounds.org\/en\/"},{"key":"e_1_2_1_3_1","unstructured":"2020. Global Initiative for Chronic Obstructive Lung Disease - Global Initiative for Chronic Obstructive Lung Disease. https:\/\/goldcopd.org\/  2020. Global Initiative for Chronic Obstructive Lung Disease - Global Initiative for Chronic Obstructive Lung Disease. https:\/\/goldcopd.org\/"},{"key":"e_1_2_1_4_1","unstructured":"2020. Lung Health & Diseases. American Lung Association. https:\/\/www.lung.org\/lung-health-diseases  2020. Lung Health & Diseases. American Lung Association. https:\/\/www.lung.org\/lung-health-diseases"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/3381014"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/BioCAS.2015.7348395"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/TBCAS.2016.2598794"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.bspc.2015.05.001"},{"key":"e_1_2_1_9_1","volume-title":"Soundnet: Learning sound representations from unlabeled video. In Advances in neural information processing systems. 892--900.","author":"Aytar Yusuf","year":"2016","unstructured":"Yusuf Aytar , Carl Vondrick , and Antonio Torralba . 2016 . Soundnet: Learning sound representations from unlabeled video. In Advances in neural information processing systems. 892--900. Yusuf Aytar, Carl Vondrick, and Antonio Torralba. 2016. Soundnet: Learning sound representations from unlabeled video. In Advances in neural information processing systems. 892--900."},{"key":"e_1_2_1_10_1","volume-title":"Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473","author":"Bahdanau Dzmitry","year":"2014","unstructured":"Dzmitry Bahdanau , Kyunghyun Cho , and Yoshua Bengio . 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 ( 2014 ). Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014)."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICHI.2019.8904554"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2011.194"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1016\/0021-9681(58)90054-7"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1183\/09031936.00057407"},{"key":"e_1_2_1_15_1","volume-title":"Variability of Voluntary Cough Airflow in Healthy Adults and Parkinson's Disease. Dysphagia","author":"Borders James C","year":"2020","unstructured":"James C Borders , Alexandra E Brandimore , and Michelle S Troche . 2020. Variability of Voluntary Cough Airflow in Healthy Adults and Parkinson's Disease. Dysphagia ( 2020 ), 1--7. James C Borders, Alexandra E Brandimore, and Michelle S Troche. 2020. Variability of Voluntary Cough Airflow in Healthy Adults and Parkinson's Disease. Dysphagia (2020), 1--7."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2017.10.009"},{"key":"e_1_2_1_17_1","volume-title":"Feature learning from spectrograms for assessment of personality traits","author":"Carbonneau Marc-Andr\u00e9","year":"2017","unstructured":"Marc-Andr\u00e9 Carbonneau , Eric Granger , Yazid Attabi , and Ghyslain Gagnon . 2017. Feature learning from spectrograms for assessment of personality traits . IEEE Transactions on Affective Computing ( 2017 ). Marc-Andr\u00e9 Carbonneau, Eric Granger, Yazid Attabi, and Ghyslain Gagnon. 2017. Feature learning from spectrograms for assessment of personality traits. IEEE Transactions on Affective Computing (2017)."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/EMBC.2016.7591897"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3313831.3376444"},{"key":"e_1_2_1_20_1","volume-title":"An attentive survey of attention models. arXiv preprint arXiv:1904.02874","author":"Chaudhari Sneha","year":"2019","unstructured":"Sneha Chaudhari , Gungor Polatkan , Rohan Ramanath , and Varun Mithal . 2019. An attentive survey of attention models. arXiv preprint arXiv:1904.02874 ( 2019 ). Sneha Chaudhari, Gungor Polatkan, Rohan Ramanath, and Varun Mithal. 2019. An attentive survey of attention models. arXiv preprint arXiv:1904.02874 (2019)."},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1089\/tmj.2017.0008"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPR.2014.268"},{"key":"e_1_2_1_23_1","volume-title":"Jimeng Sun, Joshua Kulas, Andy Schuetz, and Walter Stewart.","author":"Choi Edward","year":"2016","unstructured":"Edward Choi , Mohammad Taha Bahadori , Jimeng Sun, Joshua Kulas, Andy Schuetz, and Walter Stewart. 2016 . Retain : An interpretable predictive model for healthcare using reverse time attention mechanism. In Advances in Neural Information Processing Systems . 3504--3512. Edward Choi, Mohammad Taha Bahadori, Jimeng Sun, Joshua Kulas, Andy Schuetz, and Walter Stewart. 2016. Retain: An interpretable predictive model for healthcare using reverse time attention mechanism. In Advances in Neural Information Processing Systems. 3504--3512."},{"key":"e_1_2_1_24_1","volume-title":"Weakly supervised object localization with multi-fold multiple instance learning","author":"Cinbis Ramazan Gokberk","year":"2016","unstructured":"Ramazan Gokberk Cinbis , Jakob Verbeek , and Cordelia Schmid . 2016. Weakly supervised object localization with multi-fold multiple instance learning . IEEE transactions on pattern analysis and machine intelligence 39, 1 ( 2016 ), 189--203. Ramazan Gokberk Cinbis, Jakob Verbeek, and Cordelia Schmid. 2016. Weakly supervised object localization with multi-fold multiple instance learning. IEEE transactions on pattern analysis and machine intelligence 39, 1 (2016), 189--203."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2017.7952190"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_2_1_27_1","volume-title":"Solving the multiple instance problem with axis-parallel rectangles. Artificial intelligence 89, 1-2","author":"Dietterich Thomas G","year":"1997","unstructured":"Thomas G Dietterich , Richard H Lathrop , and Tom\u00e1s Lozano-P\u00e9rez . 1997. Solving the multiple instance problem with axis-parallel rectangles. Artificial intelligence 89, 1-2 ( 1997 ), 31--71. Thomas G Dietterich, Richard H Lathrop, and Tom\u00e1s Lozano-P\u00e9rez. 1997. Solving the multiple instance problem with axis-parallel rectangles. Artificial intelligence 89, 1-2 (1997), 31--71."},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP40776.2020.9054725"},{"key":"e_1_2_1_29_1","volume-title":"Xavier Favory, Jordi Pons, and Xavier Serra.","author":"Fonseca Eduardo","year":"2018","unstructured":"Eduardo Fonseca , Manoj Plakal , Frederic Font , Daniel PW Ellis , Xavier Favory, Jordi Pons, and Xavier Serra. 2018 . General-purpose tagging of freesound audio with audioset labels: Task de scription, dataset, and baseline. arXiv preprint arXiv:1807.09902 (2018). Eduardo Fonseca, Manoj Plakal, Frederic Font, Daniel PW Ellis, Xavier Favory, Jordi Pons, and Xavier Serra. 2018. General-purpose tagging of freesound audio with audioset labels: Task description, dataset, and baseline. arXiv preprint arXiv:1807.09902 (2018)."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10878-017-0236-8"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2017.7952261"},{"key":"e_1_2_1_32_1","volume-title":"Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: Concepts, tools, and techniques to build intelligent systems","author":"G\u00e9ron Aur\u00e9lien","unstructured":"Aur\u00e9lien G\u00e9ron . 2019. Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: Concepts, tools, and techniques to build intelligent systems . O'Reilly Media . Aur\u00e9lien G\u00e9ron. 2019. Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: Concepts, tools, and techniques to build intelligent systems. O'Reilly Media."},{"key":"e_1_2_1_33_1","volume-title":"Proceedings of the 14th International Conference on Artificial Intelligence and Statistics (AISTATS). 315--323","author":"Glorot Xavier","year":"2011","unstructured":"Xavier Glorot , Antoine Bordes , and Yoshua Bengio . 2011 . Deep Sparse Rectifier Neural Networks . In Proceedings of the 14th International Conference on Artificial Intelligence and Statistics (AISTATS). 315--323 . Xavier Glorot, Antoine Bordes, and Yoshua Bengio. 2011. Deep Sparse Rectifier Neural Networks. In Proceedings of the 14th International Conference on Artificial Intelligence and Statistics (AISTATS). 315--323."},{"key":"e_1_2_1_34_1","volume-title":"Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems. ACM","author":"Goel Mayank","unstructured":"Mayank Goel , Elliot Saba , Maia Stiber , Eric Whitmire , Josh Fromm , Eric C. Larson , Gaetano Borriello , and Shwetak N. Patel . 2016. Spirocall: Measuring lung function over a phone call . In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems. ACM , New York, NY, USA, 5675--5685. https:\/\/doi.org\/10.1145\/2858036.2858401 10.1145\/2858036.2858401 Mayank Goel, Elliot Saba, Maia Stiber, Eric Whitmire, Josh Fromm, Eric C. Larson, Gaetano Borriello, and Shwetak N. Patel. 2016. Spirocall: Measuring lung function over a phone call. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems. ACM, New York, NY, USA, 5675--5685. https:\/\/doi.org\/10.1145\/2858036.2858401"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/2064942.2064944"},{"key":"e_1_2_1_36_1","volume-title":"Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. arXiv preprint arXiv:1510.00149","author":"Han Song","year":"2015","unstructured":"Song Han , Huizi Mao , and William J Dally . 2015. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. arXiv preprint arXiv:1510.00149 ( 2015 ). Song Han, Huizi Mao, and William J Dally. 2015. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. arXiv preprint arXiv:1510.00149 (2015)."},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00140"},{"key":"e_1_2_1_39_1","volume-title":"Aclnet: efficient end-to-end audio classification cnn. arXiv preprint arXiv:1811.06669","author":"Huang Jonathan J","year":"2018","unstructured":"Jonathan J Huang and Juan Jose Alvarado Leanos . 2018. Aclnet: efficient end-to-end audio classification cnn. arXiv preprint arXiv:1811.06669 ( 2018 ). Jonathan J Huang and Juan Jose Alvarado Leanos. 2018. Aclnet: efficient end-to-end audio classification cnn. arXiv preprint arXiv:1811.06669 (2018)."},{"key":"e_1_2_1_40_1","volume-title":"It sounds like you have a cold! Testing voice features for the Interspeech 2017 Computational Paralinguistics Cold Challenge","author":"Huckvale MA","unstructured":"MA Huckvale and Andr\u00e1s Beke . 2017. It sounds like you have a cold! Testing voice features for the Interspeech 2017 Computational Paralinguistics Cold Challenge . International Speech Communication Association (ISCA) . MA Huckvale and Andr\u00e1s Beke. 2017. It sounds like you have a cold! Testing voice features for the Interspeech 2017 Computational Paralinguistics Cold Challenge. International Speech Communication Association (ISCA)."},{"key":"e_1_2_1_41_1","volume-title":"Attention-based deep multiple instance learning. arXiv preprint arXiv:1802.04712","author":"Ilse Maximilian","year":"2018","unstructured":"Maximilian Ilse , Jakub M Tomczak , and Max Welling . 2018. Attention-based deep multiple instance learning. arXiv preprint arXiv:1802.04712 ( 2018 ). Maximilian Ilse, Jakub M Tomczak, and Max Welling. 2018. Attention-based deep multiple instance learning. arXiv preprint arXiv:1802.04712 (2018)."},{"key":"e_1_2_1_42_1","volume-title":"AI4COVID-19: AI enabled preliminary diagnosis for COVID-19 from cough samples via an app. arXiv preprint arXiv:2004.01275","author":"Imran Ali","year":"2020","unstructured":"Ali Imran , Iryna Posokhova , Haneya N Qureshi , Usama Masood , Sajid Riaz , Kamran Ali , Charles N John , and Muhammad Nabeel . 2020. AI4COVID-19: AI enabled preliminary diagnosis for COVID-19 from cough samples via an app. arXiv preprint arXiv:2004.01275 ( 2020 ). Ali Imran, Iryna Posokhova, Haneya N Qureshi, Usama Masood, Sajid Riaz, Kamran Ali, Charles N John, and Muhammad Nabeel. 2020. AI4COVID-19: AI enabled preliminary diagnosis for COVID-19 from cough samples via an app. arXiv preprint arXiv:2004.01275 (2020)."},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDSP.2009.5201259"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1089\/tmj.2014.0025"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/JBHI.2015.2427511"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00277"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/3065386"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/OJEMB.2020.3026928"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/3242587.3242609"},{"key":"e_1_2_1_50_1","volume-title":"Proceedings of the 2012 ACM Conference on Ubiquitous Computing - UbiComp '12","author":"Larson Eric C.","unstructured":"Eric C. Larson , Mayank Goel , Gaetano Boriello , Sonya Heltshe , Margaret Rosenfeld , and Shwetak N. Patel . 2012. SpiroSmart: using a microphone to measure lung function on a mobile phone . In Proceedings of the 2012 ACM Conference on Ubiquitous Computing - UbiComp '12 . ACM Press, New York, New York, USA, 280. https:\/\/doi.org\/10.1145\/2370216.2370261 10.1145\/2370216.2370261 Eric C. Larson, Mayank Goel, Gaetano Boriello, Sonya Heltshe, Margaret Rosenfeld, and Shwetak N. Patel. 2012. SpiroSmart: using a microphone to measure lung function on a mobile phone. In Proceedings of the 2012 ACM Conference on Ubiquitous Computing - UbiComp '12. ACM Press, New York, New York, USA, 280. https:\/\/doi.org\/10.1145\/2370216.2370261"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/CISP.2013.6743882"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1145\/3363574"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/ChinaSIP.2013.6625319"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/BIBM.2014.6999220"},{"key":"e_1_2_1_55_1","volume-title":"Audiogmenter: a MATLAB Toolbox for Audio Data Augmentation. arXiv preprint arXiv:1912.05472","author":"Maguolo Gianluca","year":"2019","unstructured":"Gianluca Maguolo , Michelangelo Paci , Loris Nanni , and Ludovico Bonan . 2019. Audiogmenter: a MATLAB Toolbox for Audio Data Augmentation. arXiv preprint arXiv:1912.05472 ( 2019 ). Gianluca Maguolo, Michelangelo Paci, Loris Nanni, and Ludovico Bonan. 2019. Audiogmenter: a MATLAB Toolbox for Audio Data Augmentation. arXiv preprint arXiv:1912.05472 (2019)."},{"key":"e_1_2_1_56_1","volume-title":"Obstructive and restrictive lung disease and markers of inflammation: data from the Third National Health and Nutrition Examination. The American journal of medicine 114, 9","author":"Mannino David M","year":"2003","unstructured":"David M Mannino , Earl S Ford , and Stephen C Redd . 2003. Obstructive and restrictive lung disease and markers of inflammation: data from the Third National Health and Nutrition Examination. The American journal of medicine 114, 9 ( 2003 ), 758--762. David M Mannino, Earl S Ford, and Stephen C Redd. 2003. Obstructive and restrictive lung disease and markers of inflammation: data from the Third National Health and Nutrition Examination. The American journal of medicine 114, 9 (2003), 758--762."},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/TBME.2006.873548"},{"key":"e_1_2_1_58_1","volume-title":"HLH Across Speciality Collaboration, et al","author":"Mehta Puja","year":"2020","unstructured":"Puja Mehta , Daniel F McAuley , Michael Brown , Emilie Sanchez , Rachel S Tattersall , Jessica J Manson , HLH Across Speciality Collaboration, et al . 2020 . COVID-19: consider cytokine storm syndromes and immunosuppression. Lancet (London, England) 395, 10229 (2020), 1033. Puja Mehta, Daniel F McAuley, Michael Brown, Emilie Sanchez, Rachel S Tattersall, Jessica J Manson, HLH Across Speciality Collaboration, et al. 2020. COVID-19: consider cytokine storm syndromes and immunosuppression. Lancet (London, England) 395, 10229 (2020), 1033."},{"key":"e_1_2_1_59_1","volume-title":"Robust detection of audio-cough events using local Hu moments","author":"Monge-\u00c1lvarez Jes\u00fas","year":"2018","unstructured":"Jes\u00fas Monge-\u00c1lvarez , Carlos Hoyos-Barcel\u00f3 , Paul Lesso , and Pablo Casaseca-de- la Higuera . 2018. Robust detection of audio-cough events using local Hu moments . IEEE journal of biomedical and health informatics 23, 1 ( 2018 ), 184--196. Jes\u00fas Monge-\u00c1lvarez, Carlos Hoyos-Barcel\u00f3, Paul Lesso, and Pablo Casaseca-de-la Higuera. 2018. Robust detection of audio-cough events using local Hu moments. IEEE journal of biomedical and health informatics 23, 1 (2018), 184--196."},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1109\/BSN.2019.8771043"},{"key":"e_1_2_1_61_1","volume-title":"Estimation of the Lung Function Using Acoustic Features of the Voluntary Cough. 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society, EMBC 2020.","author":"Nemati Ebrahim","year":"2020","unstructured":"Ebrahim Nemati , Md Juber Rahman , Korosh Vatanparvar , Viswam Nathan , and Jilong Kuang . 2020 . Estimation of the Lung Function Using Acoustic Features of the Voluntary Cough. 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society, EMBC 2020. Ebrahim Nemati, Md Juber Rahman, Korosh Vatanparvar, Viswam Nathan, and Jilong Kuang. 2020. Estimation of the Lung Function Using Acoustic Features of the Voluntary Cough. 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society, EMBC 2020."},{"key":"e_1_2_1_62_1","volume-title":"EAI International Conference on Body Area Networks. Springer, 221--232","author":"Nemati Ebrahim","year":"2018","unstructured":"Ebrahim Nemati , Md Mahbubur Rahman , Viswam Nathan , and Jilong Kuang . 2018 . Private audio-based cough sensing for in-home pulmonary assessment using mobile devices . In EAI International Conference on Body Area Networks. Springer, 221--232 . Ebrahim Nemati, Md Mahbubur Rahman, Viswam Nathan, and Jilong Kuang. 2018. Private audio-based cough sensing for in-home pulmonary assessment using mobile devices. In EAI International Conference on Body Area Networks. Springer, 221--232."},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1121\/1.5087827"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1052"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1145\/2733373.2806390"},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1378\/chest.07-0141"},{"key":"e_1_2_1_67_1","doi-asserted-by":"crossref","first-page":"e0162128","DOI":"10.1371\/journal.pone.0162128","article-title":"A cough-based algorithm for automatic diagnosis of pertussis","volume":"11","author":"Adhi Pramono Renard Xaviero","year":"2016","unstructured":"Renard Xaviero Adhi Pramono , Syed Anas Imtiaz , and Esther Rodriguez-Villegas . 2016 . A cough-based algorithm for automatic diagnosis of pertussis . PloS one 11 , 9 (2016), e0162128 . Renard Xaviero Adhi Pramono, Syed Anas Imtiaz, and Esther Rodriguez-Villegas. 2016. A cough-based algorithm for automatic diagnosis of pertussis. PloS one 11, 9 (2016), e0162128.","journal-title":"PloS one"},{"key":"e_1_2_1_68_1","first-page":"396","article-title":"Transmission and Quality Aspects (STQ); Speech Quality Performance in the Presence of Background Noise; Part 1: Background Noise Simulation Technique and Background Noise Database","volume":"202","author":"Processing Speech","year":"2008","unstructured":"Speech Processing . 2008 . Transmission and Quality Aspects (STQ); Speech Quality Performance in the Presence of Background Noise; Part 1: Background Noise Simulation Technique and Background Noise Database . ETSI EG 202 (2008), 396 -- 391 . Speech Processing. 2008. Transmission and Quality Aspects (STQ); Speech Quality Performance in the Presence of Background Noise; Part 1: Background Noise Simulation Technique and Background Noise Database. ETSI EG 202 (2008), 396--1.","journal-title":"ETSI EG"},{"key":"e_1_2_1_69_1","volume-title":"A multiple-instance learning framework for diabetic retinopathy screening. Medical image analysis 16, 6","author":"Quellec Gw\u00e9nol\u00e9","year":"2012","unstructured":"Gw\u00e9nol\u00e9 Quellec , Mathieu Lamard , Michael D Abr\u00e0moff , Etienne Decenci\u00e8re , Bruno Lay , Ali Erginay , B\u00e9atrice Cochener , and Guy Cazuguel . 2012. A multiple-instance learning framework for diabetic retinopathy screening. Medical image analysis 16, 6 ( 2012 ), 1228--1240. Gw\u00e9nol\u00e9 Quellec, Mathieu Lamard, Michael D Abr\u00e0moff, Etienne Decenci\u00e8re, Bruno Lay, Ali Erginay, B\u00e9atrice Cochener, and Guy Cazuguel. 2012. A multiple-instance learning framework for diabetic retinopathy screening. Medical image analysis 16, 6 (2012), 1228--1240."},{"key":"e_1_2_1_70_1","volume-title":"Feed-forward networks with attention can solve some long-term memory problems. arXiv preprint arXiv:1512.08756","author":"Raffel Colin","year":"2015","unstructured":"Colin Raffel and Daniel PW Ellis . 2015. Feed-forward networks with attention can solve some long-term memory problems. arXiv preprint arXiv:1512.08756 ( 2015 ). Colin Raffel and Daniel PW Ellis. 2015. Feed-forward networks with attention can solve some long-term memory problems. arXiv preprint arXiv:1512.08756 (2015)."},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.smhl.2019.100081"},{"key":"e_1_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.1109\/PerCom45495.2020.9127355"},{"key":"e_1_2_1_73_1","volume-title":"Twenty-Fourth International Joint Conference on Artificial Intelligence.","author":"Ruiz-Mu\u00f1oz Jose F","year":"2015","unstructured":"Jose F Ruiz-Mu\u00f1oz , Mauricio Orozco Alzate , and Germ\u00e1n Castellanos-Dom\u00ednguez . 2015 . Multiple instance learning-based birdsong classification using unsupervised recording segmentation . In Twenty-Fourth International Joint Conference on Artificial Intelligence. Jose F Ruiz-Mu\u00f1oz, Mauricio Orozco Alzate, and Germ\u00e1n Castellanos-Dom\u00ednguez. 2015. Multiple instance learning-based birdsong classification using unsupervised recording segmentation. In Twenty-Fourth International Joint Conference on Artificial Intelligence."},{"key":"e_1_2_1_74_1","doi-asserted-by":"publisher","DOI":"10.1109\/PerCom45495.2020.9127380"},{"key":"e_1_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00474"},{"key":"e_1_2_1_76_1","volume-title":"Predicting spirometry readings using cough sound features and regression. Physiological measurement 39, 9","author":"Sharan Roneel V","year":"2018","unstructured":"Roneel V Sharan , Udantha R Abeyratne , Vinayak R Swarnkar , Scott Claxton , Craig Hukins , and Paul Porter . 2018. Predicting spirometry readings using cough sound features and regression. Physiological measurement 39, 9 ( 2018 ), 095001. Roneel V Sharan, Udantha R Abeyratne, Vinayak R Swarnkar, Scott Claxton, Craig Hukins, and Paul Porter. 2018. Predicting spirometry readings using cough sound features and regression. Physiological measurement 39, 9 (2018), 095001."},{"key":"e_1_2_1_77_1","volume-title":"Prasanta Kumar Ghosh, Sriram Ganapathy, et al.","author":"Sharma Neeraj","year":"2020","unstructured":"Neeraj Sharma , Prashant Krishnan , Rohit Kumar , Shreyas Ramoji , Srikanth Raj Chetupalli , Prasanta Kumar Ghosh, Sriram Ganapathy, et al. 2020 . Coswara-A Database of Breathing, Cough , and Voice Sounds for COVID-19 Diagnosis . arXiv preprint arXiv:2005.10548 (2020). Neeraj Sharma, Prashant Krishnan, Rohit Kumar, Shreyas Ramoji, Srikanth Raj Chetupalli, Prasanta Kumar Ghosh, Sriram Ganapathy, et al. 2020. Coswara-A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis. arXiv preprint arXiv:2005.10548 (2020)."},{"key":"e_1_2_1_78_1","volume-title":"Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman . 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 ( 2014 ). Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)."},{"key":"e_1_2_1_79_1","doi-asserted-by":"publisher","DOI":"10.2147\/copd.2006.1.3.305"},{"key":"e_1_2_1_80_1","volume-title":"inception-resnet and the impact of residual connections on learning. arXiv preprint arXiv:1602.07261","author":"Szegedy Christian","year":"2016","unstructured":"Christian Szegedy , Sergey Ioffe , Vincent Vanhoucke , and Alex Alemi . 2016. Inception-v4 , inception-resnet and the impact of residual connections on learning. arXiv preprint arXiv:1602.07261 ( 2016 ). Christian Szegedy, Sergey Ioffe, Vincent Vanhoucke, and Alex Alemi. 2016. Inception-v4, inception-resnet and the impact of residual connections on learning. arXiv preprint arXiv:1602.07261 (2016)."},{"key":"e_1_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.1121\/1.4799597"},{"key":"e_1_2_1_82_1","volume-title":"Alzheimer's Disease Neuroimaging Initiative, et al","author":"Tong Tong","year":"2014","unstructured":"Tong Tong , Robin Wolz , Qinquan Gao , Ricardo Guerrero , Joseph V Hajnal , Daniel Rueckert , Alzheimer's Disease Neuroimaging Initiative, et al . 2014 . Multiple instance learning for classification of dementia in brain MRI. Medical image analysis 18, 5 (2014), 808--818. Tong Tong, Robin Wolz, Qinquan Gao, Ricardo Guerrero, Joseph V Hajnal, Daniel Rueckert, Alzheimer's Disease Neuroimaging Initiative, et al. 2014. Multiple instance learning for classification of dementia in brain MRI. Medical image analysis 18, 5 (2014), 808--818."},{"key":"e_1_2_1_83_1","volume-title":"Gonzalez","author":"Wan Alvin","year":"2020","unstructured":"Alvin Wan , Xiaoliang Dai , Peizhao Zhang , Zijian He , Yuandong Tian , Saining Xie , Bichen Wu , Matthew Yu , Tao Xu , Kan Chen , Peter Vajda , and Joseph E . Gonzalez . 2020 . FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions. In arXiv. arXiv:2004.05565 Alvin Wan, Xiaoliang Dai, Peizhao Zhang, Zijian He, Yuandong Tian, Saining Xie, Bichen Wu, Matthew Yu, Tao Xu, Kan Chen, Peter Vajda, and Joseph E. Gonzalez. 2020. FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions. In arXiv. arXiv:2004.05565"},{"key":"e_1_2_1_84_1","volume-title":"Weakly supervised histopathology cancer image segmentation and classification. Medical image analysis 18, 3","author":"Xu Yan","year":"2014","unstructured":"Yan Xu , Jun-Yan Zhu , I Eric , Chao Chang , Maode Lai , and Zhuowen Tu. 2014. Weakly supervised histopathology cancer image segmentation and classification. Medical image analysis 18, 3 ( 2014 ), 591--604. Yan Xu, Jun-Yan Zhu, I Eric, Chao Chang, Maode Lai, and Zhuowen Tu. 2014. Weakly supervised histopathology cancer image segmentation and classification. Medical image analysis 18, 3 (2014), 591--604."},{"key":"e_1_2_1_85_1","doi-asserted-by":"publisher","DOI":"10.1378\/chest.11-2728"},{"key":"e_1_2_1_86_1","volume-title":"mixup: Beyond empirical risk minimization. arXiv preprint arXiv:1710.09412","author":"Zhang Hongyi","year":"2017","unstructured":"Hongyi Zhang , Moustapha Cisse , Yann N Dauphin , and David Lopez-Paz . 2017. mixup: Beyond empirical risk minimization. arXiv preprint arXiv:1710.09412 ( 2017 ). Hongyi Zhang, Moustapha Cisse, Yann N Dauphin, and David Lopez-Paz. 2017. mixup: Beyond empirical risk minimization. arXiv preprint arXiv:1710.09412 (2017)."},{"key":"e_1_2_1_88_1","volume-title":"Multi-instance learning based web mining. Applied intelligence 22, 2","author":"Zhou Zhi-Hua","year":"2005","unstructured":"Zhi-Hua Zhou , Kai Jiang , and Ming Li. 2005. Multi-instance learning based web mining. Applied intelligence 22, 2 ( 2005 ), 135--147. Zhi-Hua Zhou, Kai Jiang, and Ming Li. 2005. Multi-instance learning based web mining. Applied intelligence 22, 2 (2005), 135--147."},{"key":"e_1_2_1_89_1","doi-asserted-by":"publisher","DOI":"10.1109\/BMEI.2013.6746943"},{"key":"e_1_2_1_90_1","doi-asserted-by":"publisher","DOI":"10.1109\/CCWC.2017.7868345"}],"container-title":["Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3448124","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3448124","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:24:59Z","timestamp":1750195499000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3448124"}},"subtitle":["Leveraging End-to-End Deep Learning Cough Detection Model to Enhance Lung Health Assessment Using Passively Sensed Audio"],"short-title":[],"issued":{"date-parts":[[2021,3,19]]},"references-count":89,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,3,19]]}},"alternative-id":["10.1145\/3448124"],"URL":"https:\/\/doi.org\/10.1145\/3448124","relation":{},"ISSN":["2474-9567"],"issn-type":[{"value":"2474-9567","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,3,19]]},"assertion":[{"value":"2021-03-30","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}