{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,21]],"date-time":"2026-03-21T19:16:58Z","timestamp":1774120618455,"version":"3.50.1"},"reference-count":48,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2023,9,27]],"date-time":"2023-09-27T00:00:00Z","timestamp":1695772800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. ACM Interact. Mob. Wearable Ubiquitous Technol."],"published-print":{"date-parts":[[2023,9,27]]},"abstract":"<jats:p>Coronavirus disease 2019 (COVID-19) pneumonia still persists, and its chief complaint is dry cough. Physicians design wireless stethoscopes to facilitate diagnosis; however, lung sounds are easily corrupted by external noise. To achieve lung sound enhancement, prior work mostly assumes that the amounts of clean and noisy data are the same. This assumption is rarely met because data collection and annotation require extensive labor. Such data imbalance across domains is common in real-world IoT systems, e.g., sound enhancement and WiFi-based human sensing. In this paper, we propose SIDA, a self-supervised imbalanced domain adaptation framework for sound enhancement and WiFi sensing, which makes it a generic time series domain adaptation solution for IoT systems. SIDA uses a self-supervised imbalanced domain adaptation model that separately learns the representation of time series signals in a minority domain with limited samples, a majority domain with rich samples, and their mapping relations. For lung sound enhancement, we further propose a phase correction model to sanitize the phase and an SNR prediction algorithm to recursively perform domain adaptation in an imbalanced noisy and clean lung sound dataset. 
Extensive experiments demonstrate that SIDA increases noisy samples' SNR by 16.49 dB and 4.06 dB on a synthetic and a realistic imbalanced lung sound dataset, respectively. For WiFi-based human sensing, SIDA designs a cross-domain WiFi-based human identification model irrespective of walking trajectory. A specific trajectory along which a group of people walks in a realistic testing environment is considered the minority domain, and several other trajectories are stored at a server as the majority domain. Extensive experiments show that SIDA recognizes individuals with an average accuracy of 94.72% and significantly outperforms baselines on a highly imbalanced WiFi dataset in cross-domain human identification tasks.<\/jats:p>","DOI":"10.1145\/3610919","type":"journal-article","created":{"date-parts":[[2023,9,27]],"date-time":"2023-09-27T15:45:03Z","timestamp":1695829503000},"page":"1-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["SIDA"],"prefix":"10.1145","volume":"7","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9001-1931","authenticated-orcid":false,"given":"Jin","family":"Zhang","sequence":"first","affiliation":[{"name":"National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, Shenzhen, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5611-0511","authenticated-orcid":false,"given":"Yuyan","family":"Dai","sequence":"additional","affiliation":[{"name":"National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, Shenzhen, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9811-1694","authenticated-orcid":false,"given":"Jie","family":"Chen","sequence":"additional","affiliation":[{"name":"College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, 
China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0293-0781","authenticated-orcid":false,"given":"Chengwen","family":"Luo","sequence":"additional","affiliation":[{"name":"National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, Shenzhen, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0781-9655","authenticated-orcid":false,"given":"Bo","family":"Wei","sequence":"additional","affiliation":[{"name":"School of Computing, Newcastle University, Newcastle, United Kingdom"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3529-2640","authenticated-orcid":false,"given":"Victor C. M.","family":"Leung","sequence":"additional","affiliation":[{"name":"College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2208-962X","authenticated-orcid":false,"given":"Jianqiang","family":"Li","sequence":"additional","affiliation":[{"name":"National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, Shenzhen, China"}]}],"member":"320","published-online":{"date-parts":[[2023,9,27]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.28978\/nesciences.349282"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/3494994"},{"key":"e_1_2_1_3_1","volume-title":"TransferSense: towards environment independent and one-shot wifi sensing. Personal and Ubiquitous Computing","author":"Bu Qirong","year":"2021","unstructured":"Qirong Bu, Xingxia Ming, Jingzhao Hu, Tuo Zhang, Jun Feng, and Jing Zhang. 2021. TransferSense: towards environment independent and one-shot wifi sensing. Personal and Ubiquitous Computing (2021), 1--19."},{"key":"e_1_2_1_4_1","doi-asserted-by":"crossref","unstructured":"Aggelina Chatziagapi Georgios Paraskevopoulos Dimitris Sgouropoulos Georgios Pantazopoulos Malvina Nikandrou Theodoros Giannakopoulos Athanasios Katsamanis Alexandros Potamianos and Shrikanth Narayanan. 2019. 
Data Augmentation Using GANs for Speech Emotion Recognition. In Interspeech. 171--175.","DOI":"10.21437\/Interspeech.2019-2561"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.5555\/1622407.1622416"},{"key":"e_1_2_1_6_1","volume-title":"Workshop on learning from imbalanced datasets II","volume":"11","author":"Drummond Chris","year":"2003","unstructured":"Chris Drummond, Robert C Holte, et al. 2003. C4.5, class imbalance, and cost sensitivity: why under-sampling beats over-sampling. In Workshop on learning from imbalanced datasets II, Vol. 11. Citeseer, 1--8."},{"key":"e_1_2_1_7_1","volume-title":"David SC Hui, et al","author":"Hu Yu","year":"2020","unstructured":"Wei-jie Guan, Zheng-yi Ni, Yu Hu, Wen-hua Liang, Chun-quan Ou, Jian-xing He, Lei Liu, Hong Shan, Chun-liang Lei, David SC Hui, et al. 2020. Clinical characteristics of coronavirus disease 2019 in China. New England journal of medicine 382, 18 (2020), 1708--1720."},{"key":"e_1_2_1_8_1","volume-title":"Savitzky-Golay filter for denoising lung sound. Brazilian Archives of Biology and Technology 61","author":"Haider Nishi Shahnaj","year":"2018","unstructured":"Nishi Shahnaj Haider, R Periyasamy, Deepak Joshi, and BK Singh. 2018. Savitzky-Golay filter for denoising lung sound. Brazilian Archives of Biology and Technology 61 (2018)."},{"key":"e_1_2_1_9_1","volume-title":"An Improved Lung Sound De-noising Method by Wavelet Packet Transform with Pso-Based Threshold Selection. Intelligent Automation & Soft Computing","author":"He Qing-Hua","year":"2016","unstructured":"Qing-Hua He, Bin Yu, Xin Hong, Bo Lv, Tao Liu, Jian Ran, and Yu-Tian Bi. 2016. An Improved Lung Sound De-noising Method by Wavelet Packet Transform with Pso-Based Threshold Selection. 
Intelligent Automation & Soft Computing (2016), 1--7."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2007.366913"},{"key":"e_1_2_1_11_1","volume-title":"A multi-discriminator cyclegan for unsupervised non-parallel speech domain adaptation. arXiv preprint arXiv:1804.00522","author":"Hosseini-Asl Ehsan","year":"2018","unstructured":"Ehsan Hosseini-Asl, Yingbo Zhou, Caiming Xiong, and Richard Socher. 2018. A multi-discriminator cyclegan for unsupervised non-parallel speech domain adaptation. arXiv preprint arXiv:1804.00522 (2018)."},{"key":"e_1_2_1_12_1","doi-asserted-by":"crossref","unstructured":"Yinghui Huang Sijun Meng Yi Zhang Shuisheng Wu Yu Zhang Yawei Zhang Yixiang Ye Qifeng Wei Niangui Zhao Jianping Jiang et al. 2020. The respiratory sound features of COVID-19 patients fill gaps between clinical data and screening methods. MedRxiv (2020).","DOI":"10.1101\/2020.04.07.20051060"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3412382.3458265"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP40776.2020.9053726"},{"key":"e_1_2_1_15_1","volume-title":"Self-Supervised Learning based Monaural Speech Enhancement with Complex-Cycle-Consistent. arXiv preprint arXiv:2112.11142","author":"Li Yi","year":"2021","unstructured":"Yi Li, Yang Sun, and Syed Mohsen Naqvi. 2021. Self-Supervised Learning based Monaural Speech Enhancement with Complex-Cycle-Consistent. arXiv preprint arXiv:2112.11142 (2021)."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.3390\/s22155677"},{"key":"e_1_2_1_17_1","volume-title":"Unsupervised image-to-image translation networks. Advances in neural information processing systems 30","author":"Liu Ming-Yu","year":"2017","unstructured":"Ming-Yu Liu, Thomas Breuel, and Jan Kautz. 2017. Unsupervised image-to-image translation networks. 
Advances in neural information processing systems 30 (2017)."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2018.8462116"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/TAP.2020.3026447"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.7150\/ijbs.33274"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.2169\/internalmedicine.5565-20"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3131898"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW53098.2021.00299"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/RadarConf2147009.2021.9455204"},{"key":"e_1_2_1_25_1","volume-title":"International Conference on Biomedical and Health Informatics. Springer, 33--37","author":"Rocha BM","year":"2017","unstructured":"BM Rocha, Dimitris Filos, L Mendes, I Vogiatzis, E Perantoni, E Kaimakamis, P Natsiavas, Ana Oliveira, C J\u00e1come, A Marques, et al. 2017. A respiratory sound database for the development of automated classification. In International Conference on Biomedical and Health Informatics. Springer, 33--37."},{"key":"e_1_2_1_26_1","volume-title":"Xgan: Unsupervised image-to-image translation for many-to-many mappings. In Domain Adaptation for Visual Understanding","author":"Royer Am\u00e9lie","year":"2020","unstructured":"Am\u00e9lie Royer, Konstantinos Bousmalis, Stephan Gouws, Fred Bertsch, Inbar Mosseri, Forrester Cole, and Kevin Murphy. 2020. Xgan: Unsupervised image-to-image translation for many-to-many mappings. In Domain Adaptation for Visual Understanding. Springer, 33--49."},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1088\/1757-899X\/190\/1\/012040"},{"key":"e_1_2_1_28_1","unstructured":"Hao Wang Hao He and Dina Katabi. 2020. Continuously Indexed Domain Adaptation (ICML'20). JMLR.org Article 918 10 pages."},{"key":"e_1_2_1_29_1","volume-title":"Self-supervised Learning for Speech Enhancement. 
arXiv preprint arXiv:2006.10388","author":"Wang Yu-Che","year":"2020","unstructured":"Yu-Che Wang, Shrikant Venkataramani, and Paris Smaragdis. 2020. Self-supervised Learning for Speech Enhancement. arXiv preprint arXiv:2006.10388 (2020)."},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2023.03.009"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00581"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/JIOT.2019.2936580"},{"key":"e_1_2_1_33_1","volume-title":"Proceedings, Part XX. Springer, 57--75","author":"Yang Yuzhe","year":"2022","unstructured":"Yuzhe Yang, Hao Wang, and Dina Katabi. 2022. On multi-domain long-tailed recognition, imbalanced domain generalization and beyond. In Computer Vision--ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23--27, 2022, Proceedings, Part XX. Springer, 57--75."},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMC.2022.3221902"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP43922.2022.9747267"},{"key":"e_1_2_1_36_1","volume-title":"2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). IEEE, 523--529","author":"Yu Guochen","year":"2021","unstructured":"Guochen Yu, Yutian Wang, Chengshi Zheng, Hui Wang, and Qin Zhang. 2021. Cyclegan-based non-parallel speech enhancement with an adaptive attention-in-attention mechanism. In 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). IEEE, 523--529."},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2023.3234778"},{"key":"e_1_2_1_38_1","volume-title":"International conference on machine learning. PMLR, 7354--7363","author":"Zhang Han","year":"2019","unstructured":"Han Zhang, Ian Goodfellow, Dimitris Metaxas, and Augustus Odena. 2019. Self-attention generative adversarial networks. 
In International conference on machine learning. PMLR, 7354--7363."},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/3550306"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/DCOSS.2016.30"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/JIOT.2020.3040782"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/JIOT.2020.3026732"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/3144457.3144467"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/3478093"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i4.16458"},{"key":"e_1_2_1_46_1","doi-asserted-by":"crossref","unstructured":"Yi Zhang Yue Zheng Guidong Zhang Kun Qian Chen Qian and Zheng Yang. 2020. GaitID: Robust Wi-Fi Based Gait Recognition. In Wireless Algorithms Systems and Applications Dongxiao Yu Falko Dressler and Jiguo Yu (Eds.). Springer International Publishing Cham 730--742.","DOI":"10.1007\/978-3-030-59016-1_60"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v36i3.20253"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.244"}],"container-title":["Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3610919","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3610919","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,28]],"date-time":"2025-07-28T16:25:21Z","timestamp":1753719921000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3610919"}},"subtitle":["Self-Supervised Imbalanced Domain Adaptation for Sound Enhancement and Cross-Domain WiFi 
Sensing"],"short-title":[],"issued":{"date-parts":[[2023,9,27]]},"references-count":48,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2023,9,27]]}},"alternative-id":["10.1145\/3610919"],"URL":"https:\/\/doi.org\/10.1145\/3610919","relation":{},"ISSN":["2474-9567"],"issn-type":[{"value":"2474-9567","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,9,27]]},"assertion":[{"value":"2023-09-27","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}