{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,13]],"date-time":"2026-05-13T02:08:58Z","timestamp":1778638138018,"version":"3.51.4"},"publisher-location":"New York, NY, USA","reference-count":33,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,10,10]],"date-time":"2022-10-10T00:00:00Z","timestamp":1665360000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"National Research Foundation of Korea(NRF) grant funded by the Korea government(MSIP)","award":["2022R1F1A1064273"],"award-info":[{"award-number":["2022R1F1A1064273"]}]},{"DOI":"10.13039\/501100003725","name":"National Research Foundation of Korea","doi-asserted-by":"publisher","award":["2020R1C1C1A01013020"],"award-info":[{"award-number":["2020R1C1C1A01013020"]}],"id":[{"id":"10.13039\/501100003725","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,10,14]]},"DOI":"10.1145\/3552466.3556533","type":"proceedings-article","created":{"date-parts":[[2022,10,1]],"date-time":"2022-10-01T12:27:26Z","timestamp":1664627246000},"page":"9-17","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["Low-quality Fake Audio Detection through Frequency Feature Masking"],"prefix":"10.1145","author":[{"given":"Il-Youp","family":"Kwak","sequence":"first","affiliation":[{"name":"Chung-Ang University, Seoul, Republic of Korea"}]},{"given":"Sunmook","family":"Choi","sequence":"additional","affiliation":[{"name":"Korea University, Seoul, Republic of Korea"}]},{"given":"Jonghoon","family":"Yang","sequence":"additional","affiliation":[{"name":"Chung-Ang University, Seoul, Republic of Korea"}]},{"given":"Yerin","family":"Lee","sequence":"additional","affiliation":[{"name":"Chung-Ang University, Seoul, Republic of Korea"}]},{"given":"Soyul","family":"Han","sequence":"additional","affiliation":[{"name":"Chung-Ang University, Seoul, Republic of Korea"}]},{"given":"Seungsang","family":"Oh","sequence":"additional","affiliation":[{"name":"Korea University, Seoul, Republic of Korea"}]}],"member":"320","published-online":{"date-parts":[[2022,10,10]]},"reference":[{"key":"e_1_3_2_2_1_1","volume-title":"Subspectral Normalization for Neural Audio Data Processing. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE","author":"Chang Simyung","year":"2021","unstructured":"Simyung Chang , Hyoungwoo Park , Janghoon Cho , Hyunsin Park , Sungrack Yun , and Kyuwoong Hwang . 2021 . Subspectral Normalization for Neural Audio Data Processing. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE , Toronto, 850--854. Simyung Chang, Hyoungwoo Park, Janghoon Cho, Hyunsin Park, Sungrack Yun, and Kyuwoong Hwang. 2021. Subspectral Normalization for Neural Audio Data Processing. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, Toronto, 850--854."},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2022-657"},{"key":"e_1_3_2_2_3_1","volume-title":"Xception: Deep Learning with Depthwise Separable Convolutions. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Computer Society","author":"Chollet Fran\u00e7ois","year":"2017","unstructured":"Fran\u00e7ois Chollet . 2017 . Xception: Deep Learning with Depthwise Separable Convolutions. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Computer Society , Honolulu , 1800--1807. https:\/\/doi.org\/10.1109\/CVPR.2017.195 10.1109\/CVPR.2017.195 Fran\u00e7ois Chollet. 2017. Xception: Deep Learning with Depthwise Separable Convolutions. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Computer Society, Honolulu, 1800--1807. https:\/\/doi.org\/10.1109\/CVPR.2017.195"},{"key":"e_1_3_2_2_4_1","unstructured":"Terrance DeVries and Graham W Taylor. 2017. Improved regularization of convolutional neural networks with cutout. 8 pages.  Terrance DeVries and Graham W Taylor. 2017. Improved regularization of convolutional neural networks with cutout. 8 pages."},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/BTAS.2015.7358783"},{"key":"e_1_3_2_2_6_1","volume-title":"MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. CoRR","author":"Howard Andrew G.","year":"2017","unstructured":"Andrew G. Howard , Menglong Zhu , Bo Chen , Dmitry Kalenichenko , Weijun Wang , Tobias Weyand , Marco Andreetto , and Hartwig Adam . 2017. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. CoRR , Vol. abs\/ 1704 .04861 ( 2017 ), 1--9. arxiv: 1704.04861 Andrew G. Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, and Hartwig Adam. 2017. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. CoRR , Vol. abs\/1704.04861 (2017), 1--9. arxiv: 1704.04861"},{"key":"e_1_3_2_2_7_1","volume-title":"Bong-Jin Lee, Ha-Jin Yu, and Nicholas Evans.","author":"Heo Hee-Soo","year":"2022","unstructured":"Jee-weon Jung, Hee-Soo Heo , Hemlata Tak , Hye-jin Shim , Joon Son Chung , Bong-Jin Lee, Ha-Jin Yu, and Nicholas Evans. 2022 . AASIST : Audio Anti-Spoofing Using Integrated Spectro-Temporal Graph Attention Networks. In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, Brno , 6367--6371. https:\/\/doi.org\/10.1109\/ICASSP43922.2022.9747766 10.1109\/ICASSP43922.2022.9747766 Jee-weon Jung, Hee-Soo Heo, Hemlata Tak, Hye-jin Shim, Joon Son Chung, Bong-Jin Lee, Ha-Jin Yu, and Nicholas Evans. 2022. AASIST: Audio Anti-Spoofing Using Integrated Spectro-Temporal Graph Attention Networks. In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, Brno, 6367--6371. https:\/\/doi.org\/10.1109\/ICASSP43922.2022.9747766"},{"key":"e_1_3_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2021-383"},{"key":"e_1_3_2_2_9_1","volume-title":"Technical Report. DCASE2021 Challenge.","author":"Kim Byeonggeun","year":"2021","unstructured":"Byeonggeun Kim , Seunghan Yang , Jangho Kim , and Simyung Chang . 2021 c. QTI Submission to DCASE 2021: Residual Normalization for Device-Imbalanced Acoustic Scene Classification with Efficient Design . Technical Report. DCASE2021 Challenge. Byeonggeun Kim, Seunghan Yang, Jangho Kim, and Simyung Chang. 2021c. QTI Submission to DCASE 2021: Residual Normalization for Device-Imbalanced Acoustic Scene Classification with Efficient Design. Technical Report. DCASE2021 Challenge."},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2021-103"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2017-1111"},{"key":"e_1_3_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2021.3082307"},{"key":"e_1_3_2_2_13_1","volume-title":"ResMax: Detecting Voice Spoofing Attacks with Residual Network and Max Feature Map. In 25th International Conference on Pattern Recognition (ICPR). IEEE Computer Society, Milan, 4837--4844","author":"Kwak Il-Youp","year":"2021","unstructured":"Il-Youp Kwak , Sungsu Kwag , Junhee Lee , Jun Ho Huh , Choong-Hoon Lee , Youngbae Jeon , Jeonghwan Hwang , and Ji Won Yoon . 2021 . ResMax: Detecting Voice Spoofing Attacks with Residual Network and Max Feature Map. In 25th International Conference on Pattern Recognition (ICPR). IEEE Computer Society, Milan, 4837--4844 . Il-Youp Kwak, Sungsu Kwag, Junhee Lee, Jun Ho Huh, Choong-Hoon Lee, Youngbae Jeon, Jeonghwan Hwang, and Ji Won Yoon. 2021. ResMax: Detecting Voice Spoofing Attacks with Residual Network and Max Feature Map. In 25th International Conference on Pattern Recognition (ICPR). IEEE Computer Society, Milan, 4837--4844."},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2017-360"},{"key":"e_1_3_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2019-1768"},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.25080\/Majora-7b98e3ed-003"},{"key":"e_1_3_2_2_17_1","volume-title":"Proceedings of the 27th International Conference on International Conference on Machine Learning","author":"Nair Vinod","unstructured":"Vinod Nair and Geoffrey E. Hinton . 2010. Rectified Linear Units Improve Restricted Boltzmann Machines . In Proceedings of the 27th International Conference on International Conference on Machine Learning ( Haifa, Israel) (ICML'10). Omnipress, Madison, WI, USA, 807--814. Vinod Nair and Geoffrey E. Hinton. 2010. Rectified Linear Units Improve Restricted Boltzmann Machines. In Proceedings of the 27th International Conference on International Conference on Machine Learning (Haifa, Israel) (ICML'10). Omnipress, Madison, WI, USA, 807--814."},{"key":"e_1_3_2_2_18_1","volume-title":"Proc. Interspeech","author":"Park Daniel S.","year":"2019","unstructured":"Daniel S. Park , William Chan , Yu Zhang , Chung-Cheng Chiu , Barret Zoph , Ekin Dogus Cubuk , and Quoc V. Le . 2019. SpecAugment: A Simple Augmentation Method for Automatic Speech Recognition . In Proc. Interspeech 2019 . ISCA, Graz, 2613--2617. Daniel S. Park, William Chan, Yu Zhang, Chung-Cheng Chiu, Barret Zoph, Ekin Dogus Cubuk, and Quoc V. Le. 2019. SpecAugment: A Simple Augmentation Method for Automatic Speech Recognition. In Proc. Interspeech 2019. ISCA, Graz, 2613--2617."},{"key":"e_1_3_2_2_19_1","volume-title":"Swish: a self-gated activation function. arXiv preprint arXiv:1710.05941","author":"Ramachandran Prajit","year":"2017","unstructured":"Prajit Ramachandran , Barret Zoph , and Quoc V Le. 2017. Swish: a self-gated activation function. arXiv preprint arXiv:1710.05941 , Vol. 7 , 1 ( 2017 ), 5. Prajit Ramachandran, Barret Zoph, and Quoc V Le. 2017. Swish: a self-gated activation function. arXiv preprint arXiv:1710.05941 , Vol. 7, 1 (2017), 5."},{"key":"e_1_3_2_2_20_1","volume-title":"Aishell-3: A multi-speaker mandarin tts corpus and the baselines. , 5 pages.arxiv","author":"Shi Yao","year":"2010","unstructured":"Yao Shi , Hui Bu , Xin Xu , Shaoji Zhang , and Ming Li. 2020. Aishell-3: A multi-speaker mandarin tts corpus and the baselines. , 5 pages.arxiv : 2010 .11567 Yao Shi, Hui Bu, Xin Xu, Shaoji Zhang, and Ming Li. 2020. Aishell-3: A multi-speaker mandarin tts corpus and the baselines. , 5 pages.arxiv: 2010.11567"},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP39728.2021.9414234"},{"key":"e_1_3_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2019-2249"},{"key":"e_1_3_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.21437\/ASVSPOOF.2021-10"},{"key":"e_1_3_2_2_24_1","unstructured":"Christophe Veaux Junichi Yamagishi Kirsten MacDonald etal 2017. CSTR VCTK corpus: English multi-speaker corpus for CSTR voice cloning toolkit. https:\/\/datashare.ed.ac.uk\/handle\/10283\/2651  Christophe Veaux Junichi Yamagishi Kirsten MacDonald et al. 2017. CSTR VCTK corpus: English multi-speaker corpus for CSTR voice cloning toolkit. https:\/\/datashare.ed.ac.uk\/handle\/10283\/2651"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2020-1011"},{"key":"e_1_3_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIFS.2018.2833032"},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2018.07.033"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2015-462"},{"key":"e_1_3_2_2_29_1","doi-asserted-by":"publisher","DOI":"10.21437\/ASVSPOOF.2021-8"},{"key":"e_1_3_2_2_30_1","volume-title":"26th International Conference on Pattern Recognition (ICPR)","author":"Yang Jonghoon","unstructured":"Jonghoon Yang , Sunmook Choi , Yerin Lee , Seungsang Oh , and Il-Youp Kwak . 2022. Light-weight Frequency Information Aware Neural Network Architecture for Voice Spoofing Detection . In 26th International Conference on Pattern Recognition (ICPR) . IEEE Computer Society , Montreal Quebec . Jonghoon Yang, Sunmook Choi, Yerin Lee, Seungsang Oh, and Il-Youp Kwak. 2022. Light-weight Frequency Information Aware Neural Network Architecture for Voice Spoofing Detection. In 26th International Conference on Pattern Recognition (ICPR). IEEE Computer Society, Montreal Quebec."},{"key":"e_1_3_2_2_31_1","volume-title":"ADD 2022: the First Audio Deep Synthesis Detection Challenge. In 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, IEEE","author":"Yi Jiangyan","year":"2022","unstructured":"Jiangyan Yi , Ruibo Fu , Jianhua Tao , Shuai Nie , Haoxin Ma , Chenglong Wang , Tao Wang , Zhengkun Tian , Ye Bai , Cunhan Fan , Shan Liang , Shiming Wang , Shuai Zhang , Xinrui Yan , Le Xu , Zhengqi Wen , and Haizhou Li . 2022 . ADD 2022: the First Audio Deep Synthesis Detection Challenge. In 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, IEEE , Singapore, 9216--9220. Jiangyan Yi, Ruibo Fu, Jianhua Tao, Shuai Nie, Haoxin Ma, Chenglong Wang, Tao Wang, Zhengkun Tian, Ye Bai, Cunhan Fan, Shan Liang, Shiming Wang, Shuai Zhang, Xinrui Yan, Le Xu, Zhengqi Wen, and Haizhou Li. 2022. ADD 2022: the First Audio Deep Synthesis Detection Challenge. In 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, IEEE, Singapore, 9216--9220."},{"key":"e_1_3_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2019.00612"},{"key":"e_1_3_2_2_33_1","unstructured":"Hongyi Zhang Moustapha Cisse Yann N Dauphin and David Lopez-Paz. 2017. mixup: Beyond empirical risk minimization. 13 pages.  Hongyi Zhang Moustapha Cisse Yann N Dauphin and David Lopez-Paz. 2017. mixup: Beyond empirical risk minimization. 13 pages."}],"event":{"name":"MM '22: The 30th ACM International Conference on Multimedia","location":"Lisboa Portugal","acronym":"MM '22","sponsor":["SIGMM ACM Special Interest Group on Multimedia"]},"container-title":["Proceedings of the 1st International Workshop on Deepfake Detection for Audio Multimedia"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3552466.3556533","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3552466.3556533","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:49:25Z","timestamp":1750182565000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3552466.3556533"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,10]]},"references-count":33,"alternative-id":["10.1145\/3552466.3556533","10.1145\/3552466"],"URL":"https:\/\/doi.org\/10.1145\/3552466.3556533","relation":{},"subject":[],"published":{"date-parts":[[2022,10,10]]},"assertion":[{"value":"2022-10-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}