{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,22]],"date-time":"2025-08-22T05:10:35Z","timestamp":1755839435920,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":90,"publisher":"ACM","license":[{"start":{"date-parts":[[2022,11,7]],"date-time":"2022-11-07T00:00:00Z","timestamp":1667779200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100003725","name":"National Research Foundation of Korea","doi-asserted-by":"publisher","award":["NRF-2021R1I1A2059696"],"award-info":[{"award-number":["NRF-2021R1I1A2059696"]}],"id":[{"id":"10.13039\/501100003725","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,11,7]]},"DOI":"10.1145\/3548606.3560572","type":"proceedings-article","created":{"date-parts":[[2022,11,7]],"date-time":"2022-11-07T11:41:28Z","timestamp":1667821288000},"page":"1933-1946","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Overo"],"prefix":"10.1145","author":[{"given":"Jaemin","family":"Lim","sequence":"first","affiliation":[{"name":"Hanyang University, Ansan, Republic of Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kiyeon","family":"Kim","sequence":"additional","affiliation":[{"name":"Hanyang University, Ansan, Republic of Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hyunwoo","family":"Yu","sequence":"additional","affiliation":[{"name":"Hanyang University, Ansan, Republic of Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Suk-Bok","family":"Lee","sequence":"additional","affiliation":[{"name":"Hanyang University, Ansan, Republic of Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,11,7]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Proc. International Conference on Neural Information Processing Systems (NIPS)","author":"Jia Ye","year":"2018","unstructured":"Ye Jia , Yu Zhang , Ron J. Weiss , Quan Wang , Jonathan Shen , Fei Ren , Zhifeng Chen , Patrick Nguyen , Ruoming Pang , Ignacio Lopez Moreno , and Yonghui Wu . Transfer learning from speaker verification to multispeaker text-to-speech synthesis . In Proc. International Conference on Neural Information Processing Systems (NIPS) , 2018 . Ye Jia, Yu Zhang, Ron J. Weiss, Quan Wang, Jonathan Shen, Fei Ren, Zhifeng Chen, Patrick Nguyen, Ruoming Pang, Ignacio Lopez Moreno, and Yonghui Wu. Transfer learning from speaker verification to multispeaker text-to-speech synthesis. In Proc. International Conference on Neural Information Processing Systems (NIPS), 2018."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.csl.2018.01.001"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2020-1283"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3384419.3430727"},{"key":"e_1_3_2_1_5_1","volume-title":"Proc. USENIX NSDI","author":"Ahmed Shimaa","year":"2020","unstructured":"Shimaa Ahmed , Amrita Roy Chowdhury , Kassem Fawaz , and Parmesh Ramanathan . Pr\u03b5\u03b5ch : A system for privacy-preserving speech transcription . In Proc. USENIX NSDI , 2020 . Shimaa Ahmed, Amrita Roy Chowdhury, Kassem Fawaz, and Parmesh Ramanathan. Pr\u03b5\u03b5ch: A system for privacy-preserving speech transcription. In Proc. USENIX NSDI, 2020."},{"key":"e_1_3_2_1_6_1","volume-title":"Speech sanitizer: Speech content desensitization and voice anonymization","author":"Qian Jianwei","year":"2019","unstructured":"Jianwei Qian , Haohua Du , Jiahui Hou , Linlin Chen , Taeho Jung , and Xiangyang Li . Speech sanitizer: Speech content desensitization and voice anonymization . In IEEE Transactions on Dependable and Secure Computing , 2019 . Jianwei Qian, Haohua Du, Jiahui Hou, Linlin Chen, Taeho Jung, and Xiangyang Li. Speech sanitizer: Speech content desensitization and voice anonymization. In IEEE Transactions on Dependable and Secure Computing, 2019."},{"key":"e_1_3_2_1_7_1","volume-title":"Proc. ACM SenSys","author":"Qian Jianwei","year":"2018","unstructured":"Jianwei Qian , Haohua Du , Jiahui Hou , Linlin Chen , Taeho Jung , and Xiang-Yang Li. Hidebehind : Enjoy voice input with voiceprint unclonability and anonymity . In Proc. ACM SenSys , 2018 . Jianwei Qian, Haohua Du, Jiahui Hou, Linlin Chen, Taeho Jung, and Xiang-Yang Li. Hidebehind: Enjoy voice input with voiceprint unclonability and anonymity. In Proc. ACM SenSys, 2018."},{"key":"e_1_3_2_1_8_1","volume-title":"Speaker anonymisation using the McAdams coefficient. In arXiv:2011.01130","author":"Patino Jose","year":"2020","unstructured":"Jose Patino , Natalia Tomashenko , Massimiliano Todisco , Andreas Nautsch , and Nicholas Evans . Speaker anonymisation using the McAdams coefficient. In arXiv:2011.01130 , 2020 . Jose Patino, Natalia Tomashenko, Massimiliano Todisco, Andreas Nautsch, and Nicholas Evans. Speaker anonymisation using the McAdams coefficient. In arXiv:2011.01130, 2020."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/INFOCOM.2018.8486250"},{"key":"e_1_3_2_1_10_1","volume-title":"Internet X.509 public key infrastructure time-stamp protocol (TSP). RFC 3161","author":"Adams C.","year":"2001","unstructured":"C. Adams , P. Cain , D. Pinkas , and R. Zuccherato . Internet X.509 public key infrastructure time-stamp protocol (TSP). RFC 3161 , 2001 . C. Adams, P. Cain, D. Pinkas, and R. Zuccherato. Internet X.509 public key infrastructure time-stamp protocol (TSP). RFC 3161, 2001."},{"key":"e_1_3_2_1_11_1","unstructured":"Tacotron: An end-to-end speech synthesis system by Google. https:\/\/google. github.io\/tacotron\/.  Tacotron: An end-to-end speech synthesis system by Google. https:\/\/google. github.io\/tacotron\/."},{"key":"e_1_3_2_1_12_1","unstructured":"Text-to-Speech: Lifelike speech synthesis (Google Cloud). https:\/\/cloud.google. com\/text-to-speech\/.  Text-to-Speech: Lifelike speech synthesis (Google Cloud). https:\/\/cloud.google. com\/text-to-speech\/."},{"key":"e_1_3_2_1_13_1","unstructured":"The best audio editing software of 2021. https:\/\/www.pcmag.com\/picks\/the-bestaudio- editing-software\/.  The best audio editing software of 2021. https:\/\/www.pcmag.com\/picks\/the-bestaudio- editing-software\/."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2019-2899"},{"key":"e_1_3_2_1_15_1","unstructured":"New advances in speaker diarization (IBM Research Blog. https:\/\/www.ibm.com\/ blogs\/research\/2020\/10\/new-advances-in-speaker-diarization\/.  New advances in speaker diarization (IBM Research Blog. https:\/\/www.ibm.com\/ blogs\/research\/2020\/10\/new-advances-in-speaker-diarization\/."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2013.6639170"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2014.6854370"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASRU.2009.5372931"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2016.7472799"},{"key":"e_1_3_2_1_20_1","volume-title":"Standard","author":"American National Standards Institute (ANSI) X9.","year":"2005","unstructured":"American National Standards Institute (ANSI) X9. 95 Trusted time stamp management and security . Standard , 2005 . American National Standards Institute (ANSI) X9.95 Trusted time stamp management and security. Standard, 2005."},{"key":"e_1_3_2_1_21_1","unstructured":"Redaction - IBM API. https:\/\/www.ibm.com\/docs\/en\/api-connect\/5.0.x?topic= policies-redaction-redact\/.  Redaction - IBM API. https:\/\/www.ibm.com\/docs\/en\/api-connect\/5.0.x?topic= policies-redaction-redact\/."},{"key":"e_1_3_2_1_22_1","unstructured":"Classification redaction and de-identification - Google API. https:\/\/cloud.google. com\/dlp\/docs\/classification-redaction\/.  Classification redaction and de-identification - Google API. https:\/\/cloud.google. com\/dlp\/docs\/classification-redaction\/."},{"key":"e_1_3_2_1_23_1","unstructured":"Automatic content redaction - Amazon API. https:\/\/docs.aws.amazon.com\/ transcribe\/latest\/dg\/content-redaction.html\/.  Automatic content redaction - Amazon API. https:\/\/docs.aws.amazon.com\/ transcribe\/latest\/dg\/content-redaction.html\/."},{"key":"e_1_3_2_1_24_1","unstructured":"Veritone - Audio redact software. https:\/\/www.veritone.com\/applications\/redact\/.  Veritone - Audio redact software. https:\/\/www.veritone.com\/applications\/redact\/."},{"key":"e_1_3_2_1_25_1","unstructured":"Vidizmo - Automatic audio redaction software. https:\/\/www.vidizmo.com\/vidizmo-artificial-intelligence-solutions\/redaction\/.  Vidizmo - Automatic audio redaction software. https:\/\/www.vidizmo.com\/vidizmo-artificial-intelligence-solutions\/redaction\/."},{"key":"e_1_3_2_1_26_1","unstructured":"CaseGuard - Audio redaction software. https:\/\/caseguard.com\/audio-redactionsoftware\/.  CaseGuard - Audio redaction software. https:\/\/caseguard.com\/audio-redactionsoftware\/."},{"key":"e_1_3_2_1_27_1","unstructured":"Audacity - Open source audio software. https:\/\/www.audacityteam.org\/.  Audacity - Open source audio software. https:\/\/www.audacityteam.org\/."},{"key":"e_1_3_2_1_28_1","unstructured":"Axon Redaction Studio. https:\/\/global.axon.com\/info\/redaction-studio\/.  Axon Redaction Studio. https:\/\/global.axon.com\/info\/redaction-studio\/."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.1991.150483"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSA.2005.848881"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.1996.541103"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.21437\/Eurospeech.1999-553"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11265-005-4151-3"},{"key":"e_1_3_2_1_34_1","volume-title":"International Workshop on Information Hiding","author":"\u00e7ak M Kivan\u00e7","year":"2001","unstructured":"M Kivan\u00e7 M\"h \u00e7ak and Ramarathnam Venkatesan . A perceptual audio hashing algorithm: a tool for robust audio identification and information hiding . In International Workshop on Information Hiding , 2001 . M Kivan\u00e7 M\"h\u00e7ak and Ramarathnam Venkatesan. A perceptual audio hashing algorithm: a tool for robust audio identification and information hiding. In International Workshop on Information Hiding, 2001."},{"key":"e_1_3_2_1_35_1","unstructured":"2021 California rules of court. https:\/\/www.courts.ca.gov\/cms\/rules\/index.cfm? title=one&linkid=rule1_201.  2021 California rules of court. https:\/\/www.courts.ca.gov\/cms\/rules\/index.cfm? title=one&linkid=rule1_201."},{"key":"e_1_3_2_1_36_1","unstructured":"Electronic audio recordings presented into evidence. https:\/\/www.wvnb.uscourts. gov\/sites\/wvnb\/files\/Electronic%20Audio%20Recordings%20Presented%20or% 20Offered%20into%20Evidence.pdf.  Electronic audio recordings presented into evidence. https:\/\/www.wvnb.uscourts. gov\/sites\/wvnb\/files\/Electronic%20Audio%20Recordings%20Presented%20or% 20Offered%20into%20Evidence.pdf."},{"key":"e_1_3_2_1_37_1","unstructured":"Separating different speakers in an audio recording - Google API. https:\/\/cloud. google.com\/speech-to-text\/docs\/multiple-voices\/.  Separating different speakers in an audio recording - Google API. https:\/\/cloud. google.com\/speech-to-text\/docs\/multiple-voices\/."},{"key":"e_1_3_2_1_38_1","unstructured":"Identifying speakers (speaker diarization) - Amazon API. https:\/\/docs.aws. amazon.com\/transcribe\/latest\/dg\/diarization.html\/.  Identifying speakers (speaker diarization) - Amazon API. https:\/\/docs.aws. amazon.com\/transcribe\/latest\/dg\/diarization.html\/."},{"key":"e_1_3_2_1_39_1","unstructured":"Speaker diarization with IBM Watson Speech-to-Text API. https:\/\/www.ibm. com\/cloud\/watson-speech-to-text\/.  Speaker diarization with IBM Watson Speech-to-Text API. https:\/\/www.ibm. com\/cloud\/watson-speech-to-text\/."},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2013.6639066"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCSLP.2010.5684886"},{"key":"e_1_3_2_1_42_1","volume-title":"Proc. IEEE International Symposium on Signal Processing and Information Technology","author":"Sundermann D.","year":"2003","unstructured":"D. Sundermann and H. Ney . VTLN-based voice conversion . In Proc. IEEE International Symposium on Signal Processing and Information Technology , 2003 . D. Sundermann and H. Ney. VTLN-based voice conversion. In Proc. IEEE International Symposium on Signal Processing and Information Technology, 2003."},{"key":"e_1_3_2_1_43_1","author":"Savioja L.","year":"2000","unstructured":"L. Savioja and V. Valimaki . Reducing the dispersion error in the digital waveguide mesh using interpolation and frequency-warping techniques. In IEEE Transactions on Speech and Audio Processing , 2000 . L. Savioja and V. Valimaki. Reducing the dispersion error in the digital waveguide mesh using interpolation and frequency-warping techniques. In IEEE Transactions on Speech and Audio Processing, 2000.","journal-title":"In IEEE Transactions on Speech and Audio Processing"},{"key":"e_1_3_2_1_44_1","volume-title":"TIMIT Acoustic-Phonetic Continuous Speech Corpus","author":"Garofolo John","year":"1993","unstructured":"John Garofolo , Lori Lamel , William Fisher , Jonathan Fiscus , David Pallett , Nancy Dahlgren , and Victor Zue . TIMIT Acoustic-Phonetic Continuous Speech Corpus . In Linguistic Data Consortium , 1993 . John Garofolo, Lori Lamel, William Fisher, Jonathan Fiscus, David Pallett, Nancy Dahlgren, and Victor Zue. TIMIT Acoustic-Phonetic Continuous Speech Corpus. In Linguistic Data Consortium, 1993."},{"key":"e_1_3_2_1_45_1","volume-title":"Journal of Statistical Software","author":"Giorgino Toni","year":"2009","unstructured":"Toni Giorgino . Computing and visualizing dynamic time warping alignments in R: The dtw package . In Journal of Statistical Software , 2009 . Toni Giorgino. Computing and visualizing dynamic time warping alignments in R: The dtw package. In Journal of Statistical Software, 2009."},{"key":"e_1_3_2_1_46_1","volume-title":"The Journal of the Acoustical Society of America","author":"Ingo","year":"1998","unstructured":"Ingo R. Titze and DanielW. Martin. Principles of voice production . In The Journal of the Acoustical Society of America , 1998 . Ingo R. Titze and DanielW. Martin. Principles of voice production. In The Journal of the Acoustical Society of America, 1998."},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2019.8683760"},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2015.7178964"},{"key":"e_1_3_2_1_49_1","volume-title":"Proc. IEEE ICASSP","author":"Snyder David","year":"2018","unstructured":"David Snyder , Daniel Garcia-Romero , Gregory Sell , Daniel Povey , and Sanjeev Khudanpur . X-vectors : Robust DNN embeddings for speaker recognition . In Proc. IEEE ICASSP , 2018 . David Snyder, Daniel Garcia-Romero, Gregory Sell, Daniel Povey, and Sanjeev Khudanpur. X-vectors: Robust DNN embeddings for speaker recognition. In Proc. IEEE ICASSP, 2018."},{"key":"e_1_3_2_1_50_1","unstructured":"Librispeech ASR model - Kaldi ASR. https:\/\/github.com\/kaldi-asr\/kaldi\/.  Librispeech ASR model - Kaldi ASR. https:\/\/github.com\/kaldi-asr\/kaldi\/."},{"key":"e_1_3_2_1_51_1","unstructured":"LibriSpeech ASR corpus. https:\/\/www.openslr.org\/12\/.  LibriSpeech ASR corpus. https:\/\/www.openslr.org\/12\/."},{"key":"e_1_3_2_1_52_1","volume-title":"CSTR VCTK Corpus: English Multi-speaker Corpus for CSTR Voice Cloning Toolkit","author":"Yamagishi Junichi","year":"2019","unstructured":"Junichi Yamagishi , Christophe Veaux , and Kirsten MacDonald . CSTR VCTK Corpus: English Multi-speaker Corpus for CSTR Voice Cloning Toolkit . In University of Edinburgh. The Centre for Speech Technology Research (CSTR) , 2019 . Junichi Yamagishi, Christophe Veaux, and Kirsten MacDonald. CSTR VCTK Corpus: English Multi-speaker Corpus for CSTR Voice Cloning Toolkit. In University of Edinburgh. The Centre for Speech Technology Research (CSTR), 2019."},{"key":"e_1_3_2_1_53_1","unstructured":"WORLD - a high-quality speech analysis manipulation and synthesis system. https:\/\/github.com\/mmorise\/World\/.  WORLD - a high-quality speech analysis manipulation and synthesis system. https:\/\/github.com\/mmorise\/World\/."},{"key":"e_1_3_2_1_54_1","unstructured":"FAAC (freeware advanced audio coder). https:\/\/github.com\/knik0\/faac\/.  FAAC (freeware advanced audio coder). https:\/\/github.com\/knik0\/faac\/."},{"key":"e_1_3_2_1_55_1","unstructured":"FAAD2 (freeware advanced audio decoder). https:\/\/github.com\/knik0\/faad2\/.  FAAD2 (freeware advanced audio decoder). https:\/\/github.com\/knik0\/faad2\/."},{"key":"e_1_3_2_1_56_1","volume-title":"US secure hash algorithms (SHA and HMAC-SHA). RFC 4634","author":"Eastlake D.","year":"2006","unstructured":"D. Eastlake and P. Jones . US secure hash algorithms (SHA and HMAC-SHA). RFC 4634 , 2006 . D. Eastlake and P. Jones. US secure hash algorithms (SHA and HMAC-SHA). RFC 4634, 2006."},{"key":"e_1_3_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2012.2191285"},{"key":"e_1_3_2_1_58_1","unstructured":"Carnegie Mellon University PocketSphinx. https:\/\/cmusphinx.github.io\/.  Carnegie Mellon University PocketSphinx. https:\/\/cmusphinx.github.io\/."},{"key":"e_1_3_2_1_59_1","unstructured":"An audio library based on libsndfile CFFI and NumPy. https:\/\/pypi.org\/project\/ SoundFile\/.  An audio library based on libsndfile CFFI and NumPy. https:\/\/pypi.org\/project\/ SoundFile\/."},{"key":"e_1_3_2_1_60_1","unstructured":"Android Math library. https:\/\/developer.android.com\/reference\/java\/lang\/Math\/.  Android Math library. https:\/\/developer.android.com\/reference\/java\/lang\/Math\/."},{"key":"e_1_3_2_1_61_1","unstructured":"Swift Numerics. https:\/\/github.com\/apple\/swift-numerics\/.  Swift Numerics. https:\/\/github.com\/apple\/swift-numerics\/."},{"key":"e_1_3_2_1_62_1","unstructured":"oFono (free software project for mobile telephony). https:\/\/git.kernel.org\/pub\/ scm\/network\/ofono\/ofono.git\/.  oFono (free software project for mobile telephony). https:\/\/git.kernel.org\/pub\/ scm\/network\/ofono\/ofono.git\/."},{"key":"e_1_3_2_1_63_1","unstructured":"PulseAudio (sound server system). https:\/\/gitlab.freedesktop.org\/pulseaudio\/ pulseaudio\/.  PulseAudio (sound server system). https:\/\/gitlab.freedesktop.org\/pulseaudio\/ pulseaudio\/."},{"key":"e_1_3_2_1_64_1","unstructured":"Natural language framework - Apple. https:\/\/developer.apple.com\/ documentation\/naturallanguage\/.  Natural language framework - Apple. https:\/\/developer.apple.com\/ documentation\/naturallanguage\/."},{"key":"e_1_3_2_1_65_1","unstructured":"pyAudioAnalysis - a Python library for audio feature extraction classification segmentation and applications. https:\/\/github.com\/tyiannak\/pyAudioAnalysis\/.  pyAudioAnalysis - a Python library for audio feature extraction classification segmentation and applications. https:\/\/github.com\/tyiannak\/pyAudioAnalysis\/."},{"key":"e_1_3_2_1_66_1","unstructured":"Speaker recognition APIs - Microsoft Azure. https:\/\/docs.microsoft.com\/enus\/ rest\/api\/speakerrecognition\/.  Speaker recognition APIs - Microsoft Azure. https:\/\/docs.microsoft.com\/enus\/ rest\/api\/speakerrecognition\/."},{"key":"e_1_3_2_1_67_1","unstructured":"The VoicePrivacy 2022 Challenge. https:\/\/www.voiceprivacychallenge.org\/.  The VoicePrivacy 2022 Challenge. https:\/\/www.voiceprivacychallenge.org\/."},{"key":"e_1_3_2_1_68_1","volume-title":"Proc. IEEE ICME2000","author":"Arnold Michael","year":"2000","unstructured":"Michael Arnold . Audio watermarking : Features, applications and algorithms . In Proc. IEEE ICME2000 , 2000 . Michael Arnold. Audio watermarking: Features, applications and algorithms. In Proc. IEEE ICME2000, 2000."},{"key":"e_1_3_2_1_69_1","unstructured":"worldveil\/dejavu: Audio fingerprinting and recognition in Python. https:\/\/github. com\/worldveil\/dejavu.  worldveil\/dejavu: Audio fingerprinting and recognition in Python. https:\/\/github. com\/worldveil\/dejavu."},{"key":"e_1_3_2_1_70_1","unstructured":"Audio watermark - audio watermarking. https:\/\/uplex.de\/audiowmark\/README.html.  Audio watermark - audio watermarking. https:\/\/uplex.de\/audiowmark\/README.html."},{"key":"e_1_3_2_1_71_1","unstructured":"pHash: the open source perceptual hash library. https:\/\/www.phash.org\/.  pHash: the open source perceptual hash library. https:\/\/www.phash.org\/."},{"key":"e_1_3_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.specom.2004.12.004"},{"key":"e_1_3_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICOEI48184.2020.9143014"},{"key":"e_1_3_2_1_74_1","volume-title":"On the relevance of language in speaker recognition. In arXiv:2203.01992","author":"Satue-Villar Antonio","year":"2022","unstructured":"Antonio Satue-Villar and Marcos Faundez-Zanuy . On the relevance of language in speaker recognition. In arXiv:2203.01992 , 2022 . Antonio Satue-Villar and Marcos Faundez-Zanuy. On the relevance of language in speaker recognition. In arXiv:2203.01992, 2022."},{"key":"e_1_3_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2020-1791"},{"key":"e_1_3_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP39728.2021.9414305"},{"key":"e_1_3_2_1_77_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP43922.2022.9747161"},{"key":"e_1_3_2_1_78_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2016.7552917"},{"key":"e_1_3_2_1_79_1","doi-asserted-by":"publisher","DOI":"10.1145\/3411495.3421355"},{"key":"e_1_3_2_1_80_1","doi-asserted-by":"publisher","DOI":"10.1145\/2660267.2660300"},{"key":"e_1_3_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2014.2328175"},{"key":"e_1_3_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2010.5495479"},{"key":"e_1_3_2_1_83_1","doi-asserted-by":"publisher","DOI":"10.1145\/1288869.1288879"},{"key":"e_1_3_2_1_84_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-04431-1_17"},{"key":"e_1_3_2_1_85_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.forsciint.2006.06.033"},{"key":"e_1_3_2_1_86_1","volume-title":"Proc. Audio Engineering Society Conference","author":"Cooper Alan J","year":"2008","unstructured":"Alan J Cooper . The electric network frequency (ENF) as an aid to authenticating forensic digital audio recordings--an automated approach . In Proc. Audio Engineering Society Conference , 2008 . Alan J Cooper. The electric network frequency (ENF) as an aid to authenticating forensic digital audio recordings--an automated approach. In Proc. Audio Engineering Society Conference, 2008."},{"key":"e_1_3_2_1_87_1","volume-title":"Marco Fontani, Giovanni Rocciolo, and Alessandro Piva. Detection and localization of double compression in mp3 audio tracks. In EURASIP Journal on information Security","author":"Bianchi Tiziano","year":"2014","unstructured":"Tiziano Bianchi , Alessia De Rosa , Marco Fontani, Giovanni Rocciolo, and Alessandro Piva. Detection and localization of double compression in mp3 audio tracks. In EURASIP Journal on information Security , 2014 . Tiziano Bianchi, Alessia De Rosa, Marco Fontani, Giovanni Rocciolo, and Alessandro Piva. Detection and localization of double compression in mp3 audio tracks. In EURASIP Journal on information Security, 2014."},{"key":"e_1_3_2_1_88_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.forsciint.2014.02.008"},{"key":"e_1_3_2_1_89_1","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2017.2647921"},{"key":"e_1_3_2_1_90_1","volume-title":"Digital Investigation: The International Journal of Digital Forensics & Incident Response","author":"Zheng Lilei","year":"2017","unstructured":"Lilei Zheng , Ying Zhang , Chien Eao Lee , and Vrizlynn L.L. Thing . Time-ofrecording estimation for audio recordings . In Digital Investigation: The International Journal of Digital Forensics & Incident Response , 2017 . Lilei Zheng, Ying Zhang, Chien Eao Lee, and Vrizlynn L.L. Thing. Time-ofrecording estimation for audio recordings. In Digital Investigation: The International Journal of Digital Forensics & Incident Response, 2017."}],"event":{"name":"CCS '22: 2022 ACM SIGSAC Conference on Computer and Communications Security","sponsor":["SIGSAC ACM Special Interest Group on Security, Audit, and Control"],"location":"Los Angeles CA USA","acronym":"CCS '22"},"container-title":["Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3548606.3560572","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3548606.3560572","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:50:57Z","timestamp":1750182657000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3548606.3560572"}},"subtitle":["Sharing Private Audio Recordings"],"short-title":[],"issued":{"date-parts":[[2022,11,7]]},"references-count":90,"alternative-id":["10.1145\/3548606.3560572","10.1145\/3548606"],"URL":"https:\/\/doi.org\/10.1145\/3548606.3560572","relation":{},"subject":[],"published":{"date-parts":[[2022,11,7]]},"assertion":[{"value":"2022-11-07","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}