{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T18:09:31Z","timestamp":1777486171375,"version":"3.51.4"},"reference-count":50,"publisher":"World Scientific Pub Co Pte Ltd","issue":"05","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Wavelets Multiresolut Inf. Process."],"published-print":{"date-parts":[[2023,9]]},"abstract":"<jats:p> Desynchronization attacks proved to be the greatest challenge to audio watermarking systems as they introduce misalignment between the signal carrier and the watermark. This paper proposes a DNN-based speech watermarking system with two adversarial networks jointly trained on a set of desynchronization attacks to embed a randomly generated watermark. The detector neural network is expanded with spatial pyramid pooling layers to be able to handle signals affected by these attacks. A detailed training procedure of the aforementioned DNN system with gradual attack introduction is proposed in order to achieve robustness. Experiments performed on a speech dataset show that the system achieves satisfactory results according to all the benchmarks it was tested against. The system preserves signal quality after watermark embedding. Most importantly, the system achieved resistance to all considered desynchronization attacks. The majority of the attacks cause less than [Formula: see text]% of incorrectly detected watermarked bits on average, which outperforms comparative techniques in this regard. <\/jats:p>","DOI":"10.1142\/s0219691323500091","type":"journal-article","created":{"date-parts":[[2023,2,6]],"date-time":"2023-02-06T02:30:31Z","timestamp":1675650631000},"source":"Crossref","is-referenced-by-count":4,"title":["DNN-based speech watermarking resistant to desynchronization attacks"],"prefix":"10.1142","volume":"21","author":[{"given":"Kosta","family":"Pavlovi\u0107","sequence":"first","affiliation":[{"name":"Faculty of Natural Sciences and Mathematics, University of Montenegro, Podgorica, Montenegro"}]},{"given":"Slavko","family":"Kova\u010devi\u0107","sequence":"additional","affiliation":[{"name":"Faculty of Electrical Engineering, University of Montenegro, Podgorica, Montenegro"}]},{"given":"Igor","family":"Djurovi\u0107","sequence":"additional","affiliation":[{"name":"Faculty of Electrical Engineering, University of Montenegro, Podgorica, Montenegro"},{"name":"Montenegrin Academy of Sciences and Arts, Podgorica, Montenegro"}]},{"given":"Adam","family":"Wojciechowski","sequence":"additional","affiliation":[{"name":"Institute of Information Technology, Lodz University of Technology, \u0141\u00f3d\u017a, Poland"}]}],"member":"219","published-online":{"date-parts":[[2023,3,1]]},"reference":[{"key":"S0219691323500091BIB001","first-page":"2015","volume-title":"ICASSP \u201986: IEEE Int. Conf. Acoustics, Speech, and Signal Processing","volume":"11","author":"Charpentier F.","year":"1986"},{"issue":"12","key":"S0219691323500091BIB002","doi-asserted-by":"crossref","first-page":"1673","DOI":"10.1109\/83.650120","volume":"6","author":"Cox I. J.","year":"1997","journal-title":"IEEE Trans. Image Process."},{"key":"S0219691323500091BIB003","doi-asserted-by":"crossref","first-page":"618","DOI":"10.1109\/LSP.2021.3063888","volume":"28","author":"Cui Z.","year":"2021","journal-title":"IEEE Signal Process. Lett."},{"key":"S0219691323500091BIB004","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/BF02551274","volume":"2","author":"Cybenko G.","year":"1989","journal-title":"Math. Control Signals Systems"},{"key":"S0219691323500091BIB005","first-page":"173","volume":"46","author":"Dabas N.","year":"2019","journal-title":"J. Inf. Secur. Appl."},{"issue":"2","key":"S0219691323500091BIB006","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1006\/jnca.2000.0128","volume":"24","author":"Djurovi\u0107 I.","year":"2001","journal-title":"J. Netw. Comput. Appl."},{"key":"S0219691323500091BIB007","volume-title":"Int. Conf. Learning Representations 2016","author":"Dozat T.","year":"2016"},{"issue":"9","key":"S0219691323500091BIB008","doi-asserted-by":"crossref","first-page":"1493","DOI":"10.1002\/j.1538-7305.1966.tb01706.x","volume":"45","author":"Flanagan J. L.","year":"1966","journal-title":"Bell Syst. Tech. J."},{"issue":"9","key":"S0219691323500091BIB009","doi-asserted-by":"crossref","first-page":"714","DOI":"10.3390\/e20090714","volume":"20","author":"Guariglia E.","year":"2018","journal-title":"Entropy"},{"issue":"3","key":"S0219691323500091BIB010","doi-asserted-by":"crossref","first-page":"304","DOI":"10.3390\/e21030304","volume":"21","author":"Guariglia E.","year":"2019","journal-title":"Entropy"},{"key":"S0219691323500091BIB011","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1155\/2022\/5542054","volume":"2022","author":"Guariglia E.","year":"2022","journal-title":"J. Funct. Spaces"},{"key":"S0219691323500091BIB012","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1007\/978-3-319-42105-6_16","volume-title":"Engineering Mathematics II","author":"Guariglia E.","year":"2016"},{"key":"S0219691323500091BIB013","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1007\/978-3-319-10578-9_23","volume-title":"Computer Vision \u2014 ECCV 2014","author":"He K.","year":"2014"},{"issue":"5","key":"S0219691323500091BIB014","doi-asserted-by":"crossref","first-page":"359","DOI":"10.1016\/0893-6080(89)90020-8","volume":"2","author":"Hornik K.","year":"1989","journal-title":"Neural Netw."},{"key":"S0219691323500091BIB015","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1016\/j.dsp.2019.01.006","volume":"87","author":"Hu H.-T.","year":"2019","journal-title":"Digit. Signal Process."},{"issue":"2","key":"S0219691323500091BIB016","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1109\/TASLP.2014.2387385","volume":"23","author":"Hua G.","year":"2015","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"issue":"10","key":"S0219691323500091BIB017","doi-asserted-by":"crossref","first-page":"2447","DOI":"10.1109\/TMM.2019.2907475","volume":"21","author":"Huang Y.","year":"2019","journal-title":"IEEE Trans. Multimed."},{"issue":"1","key":"S0219691323500091BIB018","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1109\/TMM.2017.2721642","volume":"20","author":"Hwang M.-J.","year":"2018","journal-title":"IEEE Trans. Multimed."},{"key":"S0219691323500091BIB019","doi-asserted-by":"crossref","first-page":"2250032","DOI":"10.1142\/S0219691322500321","volume":"21","author":"Jadda A.","year":"2023","journal-title":"Int. J. Wavelets Multiresolut. Inf. Process."},{"key":"S0219691323500091BIB020","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1016\/j.sigpro.2019.04.017","volume":"162","author":"Jiang W.","year":"2019","journal-title":"Signal Process."},{"key":"S0219691323500091BIB021","doi-asserted-by":"crossref","first-page":"247","DOI":"10.1016\/j.cose.2016.11.016","volume":"65","author":"Kandi H.","year":"2017","journal-title":"Comput. Secur."},{"issue":"3","key":"S0219691323500091BIB023","doi-asserted-by":"crossref","first-page":"501","DOI":"10.1016\/0893-6080(92)90012-8","volume":"5","author":"Kurkov\u00e1 V.","year":"1992","journal-title":"Neural Netw."},{"key":"S0219691323500091BIB024","doi-asserted-by":"publisher","DOI":"10.1142\/S0219691310003870"},{"key":"S0219691323500091BIB025","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1016\/j.inffus.2016.05.004","volume":"33","author":"Li S.","year":"2017","journal-title":"Inf. Fusion"},{"key":"S0219691323500091BIB026","doi-asserted-by":"crossref","first-page":"107584","DOI":"10.1016\/j.sigpro.2020.107584","volume":"173","author":"Liang X.","year":"2020","journal-title":"Signal Process."},{"issue":"5","key":"S0219691323500091BIB027","doi-asserted-by":"crossref","first-page":"1171","DOI":"10.1109\/TIFS.2018.2871748","volume":"14","author":"Liu Z.","year":"2019","journal-title":"IEEE Trans. Inf. Forensics Sec."},{"key":"S0219691323500091BIB028","doi-asserted-by":"crossref","first-page":"541","DOI":"10.1109\/TIP.2021.3132828","volume":"31","author":"Liu Y.","year":"2021","journal-title":"IEEE Trans. Image Process."},{"key":"S0219691323500091BIB029","doi-asserted-by":"crossref","first-page":"2408","DOI":"10.1109\/ACCESS.2021.3139850","volume":"10","author":"Lopac N.","year":"2022","journal-title":"IEEE Access"},{"key":"S0219691323500091BIB030","doi-asserted-by":"publisher","DOI":"10.1142\/S0219691311003931"},{"issue":"2","key":"S0219691323500091BIB031","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1109\/TASSP.1979.1163210","volume":"27","author":"Malah D.","year":"1979","journal-title":"IEEE Trans. Acoust. Speech Signal Process."},{"issue":"7","key":"S0219691323500091BIB032","doi-asserted-by":"crossref","first-page":"674","DOI":"10.1109\/34.192463","volume":"11","author":"Mallat S.","year":"1989","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"S0219691323500091BIB033","doi-asserted-by":"crossref","first-page":"191","DOI":"10.1016\/j.neucom.2019.01.067","volume":"337","author":"Mun S.-M.","year":"2019","journal-title":"Neurocomputing"},{"issue":"11","key":"S0219691323500091BIB034","doi-asserted-by":"crossref","first-page":"2176","DOI":"10.1109\/TASLP.2017.2749001","volume":"25","author":"Natgunanathan I.","year":"2017","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"S0219691323500091BIB035","first-page":"543","volume":"269","author":"Nesterov Y.","year":"1983","journal-title":"Dokl. Akad. Nauk SSSR"},{"key":"S0219691323500091BIB036","volume-title":"Discrete-time Signal Processing","author":"Oppenheim A.","year":"1999"},{"key":"S0219691323500091BIB037","doi-asserted-by":"crossref","first-page":"103381","DOI":"10.1016\/j.dsp.2021.103381","volume":"122","author":"Pavlovi\u0107 K.","year":"2021","journal-title":"Digit. Signal Process."},{"issue":"1","key":"S0219691323500091BIB038","doi-asserted-by":"crossref","first-page":"382","DOI":"10.1016\/j.dsp.2012.08.006","volume":"23","author":"Peng H.","year":"2013","journal-title":"Digit. Signal Process."},{"key":"S0219691323500091BIB039","volume-title":"Digital Processing of Speech Signals","author":"Rabiner L.","year":"1978"},{"key":"S0219691323500091BIB040","first-page":"749","volume-title":"2001 IEEE Int. Conf. Acoustics, Speech, and Signal Processing Proc. (Cat. No. 01CH37221)","volume":"2","author":"Rix A. W.","year":"2001"},{"key":"S0219691323500091BIB041","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"crossref","first-page":"234","DOI":"10.1007\/978-3-319-24574-4_28","volume-title":"Int. Conf. Medical Image Computing and Computer-Assisted Intervention","volume":"9351","author":"Ronneberger O.","year":"2015"},{"issue":"4","key":"S0219691323500091BIB042","doi-asserted-by":"crossref","first-page":"650","DOI":"10.1109\/83.913599","volume":"10","author":"Stankovic S.","year":"2001","journal-title":"IEEE Trans. Image Process."},{"key":"S0219691323500091BIB043","first-page":"49","volume-title":"Proc. Int. Conf. Information Technology: Coding and Computing","author":"Steinebach M.","year":"2001"},{"key":"S0219691323500091BIB044","doi-asserted-by":"crossref","first-page":"2349","DOI":"10.1109\/TASLP.2020.3013785","volume":"28","author":"Wang S.","year":"2020","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"issue":"3","key":"S0219691323500091BIB045","doi-asserted-by":"crossref","first-page":"529","DOI":"10.1109\/TASLP.2017.2782487","volume":"26","author":"Xiang Y.","year":"2018","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"S0219691323500091BIB046","doi-asserted-by":"crossref","first-page":"484","DOI":"10.1109\/LSP.2022.3143038","volume":"29","author":"Xiao D.","year":"2022","journal-title":"IEEE Signal Process. Lett."},{"key":"S0219691323500091BIB047","doi-asserted-by":"crossref","first-page":"2282","DOI":"10.1109\/TASLP.2021.3092555","volume":"29","author":"Zhao J.","year":"2021","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"issue":"7","key":"S0219691323500091BIB048","doi-asserted-by":"crossref","first-page":"1696","DOI":"10.1109\/TSP.2019.2896246","volume":"67","author":"Zheng X.","year":"2019","journal-title":"IEEE Trans. Signal Process."},{"issue":"2","key":"S0219691323500091BIB049","doi-asserted-by":"crossref","first-page":"787","DOI":"10.1016\/j.acha.2019.06.004","volume":"48","author":"Zhou D.-X.","year":"2020","journal-title":"Appl. Comput. Harmon. Anal."},{"key":"S0219691323500091BIB050","first-page":"682","volume-title":"15th Eur. Conf.","author":"Zhu J.","year":"2018"},{"issue":"5","key":"S0219691323500091BIB051","doi-asserted-by":"crossref","first-page":"717","DOI":"10.1109\/TCSVT.2014.2363743","volume":"25","author":"Zong T.","year":"2015","journal-title":"IEEE Trans. Circuits Syst. Video Technol."}],"container-title":["International Journal of Wavelets, Multiresolution and Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0219691323500091","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,10]],"date-time":"2023-08-10T09:52:12Z","timestamp":1691661132000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0219691323500091"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,3,1]]},"references-count":50,"journal-issue":{"issue":"05","published-print":{"date-parts":[[2023,9]]}},"alternative-id":["10.1142\/S0219691323500091"],"URL":"https:\/\/doi.org\/10.1142\/s0219691323500091","relation":{},"ISSN":["0219-6913","1793-690X"],"issn-type":[{"value":"0219-6913","type":"print"},{"value":"1793-690X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,3,1]]},"article-number":"2350009"}}