{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,6]],"date-time":"2026-05-06T16:10:35Z","timestamp":1778083835832,"version":"3.51.4"},"reference-count":37,"publisher":"World Scientific Pub Co Pte Ltd","issue":"09","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Patt. Recogn. Artif. Intell."],"published-print":{"date-parts":[[2023,7]]},"abstract":"<jats:p> Deepfake technology, especially deep voice, which has been derived from artificial intelligence in recent years, is potentially harmful, and the public is not yet wary. However, many speech synthesis models measure the degree of true restitution by Mean Opinion Score (MOS), a subjective assessment of naturalness and quality of speech by human subjects, but in future it will be difficult to distinguish the interlocutor\u2019s identity through the screen. For this reason, this study addresses the threat posed by this new technology by combining representational learning and transfer learning in two sub-systems: a recognition system and a voice print system. The recognition system is responsible for the detection of which voice is a fake voice generated by speech conversion or speech synthesis techniques, while the acoustic system is responsible for the verification of the speaker\u2019s identity through acoustic features. In the speech recognition system, we use the representation learning method and the transfer classification method. We use X-vector data for training, and then fine-tune the model using four types of marker data to learn the representation vectors of real and fake voice, and use support vector machine to classify real and fake voice in the back-end to reduce the negative effect of the new technique. 
<\/jats:p>","DOI":"10.1142\/s0218001423500155","type":"journal-article","created":{"date-parts":[[2023,3,31]],"date-time":"2023-03-31T08:48:23Z","timestamp":1680252503000},"source":"Crossref","is-referenced-by-count":4,"title":["Deepfake Speech Recognition and Detection"],"prefix":"10.1142","volume":"37","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1872-4256","authenticated-orcid":false,"given":"Hung-Chang","family":"Chang","sequence":"first","affiliation":[{"name":"Bachelor Program in Interdisciplinary Studies Yunlin University of Science and Technology Yunlin, Doulu, Taiwan"}]}],"member":"219","published-online":{"date-parts":[[2023,7,21]]},"reference":[{"issue":"2","key":"S0218001423500155BIB001","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1250\/ast.11.71","volume":"11","author":"Abe M.","year":"1990","journal-title":"J. Acoust. Soc. Jpn."},{"key":"S0218001423500155BIB002","first-page":"1","volume-title":"Int. Conf. Advancements in Computational Sciences (ICACS)","author":"Ahmed I.","year":"2018"},{"issue":"6","key":"S0218001423500155BIB003","doi-asserted-by":"crossref","first-page":"102","DOI":"10.1109\/MCOM.2018.1700928","volume":"56","author":"Akyildiz I. F.","year":"2018","journal-title":"IEEE Commun. Mag."},{"issue":"22","key":"S0218001423500155BIB004","first-page":"10","volume":"12","author":"Behringer K.","year":"2016","journal-title":"Achieving Sustainable Development \u2013 Theoretical Approach, Eur. Sci. J. ESJ"},{"issue":"9","key":"S0218001423500155BIB005","doi-asserted-by":"crossref","first-page":"1437","DOI":"10.1109\/5.628714","volume":"85","author":"Campbell J. P.","year":"1997","journal-title":"Proc. IEEE"},{"key":"S0218001423500155BIB006","first-page":"248","volume-title":"Proc. IEEE Int. Conf. 
Image Processing","author":"Conotter V.","year":"2014"},{"issue":"2","key":"S0218001423500155BIB007","doi-asserted-by":"crossref","first-page":"94","DOI":"10.7861\/futurehosp.6-2-94","volume":"6","author":"Davenport T.","year":"2019","journal-title":"Fut. Health"},{"key":"S0218001423500155BIB008","doi-asserted-by":"crossref","first-page":"637","DOI":"10.1121\/1.1906946","volume":"24","author":"Davis K.","year":"1952","journal-title":"J. Acoust. Soc. Am."},{"key":"S0218001423500155BIB009","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1109\/TASSP.1980.1163420","volume":"28","author":"Davis S.","year":"1980","journal-title":"IEEE Trans. Acoustics Speech Signal Processing"},{"issue":"4","key":"S0218001423500155BIB010","doi-asserted-by":"crossref","first-page":"677","DOI":"10.1109\/TPAMI.2016.2599174","volume":"39","author":"Donahue J.","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"issue":"8","key":"S0218001423500155BIB012","first-page":"42","volume":"41","author":"Ghai W.","year":"2016","journal-title":"Int. J. Computer Appl."},{"key":"S0218001423500155BIB013","doi-asserted-by":"crossref","first-page":"41596","DOI":"10.1109\/ACCESS.2019.2905689","volume":"7","author":"Hasan H. R.","year":"2019","journal-title":"IEEE Access"},{"issue":"8","key":"S0218001423500155BIB014","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"Hochreiter S.","year":"1997","journal-title":"Neural Comput."},{"key":"S0218001423500155BIB015","doi-asserted-by":"crossref","first-page":"373","DOI":"10.1109\/ICASSP.1996.541110","volume-title":"1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conf. Proc.","volume":"1","author":"Hunt A. J.","year":"1996"},{"issue":"2","key":"S0218001423500155BIB019","first-page":"65","volume":"120","author":"Lek S.","year":"2008","journal-title":"Ecol. 
Modelling"},{"issue":"1","key":"S0218001423500155BIB020","doi-asserted-by":"crossref","first-page":"86","DOI":"10.1631\/FITEE.1601885","volume":"18","author":"Li B. H.","year":"2017","journal-title":"Front. Inf. Technol. Electronic Eng."},{"issue":"4","key":"S0218001423500155BIB022","first-page":"298","volume":"6","author":"Manjula E.","year":"2017","journal-title":"Int. J. Comput. Intell. Informatics"},{"key":"S0218001423500155BIB023","first-page":"796","volume-title":"2018 2nd Int. Conf. Inventive Systems and Control (ICISC)","author":"Mishra S.","year":"2013"},{"key":"S0218001423500155BIB024","doi-asserted-by":"crossref","DOI":"10.1007\/b138251","volume-title":"Modelling Community Structure in Freshwater Ecosystems","author":"Lek S.","year":"2005"},{"issue":"2","key":"S0218001423500155BIB025","first-page":"784","volume":"7","author":"Patil P.","year":"2020","journal-title":"Int. Res. J. Eng. Technol."},{"key":"S0218001423500155BIB026","first-page":"1822","volume-title":"Proc. IEEE Conf. Computer Vision and Pattern Recognition Workshops","author":"Raghavendra R.","year":"2017"},{"issue":"1","key":"S0218001423500155BIB027","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1177\/0141076818815510","volume":"112","author":"Reddy S.","year":"2019","journal-title":"J. R. Soc. Med."},{"issue":"28","key":"S0218001423500155BIB029","first-page":"99","volume-title":"IFAC Proc.","volume":"46","author":"\u015echiopu D.","year":"2013"},{"issue":"4","key":"S0218001423500155BIB030","first-page":"86","volume":"3","author":"Sharma F. R.","year":"2012","journal-title":"Int. J. Comput. Commun. Control."},{"issue":"5","key":"S0218001423500155BIB031","doi-asserted-by":"crossref","first-page":"525","DOI":"10.1109\/89.784104","volume":"7","author":"Vergin R.","year":"1999","journal-title":"IEEE Trans. 
Speech Audio Processing"},{"issue":"3","key":"S0218001423500155BIB032","doi-asserted-by":"crossref","first-page":"438","DOI":"10.1109\/TIFS.2007.902661","volume":"2","author":"Wang W.","year":"2007","journal-title":"IEEE Trans. Inf. Forensics Security"},{"key":"S0218001423500155BIB033","doi-asserted-by":"crossref","first-page":"1018","DOI":"10.3390\/sym11081018","volume":"11","author":"Wang D.","year":"2019","journal-title":"Symmetry"},{"issue":"11","key":"S0218001423500155BIB034","doi-asserted-by":"crossref","first-page":"1870","DOI":"10.1109\/29.103088","volume":"38","author":"Wilpon J.","year":"1990","journal-title":"IEEE Trans. Acoustics Speech Signal Processing"},{"key":"S0218001423500155BIB036","doi-asserted-by":"crossref","first-page":"6141","DOI":"10.1109\/ICASSP.2019.8683445","volume-title":"ICASSP 2019-2019 IEEE Int. Conf. Acoustics, Speech and Signal Processing (ICASSP)","author":"Zeinali H.","year":"2019"},{"issue":"6","key":"S0218001423500155BIB037","doi-asserted-by":"crossref","first-page":"582","DOI":"10.1007\/BF02943243","volume":"16","author":"Zheng F.","year":"2001","journal-title":"J. Computer Sci. Technol."},{"issue":"2","key":"S0218001423500155BIB038","doi-asserted-by":"crossref","first-page":"523","DOI":"10.1111\/cgf.13382","volume":"37","author":"Zollh\u00f6fer M.","year":"2001","journal-title":"Computer Graphics Forum"},{"issue":"3","key":"S0218001423500155BIB042","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1561\/2000000039","volume":"7","author":"Deng L.","year":"2014","journal-title":"Foundations and Trends in Signal Processing"},{"key":"S0218001423500155BIB043","author":"Masuyama Y.","year":"2017","journal-title":"Clin. Orthop. Relat. Res."},{"issue":"9","key":"S0218001423500155BIB045","first-page":"283","volume":"47","author":"Bao Y. X.","year":"2020","journal-title":"Comp. Sci."},{"issue":"9","key":"S0218001423500155BIB046","first-page":"818","volume":"6","author":"Bai G. 
Z.","year":"2020","journal-title":"Information Security Research"},{"issue":"10","key":"S0218001423500155BIB051","first-page":"2985","volume":"41","author":"Chang Y.","year":"2021","journal-title":"Computer Applications"},{"issue":"8","key":"S0218001423500155BIB053","first-page":"52","volume":"31","author":"Yu H. Q.","year":"2019","journal-title":"Tech Law Review"}],"container-title":["International Journal of Pattern Recognition and Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218001423500155","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,16]],"date-time":"2023-08-16T04:27:15Z","timestamp":1692160035000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0218001423500155"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7]]},"references-count":37,"journal-issue":{"issue":"09","published-print":{"date-parts":[[2023,7]]}},"alternative-id":["10.1142\/S0218001423500155"],"URL":"https:\/\/doi.org\/10.1142\/s0218001423500155","relation":{},"ISSN":["0218-0014","1793-6381"],"issn-type":[{"value":"0218-0014","type":"print"},{"value":"1793-6381","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,7]]},"article-number":"2350015"}}