{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,11]],"date-time":"2026-06-11T02:02:11Z","timestamp":1781143331925,"version":"3.54.1"},"reference-count":136,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2024,9,4]],"date-time":"2024-09-04T00:00:00Z","timestamp":1725408000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100010418","name":"Defence Science and Technology Laboratory","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100010418","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Big Data"],"abstract":"<jats:p>Recent advancements in AI, especially deep learning, have contributed to a significant increase in the creation of new realistic-looking synthetic media (video, image, and audio) and manipulation of existing media, which has led to the creation of the new term \u201cdeepfake.\u201d Based on both the research literature and resources in English, this paper gives a comprehensive overview of deepfake, covering multiple important aspects of this emerging concept, including (1) different definitions, (2) commonly used performance metrics and standards, and (3) deepfake-related datasets. In addition, the paper also reports a meta-review of 15 selected deepfake-related survey papers published since 2020, focusing not only on the mentioned aspects but also on the analysis of key challenges and recommendations. We believe that this paper is the most comprehensive review of deepfake in terms of the aspects covered.<\/jats:p>","DOI":"10.3389\/fdata.2024.1400024","type":"journal-article","created":{"date-parts":[[2024,9,4]],"date-time":"2024-09-04T05:08:23Z","timestamp":1725426503000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":43,"title":["Deepfake: definitions, performance metrics and standards, datasets, and a meta-review"],"prefix":"10.3389","volume":"7","author":[{"given":"Enes","family":"Altuncu","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Virginia N. L.","family":"Franqueira","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Shujun","family":"Li","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1965","published-online":{"date-parts":[[2024,9,4]]},"reference":[{"key":"B1","first-page":"1","article-title":"\u201cMesoNet: a compact facial video forgery detection network,\u201d","volume-title":"Proceedings of the 2018 IEEE International Workshop on Information Forensics and Security","author":"Afchar","year":"2018"},{"key":"B2","unstructured":"AjderH.\n            PatriniG.\n            CavalliF.\n            CullenL.\n          The state of deepfakes: Landscape, threats, and impact2019"},{"key":"B3","doi-asserted-by":"publisher","first-page":"21090","DOI":"10.1109\/ACCESS.2017.2750918","article-title":"Audio-visual multimedia quality assessment: a comprehensive survey","volume":"5","author":"Akhtar","year":"2017","journal-title":"IEEE Access"},{"key":"B4","unstructured":"AlshammariH.\n            EI-SayedA.\n          AIRABIC: Arabic dataset for performance evaluation of ai detectors"},{"key":"B5","doi-asserted-by":"publisher","first-page":"864","DOI":"10.1109\/ICMLA58977.2023.00127","author":"Alshammari","year":""},{"key":"B6","unstructured":"BaZ.\n            WenQ.\n            ChengP.\n            WangY.\n            LinF.\n            LuL.\n          DEepfake CROss-lingual (DECRO) evaluation dataset"},{"key":"B7","doi-asserted-by":"publisher","first-page":"2033","DOI":"10.1145\/3543507.3583222","author":"Ba","year":""},{"key":"B8","doi-asserted-by":"publisher","first-page":"260","DOI":"10.3390\/fi15080260","article-title":"The power of generative AI: A review of requirements, models, input-output formats, evaluation metrics, and challenges","volume":"15","author":"Bandi","year":"2023","journal-title":"Fut. Internet"},{"key":"B9","unstructured":"BradyM.\n          Deepfakes: a new desinformation threat2020"},{"key":"B10","article-title":"AV-Deepfake1M: A large-scale LLM-driven audio-visual deepfake dataset","author":"Cai","year":"","journal-title":"arXiv:2311.15308"},{"key":"B11","unstructured":"CaiZ.\n            GhoshS.\n            AdatiaA. P.\n            HayatM.\n            DhallA.\n            StefanovK.\n          AV-Deepfake1M: a large-scale LLM-driven audio-visual deepfake dataset"},{"key":"B12","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3592116","article-title":"Attend-and-Excite: attention-based semantic guidance for text-to-image diffusion models","volume":"42","author":"Chefer","year":"2023","journal-title":"ACM Trans. Graph"},{"key":"B13","article-title":"X-IQE: eXplainable image quality evaluation for text-to-image generation with visual large language models","author":"Chen","year":"2023","journal-title":"arXiv:2305.10843"},{"key":"B14","article-title":"\u201cFakeCatcher: Detection of synthetic portrait videos using biological signals,\u201d","author":"Ciftci","year":"2020","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"B15","first-page":"5781","article-title":"\u201cOn the detection of digital face manipulation,\u201d","volume-title":"Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Dang","year":"2020"},{"key":"B16","author":"Delgado","year":"","journal-title":"ASVspoof 2021 challenge"},{"key":"B17","author":"Delgado","year":"","journal-title":"ASVspoof 2021 challenge"},{"key":"B18","doi-asserted-by":"crossref","first-page":"248","DOI":"10.1109\/CVPR.2009.5206848","article-title":"\u201cImageNet: a large-scale hierarchical image database,\u201d","volume-title":"Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition","author":"Deng","year":"2009"},{"key":"B19","doi-asserted-by":"crossref","first-page":"293","DOI":"10.1007\/978-981-15-7421-4_27","article-title":"\u201cDeepfake detection approaches using deep learning: A systematic review,\u201d","volume-title":"Intelligent Computing and Networking: Proceedings of IC-ICN 2020, volume 146 of Lecture Notes in Networks and Systems","author":"Deshmukh","year":"2021"},{"key":"B20","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13635-020-00109-8","article-title":"Swapped face detection using deep learning and subjective assessment","volume":"2020","author":"Ding","year":"2020","journal-title":"EURASIP J. Inf. Secur"},{"key":"B21","article-title":"The DeepFake detection challenge dataset","author":"Dolhansky","year":"2020","journal-title":"arXiv preprint arXiv:2006.07397"},{"key":"B22","unstructured":"DufourN.\n            GullyA.\n          36700137Contributing data to deepfake detection research2019"},{"key":"B23","article-title":"Unmasking deepfakes with simple features","author":"Durall","year":"2019","journal-title":"arXiv:1911.00686"},{"key":"B24","doi-asserted-by":"publisher","first-page":"e0251415","DOI":"10.1371\/journal.pone.0251415","article-title":"TweepFake: about detecting deepfake tweets","volume":"16","author":"Fagni","year":"","journal-title":"PLoS ONE"},{"key":"B25","unstructured":"FagniT.\n            FalchiF.\n            GambiniM.\n            MartellaA.\n            TesconiM.\n          33984021TweepFake: about detecting deepfake tweets"},{"key":"B26","first-page":"1","article-title":"\u201cVideoforensicshq: detecting high-quality manipulated face videos,\u201d","volume-title":"Proceedings of the 2021 IEEE International Conference on Multimedia and Expo","author":"Fox","year":"2021"},{"key":"B27","first-page":"1","article-title":"\u201cWaveFake: a data set to facilitate audio deepfake detection,\u201d","author":"Frank","year":"","journal-title":"Proceedings of the 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks"},{"key":"B28","unstructured":"FrankJ.\n            Sch\u00f6nherrL.\n          WaveFake: a data set to facilitate audio deepfake detection"},{"key":"B29","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2019-1541","author":"Gong","year":"","journal-title":"Proceedings of Interspeech 2019"},{"key":"B30","unstructured":"GongY.\n            YangJ.\n            HuberJ.\n            MacKnightM.\n            PoellabauerC.\n          ReMASC: realistic replay attack corpus for voice controlled systems"},{"key":"B31","article-title":"How close is ChatGPT to human experts? comparison corpus, evaluation, and detection","author":"Guo","year":"","journal-title":"arXiv:2301.07597"},{"key":"B32","unstructured":"GuoB.\n            ZhangX.\n            WangZ.\n            JiangM.\n            NieJ.\n            DingY.\n          Human ChatGPT Comparison Corpus (HC3)"},{"key":"B33","first-page":"3309","article-title":"\u201cToxiGen: a large-scale machine-generated dataset for adversarial and implicit hate speech detection,\u201d","volume-title":"Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics","author":"Hartvigsen","year":"2022"},{"key":"B34","first-page":"4360","article-title":"\u201cForgeryNet: a versatile benchmark for comprehensive forgery analysis,\u201d","volume-title":"Proceedings of the 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"He","year":"2021"},{"key":"B35","doi-asserted-by":"publisher","first-page":"e1520","DOI":"10.1002\/widm.1520","article-title":"Deepfake detection using deep learning methods: a systematic and comprehensive review","volume":"45","author":"Heidari","year":"2023","journal-title":"WIREs Data Mining Knowl. Discov"},{"key":"B36","doi-asserted-by":"crossref","first-page":"7514","DOI":"10.18653\/v1\/2021.emnlp-main.595","article-title":"\u201cCLIPScore: a reference-free evaluation metric for image captioning,\u201d","volume-title":"Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing","author":"Hessel","year":"2021"},{"key":"B37","first-page":"78723","article-title":"\u201cT2I-CompBench: a comprehensive benchmark for open-world compositional text-to-image generation,\u201d","volume-title":"Proceedings of the 37th Neural Information Processing Systems Track on Datasets and Benchmarks (NeurIPS '23)","author":"Huang","year":"2023"},{"key":"B38","unstructured":"JiaS.\n            LiX.\n            LyuS.\n          DFDM: Deepfakes from different models"},{"key":"B39","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP46576.2022.9897972","article-title":"Model attribution of face-swap deepfake videos","author":"Jia","year":"","journal-title":"arXiv:2202.12951"},{"key":"B40","first-page":"2886","article-title":"\u201cDeeperForensics-1.0: a large-scale dataset for real-world face forgery detection,\u201d","volume-title":"Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Jiang","year":"2020"},{"key":"B41","article-title":"Efficient neural audio synthesis","author":"Kalchbrenner","year":"2018","journal-title":"arXiv:1802.08435"},{"key":"B42","first-page":"4401","article-title":"\u201cA style-based generator architecture for generative adversarial networks,\u201d","volume-title":"Proceedings of the 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Karras","year":"2019"},{"key":"B43","doi-asserted-by":"publisher","first-page":"1001063","DOI":"10.3389\/fdata.2022.1001063","article-title":"Audio deepfakes: a survey","volume":"5","author":"Khanjani","year":"2023","journal-title":"Front. Big Data"},{"key":"B44","first-page":"1","article-title":"\u201cFake face detection methods: can they be generalized?\u201d","volume-title":"Proceedings of the 2018 International Conference of the Biometrics Special Interest Group","author":"Khodabakhsh","year":"2018"},{"key":"B45","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3355089.3356500","article-title":"Neural style-preserving visual dubbing","volume":"38","author":"Kim","year":"2019","journal-title":"ACM Trans. Graph"},{"key":"B46","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3197517.3201283","article-title":"Deep video portraits","volume":"37","author":"Kim","year":"2018","journal-title":"ACM Trans. Graph"},{"key":"B47","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/ICB45273.2019.8987375","article-title":"\u201cVulnerability assessment and detection of deepfake videos,\u201d","volume-title":"Proceedings of the 2019 International Conference on Biometrics","author":"Korshunov","year":"2019"},{"key":"B48","first-page":"10724","article-title":"\u201cKoDF: A large-scale korean DeepFake detection dataset,\u201d","volume-title":"Proceedings of the 2021 IEEE\/CVF International Conference on Computer Vision","author":"Kwon","year":"2021"},{"key":"B49","first-page":"21330","article-title":"\u201cBigDatasetGAN: Synthesizing imagenet with pixel-wise annotations,\u201d","volume-title":"Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Li","year":"2022"},{"key":"B50","first-page":"7","article-title":"\u201cFMFCC-V: an Asian large-scale challenging dataset for deepfake detection,\u201d","volume-title":"Proceedings of the 2022 ACM Workshop on Information Hiding and Multimedia Security","author":"Li","year":""},{"key":"B51","unstructured":"LiG.\n            ZhaoX.\n            CaoY.\n            PeiP.\n            LiJ.\n            ZhangZ.\n          FMFCC-V: an Asian large-scale challenging dataset for deepfake detection"},{"key":"B52","first-page":"12888","article-title":"\u201cBLIP: bootstrapping language-image pre-training for unified vision-language understanding and generation,\u201d","volume-title":"Proceedings of the 39th International Conference on Machine Learning","author":"Li","year":"2022"},{"key":"B53","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR42600.2020.00512","article-title":"\u201cAdvancing high fidelity identity swapping for forgery detection,\u201d","author":"Li","year":"2020","journal-title":"Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition"},{"key":"B54","first-page":"1","article-title":"\u201cIn ICTU OCULI: exposing AI created fake videos by detecting eye blinking,\u201d","volume-title":"Proceedings of the 2018 IEEE International Workshop on Information Forensics and Security","author":"Li","year":"2018"},{"key":"B55","article-title":"MAGE: machine-generated text detection in the wild","author":"Li","year":"2024","journal-title":"arXiv:2305.13242"},{"key":"B56","first-page":"3204","article-title":"\u201cCeleb-DF: a large-scale challenging dataset for deepfake forensics,\u201d","volume-title":"Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Li","year":"2020"},{"key":"B57","doi-asserted-by":"publisher","first-page":"2507","DOI":"10.1109\/TASLP.2023.3285283","article-title":"ASVspoof 2021: towards spoofed and deepfake speech detection in the wild","volume":"31","author":"Liu","year":"2023","journal-title":"IEEE\/ACM Trans. Audio, Speech Lang. Proc"},{"key":"B58","doi-asserted-by":"publisher","first-page":"e0196391","DOI":"10.1371\/journal.pone.0196391","article-title":"The ryerson audio-visual database of emotional speech and song (RAVDESS): a dynamic, multimodal set of facial and vocal expressions in north american english","volume":"13","author":"Livingstone","year":"2018","journal-title":"PLoS ONE"},{"key":"B59","article-title":"A benchmark corpus for the detection of automatically generated text in academic publications","author":"Liyanage","year":"","journal-title":"arXiv:2202.02013"},{"key":"B60","unstructured":"LiyanageV.\n            BuscaldiD.\n            NazarenkoA.\n          GeneratedTextDetection"},{"key":"B61","first-page":"195","article-title":"\u201cThe voice conversion challenge 2018: Promoting development of parallel and nonparallel methods,\u201d","volume-title":"Proceedings of the Odyssey 2018 The Speaker and Language Recognition Workshop","author":"Lorenzo-Trueba","year":"2018"},{"key":"B62","doi-asserted-by":"crossref","DOI":"10.1109\/ICMEW46912.2020.9105991","article-title":"\u201cDeepfake detection: Current challenges and next steps,\u201d","volume-title":"Proceedings of the 2020 IEEE International Conference on Multimedia Expo Workshops","author":"Lyu","year":"2020"},{"key":"B63","doi-asserted-by":"publisher","DOI":"10.2139\/ssrn.4748856","article-title":"CFAD: a Chinese dataset for fake audio detection","author":"Ma","year":"","journal-title":"arXiv:2207.12308"},{"key":"B64","unstructured":"MaH.\n            YiJ.\n            WangC.\n            YanX.\n            TaoJ.\n            WangT.\n          CFAD: a Chinese dataset for fake audio detection"},{"key":"B65","doi-asserted-by":"publisher","first-page":"3974","DOI":"10.1007\/s10489-022-03766-z","article-title":"Deepfakes generation and detection: State-of-the-art, open challenges, countermeasures, and way forward","volume":"53","author":"Masood","year":"2023","journal-title":"Appl. Intell"},{"key":"B66","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3425780","article-title":"The creation and detection of deepfakes: a survey","volume":"54","author":"Mirsky","year":"2021","journal-title":"ACM Comput. Surv"},{"key":"B67","doi-asserted-by":"crossref","first-page":"190","DOI":"10.18653\/v1\/2023.trustnlp-1.17","article-title":"\u201cDistinguishing fact from fiction: a benchmark dataset for identifying machine-generated scientific papers in the LLM era,\u201d","volume-title":"Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023)","author":"Mosca","year":"2023"},{"key":"B68","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2022-108","article-title":"\u201cDoes audio deepfake detection generalize?,\u201d","author":"M\u00fcller","year":"2022","journal-title":"Proceedings of Interspeech 2022"},{"key":"B69","article-title":"MLAAD: The multi-language audio anti-spoofing dataset","author":"M\u00fcller","year":"2024","journal-title":"arXiv:2401.09512"},{"key":"B70","doi-asserted-by":"publisher","first-page":"1006","DOI":"10.1109\/LSP.2014.2379648","article-title":"Can we automatically transform speech recorded on common consumer devices in real-world environments into professional production quality speech?\u2013a dataset, insights, and challenges","volume":"22","author":"Mysore","year":"2015","journal-title":"IEEE Signal Proc. Lett"},{"key":"B71","unstructured":"NarayanK.\n            AgarwalH.\n            ThakralK.\n            MittalS.\n            VatsaM.\n            SinghR.\n          Df-Platter database"},{"key":"B72","doi-asserted-by":"publisher","first-page":"9739","DOI":"10.1109\/CVPR52729.2023.00939","author":"Narayan","year":""},{"key":"B73","doi-asserted-by":"publisher","first-page":"1038","DOI":"10.1109\/JSTSP.2020.3007250","article-title":"GANprintR: improved fakes and evaluation of the state of the art in face manipulation detection","volume":"14","author":"Neves","year":"2020","journal-title":"IEEE J. Select. Topics Signal Proc"},{"key":"B74","doi-asserted-by":"publisher","first-page":"103525","DOI":"10.1016\/j.cviu.2022.103525","article-title":"Deep learning for deepfakes creation and detection: a survey","volume":"223","year":"2022","journal-title":"Comput. Vis. Image Understand"},{"key":"B75","first-page":"1","article-title":"\u201cExpanding language-image pretrained models for general video recognition,\u201d","volume-title":"Proceedings of the 17th European Conference on Computer Vision (ECCV '22)","author":"Ni","year":"2022"},{"key":"B76","first-page":"7183","article-title":"\u201cFSGAN: subject agnostic face swapping and reenactment,\u201d","volume-title":"Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision","author":"Nirkin","year":"2019"},{"key":"B77","unstructured":"GPT-2-output-dataset: dataset of GPT-2 outputs for research in detection, biases, and more2019"},{"key":"B78","doi-asserted-by":"publisher","first-page":"1391724","DOI":"10.1155\/2018\/1391724","article-title":"A survey of standardized approaches towards the quality of experience evaluation for video services: an ITU perspective","volume":"2018","author":"Pal","year":"2018","journal-title":"Int. J. Dig. Multimedia Broadcast"},{"key":"B79","first-page":"981","article-title":"\u201cDeepfake videos in the wild: analysis and detection,\u201d","volume-title":"Proceedings of the Web Conference 2021","author":"Pu","year":""},{"key":"B80","unstructured":"PuJ.\n            MangaokarN.\n            KellyL.\n            BhattacharyaP.\n            SundaramK.\n            JavedM.\n          DF-W: a new deepfake dataset comprising of deepfake videos created and shared by the internet community"},{"key":"B81","doi-asserted-by":"crossref","first-page":"1613","DOI":"10.1109\/SP46215.2023.10179387","article-title":"\u201cDeepfake text detection: Limitations and opportunities,\u201d","volume-title":"Proceedings of the 2023 IEEE Symposium on Security and Privacy (SP)","author":"Pu","year":"2023"},{"key":"B82","first-page":"8748","article-title":"\u201cLearning transferable visual models from natural language supervision,\u201d","volume-title":"Proceedings of the 38th International Conference on Machine Learning","author":"Radford","year":"2021"},{"key":"B83","doi-asserted-by":"publisher","first-page":"25494","DOI":"10.1109\/ACCESS.2022.3154404","article-title":"Deepfake detection: a systematic literature review","volume":"10","author":"Rana","year":"2022","journal-title":"IEEE Access"},{"key":"B84","article-title":"FaceForensics: a large-scale video dataset for forgery detection in human faces","author":"R\u00f6ssler","year":"2018","journal-title":"arXiv preprint arXiv:1803.09179"},{"key":"B85","first-page":"1","article-title":"\u201cFaceForensics++: learning to detect manipulated facial images,\u201d","volume-title":"Proceedings of the 2019 International Conference on Computer Vision","author":"R\u00f6ssler","year":"2019"},{"key":"B86","first-page":"252","article-title":"\u201cDEX: deep expectation of apparent age from a single image,\u201d","volume-title":"Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop","author":"Rothe","year":"2015"},{"key":"B87","doi-asserted-by":"publisher","first-page":"3859","DOI":"10.1007\/s00521-023-09288-0","article-title":"A comprehensive evaluation of feature-based AI techniques for deepfake detection","volume":"36","author":"Sandotra","year":"2024","journal-title":"Neural Comput. Applic"},{"key":"B88","unstructured":"The state of deepfakes 20242024"},{"key":"B89","doi-asserted-by":"publisher","first-page":"351","DOI":"10.1016\/j.neucom.2022.09.135","article-title":"A comprehensive overview of Deepfake: generation, detection, datasets, and opportunities","volume":"513","author":"Seow","year":"2022","journal-title":"Neurocomputing"},{"key":"B90","unstructured":"SongH.\n            HuangS.\n            DongY.\n            TuW.-W.\n          DeepFakeFace"},{"key":"B91","article-title":"Robustness and generalizability of deepfake detection: a study with diffusion models","author":"Song","year":"","journal-title":"arXiv:2309.02218"},{"key":"B92","unstructured":"SuZ.\n            LiM.\n            ZhangG.\n            WuQ.\n            LiM.\n            ZhangW.\n          CMFD"},{"key":"B93","doi-asserted-by":"publisher","first-page":"4016","DOI":"10.1109\/TDSC.2022.3215280","article-title":"Robust audio copy-move forgery detection using constant q spectral sketches and GA-SVM","volume":"20","author":"Su","year":"","journal-title":"IEEE Trans. Depend. Secure Comput"},{"key":"B94","article-title":"HC3 Plus: a semantic-invariant human ChatGPT comparison corpus","author":"Su","year":"2024","journal-title":"arXiv:2309.02731"},{"key":"B95","article-title":"WaveCycleGAN2: time-domain neural post-filter for speech waveform generation","author":"Tanaka","year":"2019","journal-title":"arXiv:1904.02892"},{"key":"B96","first-page":"1151","article-title":"\u201cLooking for traces of textual deepfakes in Bulgarian on social media,\u201d","volume-title":"Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing","author":"Temnikova","year":"2023"},{"key":"B97","unstructured":"Artificial intelligence white paper2020"},{"key":"B98","doi-asserted-by":"crossref","first-page":"1632","DOI":"10.21437\/Interspeech.2016-1066","article-title":"\u201cThe voice conversion challenge 2016,\u201d","volume-title":"Proceedings of Interspeech 2016","author":"Toda","year":"2016"},{"key":"B99","doi-asserted-by":"publisher","first-page":"131","DOI":"10.1016\/j.inffus.2020.06.014","article-title":"Deepfakes and beyond: a survey of face manipulation and fake detection","volume":"64","author":"Tolosana","year":"2020","journal-title":"Inf. Fusion"},{"key":"B100","first-page":"265","article-title":"\u201cAn overview of deepfake: the sword of Damocles in AI,\u201d","volume-title":"Proceedings of the 2020 International Conference on Computer Vision, Image and Deep Learning","author":"Tong","year":"2020"},{"key":"B101","doi-asserted-by":"crossref","first-page":"2001","DOI":"10.18653\/v1\/2021.findings-emnlp.172","article-title":"\u201cTURINGBENCH: a benchmark environment for Turing test in the age of neural text generation,\u201d","volume-title":"Findings of the Association for Computational Linguistics: EMNLP 2021","author":"Uchendu","year":"2021"},{"key":"B102","article-title":"WaveNet: a generative model for raw audio","author":"Van Den Oord","year":"2016","journal-title":"arXiv:1609.03499"},{"key":"B103","doi-asserted-by":"publisher","first-page":"910","DOI":"10.1109\/JSTSP.2020.3002101","article-title":"Media forensics and deepfakes: an overview","volume":"14","author":"Verdoliva","year":"2020","journal-title":"IEEE J. Selected Topics Signal Proc"},{"key":"B104","doi-asserted-by":"publisher","first-page":"101114","DOI":"10.1016\/j.csl.2020.101114","article-title":"ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech","volume":"64","author":"Wang","year":"2020","journal-title":"Comput. Speech Lang"},{"key":"B105","first-page":"22388","article-title":"\u201cDIRE for diffusion-generated image detection,\u201d","volume-title":"Proceedings of the 2023 IEEE\/CVF International Conference on Computer Vision","author":"Wang","year":""},{"key":"B106","unstructured":"WangZ.\n            BaoJ.\n            ZhouW.\n            WangW.\n            HuH.\n            ChenH.\n          DIRE for diffusion-generated image detection"},{"key":"B107","first-page":"24824","article-title":"\u201cChain-of-Thought prompting elicits reasoning in large language models,\u201d","volume-title":"Proceedings of the 36th Neural Information Processing Systems (NeurIPS '22)","author":"Wei","year":"2022"},{"key":"B108","article-title":"Towards a better metric for text-to-video generation","author":"Wu","year":"2024","journal-title":"arXiv:2401.07781"},{"key":"B109","first-page":"54683","article-title":"\u201cDatasetDM: synthesizing data with perception annotations using diffusion models,\u201d","volume-title":"Proceedings of the 37th International Conference on Neural Information Processing Systems","author":"Wu","year":"2023"},{"key":"B110","unstructured":"Homologous deepfake dataset: A self built small-scale, high-quality, and diverse deepfake dataset2024"},{"key":"B111","unstructured":"XieY.\n            ZhouJ.\n            LuX.\n            JiangZ.\n            YangY.\n            ChengH."},{"key":"B112","unstructured":"XieY.\n            ZhouJ.\n            LuX.\n            JiangZ.\n            YangY.\n            ChengH.\n          FSD: an initial chinese dataset for fake song detection"},{"key":"B113","first-page":"6639","article-title":"\u201cDiverse and aligned audio-to-video generation via text-to-video model adaptation,\u201d","volume-title":"Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI '24)","author":"Yariv","year":"2024"},{"key":"B114","doi-asserted-by":"crossref","first-page":"1654","DOI":"10.21437\/Interspeech.2021-930","article-title":"\u201cHalf-Truth: a partially fake audio detection dataset,\u201d","author":"Yi","year":"2021","journal-title":"Proceedings of Interspeech 2021"},{"key":"B115","first-page":"9216","article-title":"\u201cADD 2022: the first audio deep synthesis detection challenge,\u201d","volume-title":"Proceedings of the 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","author":"Yi","year":""},{"key":"B116","first-page":"125","article-title":"\u201cADD 2023: the second audio deepfake detection challenge,\u201d","volume-title":"Proceedings of the Workshop on Deepfake Audio Detection and Analysis","author":"Yi","year":"2023"},{"key":"B117","unstructured":"YiJ.\n            WangC.\n            TaoJ.\n            TianZ.\n            FanC.\n            MaH.\n          SceneFake: an initial dataset and benchmarks for scene fake audio detection"},{"key":"B118","doi-asserted-by":"publisher","first-page":"110468","DOI":"10.1016\/j.patcog.2024.110468","article-title":"SceneFake: an initial dataset and benchmarks for scene fake audio detection","volume":"152","author":"Yi","year":"2024","journal-title":"Patt. Recogn"},{"key":"B119","doi-asserted-by":"crossref","first-page":"80","DOI":"10.21437\/VCCBC.2020-14","article-title":"\u201cVoice conversion challenge 2020-intra-lingual semi-parallel and cross-lingual voice conversion.,\u201d","volume-title":"Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020","author":"Yi","year":"2020"},{"key":"B120","first-page":"115","article-title":"\u201cAbbreviated view of deepfake videos detection techniques,\u201d","volume-title":"Proceedings of the 2020 6th International Engineering Conference","author":"Younus","year":"2020"},{"key":"B121","unstructured":"YuP.\n            ChenJ.\n            FengX.\n            XiaZ.\n          CHEAT"},{"key":"B122","article-title":"CHEAT: A large-scale dataset for detecting ChatGPT-written abstracts","author":"Yu","year":"","journal-title":"arXiv:2304.12008"},{"key":"B123","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s11432-019-2757-1","article-title":"Perceptual image quality assessment: a survey","volume":"63","author":"Zhai","year":"2020","journal-title":"Sci. China Inf. Sci"},{"key":"B124","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2021-738","article-title":"\u201cAn initial investigation for detecting partially spoofed audio,\u201d","author":"Zhang","year":"","journal-title":"Proceedings of Interspeech 2021"},{"key":"B125","unstructured":"ZhangL.\n            WangX.\n            CooperE.\n            YamagishiJ.\n            PatinoJ.\n            EvansN.\n          PartialSpoof"},{"key":"B126","first-page":"67","article-title":"\u201cDeep learning in face synthesis: a survey on deepfakes,\u201d","volume-title":"Proceedings of the 2020 IEEE 3rd International Conference on Computer and Communication Engineering Technology","author":"Zhang","year":"2020"},{"key":"B127","first-page":"10140","article-title":"\u201cDatasetGAN: Efficient labeled data factory with minimal human effort,\u201d","volume-title":"Proceedings of the 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Zhang","year":"2021"},{"key":"B128","first-page":"70","article-title":"\u201cCelebA-Spoof: large-scale face anti-spoofing dataset with rich annotations,\u201d","volume-title":"Proceedings of the 2020 European Conference on Computer Vision","author":"Zhang","year":"2020"},{"key":"B129","doi-asserted-by":"publisher","first-page":"338","DOI":"10.1080\/15230406.2021.1910075","article-title":"Deep fake geography? When geospatial data encounter artificial intelligence","volume":"48","author":"Zhao","year":"2021","journal-title":"Cartogr. Geogr. Inf. Sci"},{"key":"B130","article-title":"EmoFake: an initial dataset for emotion fake audio detection","author":"Zhao","year":"","journal-title":"arXiv:2211.05363"},{"key":"B131","unstructured":"ZhaoY.\n            YiJ.\n            TaoJ.\n            WangC.\n            ZhangX.\n            DongY.\n          EmoFake: an initial dataset for emotion fake audio detection"},{"key":"B132","first-page":"1831","article-title":"\u201cTwo-stream neural networks for tampered face detection,\u201d","volume-title":"Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops","author":"Zhou","year":"2017"},{"key":"B133","first-page":"5774","article-title":"\u201cFace forensics in the wild,\u201d","volume-title":"Proceedings of the 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Zhou","year":"2021"},{"key":"B134","article-title":"\u201cMiniGPT-4: enhancing vision-language understanding with advanced large language models,\u201d","author":"Zhu","year":"2024","journal-title":"Proceedings of the 12th International Conference on Learning Representations"},{"key":"B135","first-page":"2242","article-title":"\u201cUnpaired image-to-image translation using cycle-consistent adversarial networks,\u201d","volume-title":"Proceedings of the 2017 IEEE International Conference on Computer Vision","author":"Zhu","year":"2017"},{"key":"B136","first-page":"2382","article-title":"\u201cWildDeepfake: a challenging real-world dataset for deepfake detection,\u201d","volume-title":"Proceedings of the 2020 28th ACM International Conference on Multimedia","author":"Zi","year":"2020"}],"container-title":["Frontiers in Big Data"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fdata.2024.1400024\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,9,4]],"date-time":"2024-09-04T05:09:10Z","timestamp":1725426550000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fdata.2024.1400024\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,9,4]]},"references-count":136,"alternative-id":["10.3389\/fdata.2024.1400024"],"URL":"https:\/\/doi.org\/10.3389\/fdata.2024.1400024","relation":{},"ISSN":["2624-909X"],"issn-type":[{"value":"2624-909X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,9,4]]},"article-number":"1400024"}}