{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,16]],"date-time":"2026-06-16T05:26:05Z","timestamp":1781587565246,"version":"3.54.5"},"reference-count":58,"publisher":"Springer Science and Business Media LLC","issue":"33","license":[{"start":{"date-parts":[[2025,3,25]],"date-time":"2025-03-25T00:00:00Z","timestamp":1742860800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,3,25]],"date-time":"2025-03-25T00:00:00Z","timestamp":1742860800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100018693","name":"HORIZON EUROPE Framework Programme","doi-asserted-by":"publisher","award":["101073928"],"award-info":[{"award-number":["101073928"]}],"id":[{"id":"10.13039\/100018693","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Multimed Tools Appl"],"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>The proliferation of deepfake technology poses significant challenges due to its potential for misuse in creating highly convincing manipulated videos. Deep learning (DL) techniques have emerged as powerful tools for analyzing and identifying subtle inconsistencies that distinguish genuine content from deepfakes. This paper introduces a novel approach for video deepfake detection that integrates 3D Morphable Models (3DMMs) with a hybrid CNN-LSTM-Transformer model, aimed at enhancing detection accuracy and efficiency. Our model leverages 3DMMs for detailed facial feature extraction, a CNN for fine-grained spatial analysis, an LSTM for short-term temporal dynamics, and a Transformer for capturing long-term dependencies in sequential data. This architecture effectively addresses critical challenges in current detection systems by handling both local and global temporal information. The proposed model employs an identity verification approach, comparing test videos with reference videos containing genuine footage of the individuals. Trained and validated on the VoxCeleb2 dataset, with further testing on three additional datasets, our model demonstrates superior performance to existing state-of-the-art methods, maintaining robustness across different video qualities, compression levels and manipulation types. Additionally, it operates efficiently in time-sensitive scenarios, significantly outperforming existing methods in inference speed. By relying solely on pristine, unmanipulated data for training, our approach enhances adaptability to new and sophisticated manipulations, setting a new benchmark for video deepfake detection technologies. This study not only advances the framework for detecting deepfakes but also underscores its potential for practical deployment in areas critical for digital forensics and media integrity.<\/jats:p>","DOI":"10.1007\/s11042-024-20548-6","type":"journal-article","created":{"date-parts":[[2025,3,27]],"date-time":"2025-03-27T19:22:51Z","timestamp":1743103371000},"page":"40617-40636","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":40,"title":["Video deepfake detection using a hybrid CNN-LSTM-Transformer model for identity verification"],"prefix":"10.1007","volume":"84","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3371-569X","authenticated-orcid":false,"given":"Georgios","family":"Petmezas","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Vazgken","family":"Vanian","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Konstantinos","family":"Konstantoudakis","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Elena E. I.","family":"Almaloglou","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Dimitris","family":"Zarpalas","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2025,3,25]]},"reference":[{"key":"20548_CR1","doi-asserted-by":"publisher","first-page":"910","DOI":"10.1109\/JSTSP.2020.3002101","volume":"14","author":"L Verdoliva","year":"2020","unstructured":"Verdoliva L (2020) Media Forensics and DeepFakes: An Overview. IEEE J Select Topics Signal Process 14:910\u2013932","journal-title":"IEEE J Select Topics Signal Process"},{"issue":"6","key":"20548_CR2","doi-asserted-by":"publisher","first-page":"607","DOI":"10.1049\/bme2.12031","volume":"10","author":"P Yu","year":"2021","unstructured":"Yu P, Xia Z, Fei J, Lu Y (2021) A survey on deepfake video detection. IET Biometrics 10(6):607\u2013624. https:\/\/doi.org\/10.1049\/bme2.12031","journal-title":"IET Biometrics"},{"key":"20548_CR3","doi-asserted-by":"publisher","first-page":"113368","DOI":"10.1016\/j.jbusres.2022.113368","volume":"154","author":"M Mustak","year":"2023","unstructured":"Mustak M, Salminen J, M\u00e4ntym\u00e4ki M, Rahman A, Dwivedi YK (2023) Deepfakes: Deceptions, mitigations, and opportunities. J Bus Res 154:113368. https:\/\/doi.org\/10.1016\/j.jbusres.2022.113368","journal-title":"J Bus Res"},{"key":"20548_CR4","doi-asserted-by":"publisher","first-page":"103525","DOI":"10.1016\/j.cviu.2022.103525","volume":"223","author":"TT Nguyen","year":"2022","unstructured":"Nguyen TT, Nguyen QVH, Nguyen DT, Nguyen DT, Huynh-The T, Nahavandi S, Nguy\u00ean TT, Pham Q, Nguyen CM (2022) Deep learning for deepfakes creation and detection: A survey. Comput Vis Image Underst 223:103525. https:\/\/doi.org\/10.1016\/j.cviu.2022.103525","journal-title":"Comput Vis Image Underst"},{"key":"20548_CR5","doi-asserted-by":"publisher","first-page":"25494","DOI":"10.1109\/access.2022.3154404","volume":"10","author":"MS Rana","year":"2022","unstructured":"Rana MS, Nobi MN, Murali B, Sung AH (2022) Deepfake Detection: A Systematic Literature Review. IEEE Access 10:25494\u201325513. https:\/\/doi.org\/10.1109\/access.2022.3154404","journal-title":"IEEE Access"},{"key":"20548_CR6","doi-asserted-by":"publisher","unstructured":"Srivastava, A., Pandey, M. K., & Sahu, S. K. (2022). A review on deepfakes detection using machine learning techniques. Lect Notes Elect Eng, 641\u2013651. https:\/\/doi.org\/10.1007\/978-981-19-5037-7_46","DOI":"10.1007\/978-981-19-5037-7_46"},{"key":"20548_CR7","doi-asserted-by":"publisher","first-page":"18757","DOI":"10.1109\/access.2022.3151186","volume":"10","author":"A Malik","year":"2022","unstructured":"Malik A, Kuribayashi M, Abdullahi SM, Khan AN (2022) DeepFake detection for human face images and Videos: a survey. IEEE Access 10:18757\u201318775. https:\/\/doi.org\/10.1109\/access.2022.3151186","journal-title":"IEEE Access"},{"issue":"10","key":"20548_CR8","doi-asserted-by":"publisher","first-page":"216","DOI":"10.3390\/computers12100216","volume":"12","author":"A Naitali","year":"2023","unstructured":"Naitali A, Ridouani M, Salahdine F, Kaabouch N (2023) Deepfake Attacks: generation, detection, datasets, challenges, and research directions. Computers 12(10):216. https:\/\/doi.org\/10.3390\/computers12100216","journal-title":"Computers"},{"issue":"05","key":"20548_CR9","doi-asserted-by":"publisher","first-page":"20","DOI":"10.4236\/jcc.2021.95003","volume":"09","author":"AM Almars","year":"2021","unstructured":"Almars AM (2021) DeepFakes Detection Techniques Using Deep Learning: A survey. J Comput Commun 09(05):20\u201335. https:\/\/doi.org\/10.4236\/jcc.2021.95003","journal-title":"J Comput Commun"},{"key":"20548_CR10","doi-asserted-by":"publisher","unstructured":"Heidari A, Navimipour NJ, Da\u011f H, & \u00dcnal M (2023) Deepfake detection using deep learning methods: A systematic and comprehensive review. Wiley Interdisciplinary Reviews. Data Mining and Knowledge Discovery\/Wiley Interdisciplinary Reviews. Data Min Knowl Dis, 14(2). https:\/\/doi.org\/10.1002\/widm.1520","DOI":"10.1002\/widm.1520"},{"key":"20548_CR11","unstructured":"Li Y, Lyu S (2018) Exposing deepfake videos by detecting face warping artifacts. arXiv (Cornell University), pp 46\u201352. https:\/\/arxiv.org\/pdf\/1811.00656.pdf"},{"key":"20548_CR12","doi-asserted-by":"publisher","unstructured":"G\u00fcera D, & Delp EJ (2018) Deepfake video detection using recurrent neural networks. 2018 15th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS). https:\/\/doi.org\/10.1109\/avss.2018.8639163","DOI":"10.1109\/avss.2018.8639163"},{"key":"20548_CR13","first-page":"1","volume":"2018","author":"D Afchar","year":"2018","unstructured":"Afchar D, Nozick V, Yamagishi J, Echizen I (2018) MesoNet: a Compact Facial Video Forgery Detection Network. IEEE Int Workshop Inform Forensic Secur (WIFS) 2018:1\u20137","journal-title":"IEEE Int Workshop Inform Forensic Secur (WIFS)"},{"key":"20548_CR14","doi-asserted-by":"publisher","unstructured":"Agarwal S, Farid H, El-Gaaly T, & Lim SN (2020) Detecting deep-fake videos from appearance and behavior. 2020 IEEE Int Workshop Inform Forensic Secur (WIFS). https:\/\/doi.org\/10.1109\/wifs49906.2020.9360904","DOI":"10.1109\/wifs49906.2020.9360904"},{"key":"20548_CR15","doi-asserted-by":"publisher","unstructured":"Wang Y, & Dantcheva A (2020) A video is worth more than 1000 lies. comparing 3DCNN approaches for detecting deepfakes. 2020 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2020). https:\/\/doi.org\/10.1109\/fg47880.2020.00089","DOI":"10.1109\/fg47880.2020.00089"},{"key":"20548_CR16","doi-asserted-by":"publisher","unstructured":"Thies J, Zollhofer M, Stamminger M, Theobalt C, & Niessner M (2016) Face2Face: Real-time face capture and reenactment of RGB videos. 2016 IEEE Conf Comput Vis Patt Recog (CVPR). https:\/\/doi.org\/10.1109\/cvpr.2016.262","DOI":"10.1109\/cvpr.2016.262"},{"key":"20548_CR17","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3306346.3323035","volume":"38","author":"J Thies","year":"2019","unstructured":"Thies J, Zollh\u00f6fer M, Nie\u00dfner M (2019) Deferred neural rendering. ACM Transact Graph (TOG) 38:1\u201312","journal-title":"ACM Transact Graph (TOG)"},{"key":"20548_CR18","unstructured":"Wodajo D, Atnafu S (2021) Deepfake Video detection using convolutional vision transformer. ArXiv, abs\/2102.11126."},{"key":"20548_CR19","unstructured":"Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T, Dehghani M, Minderer M, Heigold G, Gelly S, Uszkoreit J, Houlsby N (2020) An image is worth 16x16 words: transformers for image recognition at scale. ArXiv, abs\/2010.11929"},{"key":"20548_CR20","doi-asserted-by":"publisher","unstructured":"Mo H, Chen B, & Luo W (2018) Fake faces identification via Convolutional Neural Network. Proceed 6th ACM Workshop Inform Hiding Multimedia Secur. https:\/\/doi.org\/10.1145\/3206004.3206009","DOI":"10.1145\/3206004.3206009"},{"key":"20548_CR21","doi-asserted-by":"publisher","unstructured":"de Rezende ERS, Ruppert GCS, & Carvalho T (2017) Detecting computer generated images with deep convolutional neural networks. 2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI). https:\/\/doi.org\/10.1109\/sibgrapi.2017.16","DOI":"10.1109\/sibgrapi.2017.16"},{"key":"20548_CR22","doi-asserted-by":"publisher","unstructured":"\u015eeng\u00fcr A, Akhtar Z, Akbulut Y, Ekici S, & Budak U (2018) Deep feature extraction for face liveness detection. 2018 Int Conf Artif Intell Data Process (IDAP). https:\/\/doi.org\/10.1109\/idap.2018.8620804","DOI":"10.1109\/idap.2018.8620804"},{"issue":"1","key":"20548_CR23","doi-asserted-by":"publisher","first-page":"370","DOI":"10.3390\/app10010370","volume":"10","author":"CC Hsu","year":"2020","unstructured":"Hsu CC, Zhuang YX, Lee CY (2020) Deep fake image detection based on pairwise learning. Appl Sci 10(1):370. https:\/\/doi.org\/10.3390\/app10010370","journal-title":"Appl Sci"},{"key":"20548_CR24","doi-asserted-by":"publisher","unstructured":"Dong X, Bao J, Chen D, Zhang T, Zhang W, Yu N, Chen D, Wen F, & Guo B (2022) Protecting Celebrities from DeepFake with Identity Consistency Transformer. 2022 IEEE\/CVF Conf Comput Vis Patt Recog (CVPR). https:\/\/doi.org\/10.1109\/cvpr52688.2022.00925","DOI":"10.1109\/cvpr52688.2022.00925"},{"issue":"8","key":"20548_CR25","doi-asserted-by":"publisher","first-page":"128","DOI":"10.3390\/jimaging7080128","volume":"7","author":"O Giudice","year":"2021","unstructured":"Giudice O, Guarnera L, Battiato S (2021) Fighting deepfakes by detecting GAN DCT anomalies. J Imaging 7(8):128. https:\/\/doi.org\/10.3390\/jimaging7080128","journal-title":"J Imaging"},{"key":"20548_CR26","doi-asserted-by":"publisher","first-page":"2636","DOI":"10.1016\/j.procs.2023.01.237","volume":"218","author":"U Kosarkar","year":"2023","unstructured":"Kosarkar U, Sarkarkar G, Gedam S (2023) Revealing and Classification of Deepfakes Video\u2019s Images using a Customize Convolution Neural Network Model. Procedia Comput Sci 218:2636\u20132652. https:\/\/doi.org\/10.1016\/j.procs.2023.01.237","journal-title":"Procedia Comput Sci"},{"key":"20548_CR27","unstructured":"Wodajo D, Atnafu S, & Akhtar Z (2023) Deepfake Video Detection Using Generative Convolutional Vision Transformer. ArXiv, abs\/2307.07036"},{"key":"20548_CR28","unstructured":"Dolhansky B, Bitton J, Pflaum B, Lu J, Howes R, Wang M, Canton-Ferrer C (2020) The deepfake detection challenge dataset. ArXiv, abs\/2006.07397"},{"key":"20548_CR29","doi-asserted-by":"publisher","unstructured":"R\u00f6ssler A, Cozzolino D, Verdoliva L, Rie\u00df C, Thies J, & Nie\u00dfner M (2019) FaceForensics++: Learning to Detect Manipulated Facial Images. 2019 IEEE\/CVF International Conference on Computer Vision (ICCV). https:\/\/doi.org\/10.1109\/iccv.2019.00009","DOI":"10.1109\/iccv.2019.00009"},{"key":"20548_CR30","doi-asserted-by":"crossref","unstructured":"D'Avino D, Cozzolino D, Poggi G, Verdoliva L (2017) Autoencoder with recurrent neural networks for video forgery detection. Media Watermarking, Security, and Forensics","DOI":"10.2352\/ISSN.2470-1173.2017.7.MWSF-330"},{"key":"20548_CR31","doi-asserted-by":"publisher","unstructured":"Amerini I, Galteri, L, Caldelli R, & Del Bimbo A (2019) Deepfake video detection through optical flow based CNN. 2019 IEEE\/CVF Int Conf Comput Vis Workshop (ICCVW). https:\/\/doi.org\/10.1109\/iccvw.2019.00152","DOI":"10.1109\/iccvw.2019.00152"},{"key":"20548_CR32","doi-asserted-by":"publisher","unstructured":"Yang X, Li Y, & Lyu S (2019) Exposing deep fakes using inconsistent head poses. ICASSP 2019 - 2019 IEEE Int Conf Acoust, Speech Signal Process (ICASSP). https:\/\/doi.org\/10.1109\/icassp.2019.8683164","DOI":"10.1109\/icassp.2019.8683164"},{"key":"20548_CR33","doi-asserted-by":"publisher","unstructured":"Agarwal, Samaksh, Girdhar N, & Raghav H (2021) A novel neural model based framework for detection of gan generated fake images. 2021 11th International Conference on Cloud Computing, Data Science & Engineering (Confluence). https:\/\/doi.org\/10.1109\/confluence51648.2021.9377150","DOI":"10.1109\/confluence51648.2021.9377150"},{"key":"20548_CR34","unstructured":"Frank JC, Eisenhofer T, Sch\u00f6nherr L, Fischer A, Kolossa D, & Holz T (2020) Leveraging Frequency Analysis for Deep Fake Image Recognition. ArXiv, abs\/2003.08685"},{"issue":"4","key":"20548_CR35","doi-asserted-by":"publisher","first-page":"5276","DOI":"10.1609\/aaai.v37i4.25658","volume":"37","author":"L Tan","year":"2023","unstructured":"Tan L, Wang Y, Wang J, Yang L, Chen X, Guo Y (2023) Deepfake video detection via facial action dependencies estimation. Proceed AAAI Conf Artif Intell 37(4):5276\u20135284. https:\/\/doi.org\/10.1609\/aaai.v37i4.25658","journal-title":"Proceed AAAI Conf Artif Intell"},{"key":"20548_CR36","doi-asserted-by":"publisher","unstructured":"Cozzolino D, Rossler A, Thies J, Niesner M, & Verdoliva L (2021) Id-reveal: Identity-aware Deepfake Video detection. 2021 IEEE\/CVF Int Conf Comput Vis (ICCV). https:\/\/doi.org\/10.1109\/iccv48922.2021.01483","DOI":"10.1109\/iccv48922.2021.01483"},{"key":"20548_CR37","doi-asserted-by":"publisher","unstructured":"Cozzolino D, Pianese A, Nie\u00dfner M, & Verdoliva L (2023) Audio-visual person-of-interest deepfake detection. 2023 IEEE\/CVF Conf Comput Vis Patt Recog Workshops (CVPRW). https:\/\/doi.org\/10.1109\/cvprw59228.2023.00101","DOI":"10.1109\/cvprw59228.2023.00101"},{"key":"20548_CR38","doi-asserted-by":"publisher","unstructured":"Chung JS, Nagrani A, & Zisserman A (2018) VoxCeleb2: Deep Speaker Recognition. Proc Interspeech 2018. https:\/\/doi.org\/10.21437\/interspeech.2018-1929","DOI":"10.21437\/interspeech.2018-1929"},{"key":"20548_CR39","unstructured":"Dufour N, Gully A, Karlsson P, Vorbyov AV, Leung T, Childs J, Bregler C (2019) Deepfakes detection dataset"},{"key":"20548_CR40","doi-asserted-by":"publisher","unstructured":"Li Y, Yang X, Sun P, Qi H, & Lyu S (2020) Celeb-DF: A Large-Scale Challenging Dataset for DeepFake Forensics. 2020 IEEE\/CVF Conf Comput Vis Patt Recog (CVPR). https:\/\/doi.org\/10.1109\/cvpr42600.2020.00327","DOI":"10.1109\/cvpr42600.2020.00327"},{"key":"20548_CR41","doi-asserted-by":"publisher","unstructured":"Deng J, Guo J, Ververas E, Kotsia I, & Zafeiriou S (2020) RetinaFace: Single-Shot Multi-Level Face Localisation in the Wild. 2020 IEEE\/CVF Conf Comput Vis Patt Recog (CVPR). https:\/\/doi.org\/10.1109\/cvpr42600.2020.00525","DOI":"10.1109\/cvpr42600.2020.00525"},{"key":"20548_CR42","doi-asserted-by":"publisher","unstructured":"Blanz V, & Vetter T (1999) A morphable model for the synthesis of 3D faces. SIGGRAPH \u201999: Proceedings of the 26th Annual Conference on Computer Graphics and Interactive Techniques. https:\/\/doi.org\/10.1145\/311535.311556","DOI":"10.1145\/311535.311556"},{"issue":"6","key":"20548_CR43","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3130800.3130813","volume":"36","author":"T Li","year":"2017","unstructured":"Li T, Bolkart T, Black MJ, Li H, Romero J (2017) Learning a model of facial shape and expression from 4D scans. ACM Trans Graph 36(6):1\u201317. https:\/\/doi.org\/10.1145\/3130800.3130813","journal-title":"ACM Trans Graph"},{"key":"20548_CR44","doi-asserted-by":"publisher","first-page":"7755","DOI":"10.1109\/cvpr.2019.00795","volume":"2019","author":"S Sanyal","year":"2019","unstructured":"Sanyal S, Bolkart T, Feng H, Black MJ (2019) Learning to Regress 3D Face Shape and Expression From an Image Without 3D Supervision. IEEE\/CVF Conf Comput Vis Patt Recog (CVPR) 2019:7755\u20137764. https:\/\/doi.org\/10.1109\/cvpr.2019.00795","journal-title":"IEEE\/CVF Conf Comput Vis Patt Recog (CVPR)"},{"issue":"4","key":"20548_CR45","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3450626.3459936","volume":"40","author":"Y Feng","year":"2021","unstructured":"Feng Y, Feng H, Black MJ, Bolkart T (2021) Learning an animatable detailed 3D face model from in-the-wild images. ACM Trans Graph 40(4):1\u201313. https:\/\/doi.org\/10.1145\/3450626.3459936","journal-title":"ACM Trans Graph"},{"key":"20548_CR46","doi-asserted-by":"publisher","unstructured":"Guo J, Zhu X, Yang Y, Yang F, Lei Z, & Li SZ (2020) Towards fast, accurate and stable 3D dense face alignment. In Lecture notes in computer science (pp. 152\u2013168). https:\/\/doi.org\/10.1007\/978-3-030-58529-7_10","DOI":"10.1007\/978-3-030-58529-7_10"},{"issue":"4","key":"20548_CR47","doi-asserted-by":"publisher","first-page":"245","DOI":"10.1049\/ip-vis:19941301","volume":"141","author":"R Vaillant","year":"1994","unstructured":"Vaillant R, Monrocq C, Cun YL (1994) Original approach for the localisation of objects in images. IEE Proceed Vis Image Signal Process 141(4):245. https:\/\/doi.org\/10.1049\/ip-vis:19941301","journal-title":"IEE Proceed Vis Image Signal Process"},{"issue":"8","key":"20548_CR48","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S Hochreiter","year":"1997","unstructured":"Hochreiter S, Schmidhuber J (1997) Long Short-Term memory. Neural Comput 9(8):1735\u20131780. https:\/\/doi.org\/10.1162\/neco.1997.9.8.1735","journal-title":"Neural Comput"},{"key":"20548_CR49","doi-asserted-by":"publisher","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser \u0141, & Polosukhin I (2017) Attention is all you need. arXiv (Cornell University). https:\/\/doi.org\/10.48550\/arxiv.1706.03762","DOI":"10.48550\/arxiv.1706.03762"},{"key":"20548_CR50","doi-asserted-by":"publisher","first-page":"111","DOI":"10.1016\/j.aiopen.2022.10.001","volume":"3","author":"T Lin","year":"2022","unstructured":"Lin T, Wang Y, Li X, Qiu X (2022) A survey of transformers. AI Open 3:111\u2013132. https:\/\/doi.org\/10.1016\/j.aiopen.2022.10.001","journal-title":"AI Open"},{"key":"20548_CR51","doi-asserted-by":"publisher","first-page":"1335","DOI":"10.1109\/tifs.2023.3239223","volume":"18","author":"C Zhao","year":"2023","unstructured":"Zhao C, Wang C, Hu G, Chen H, Liu C, Tang J (2023) ISTVT: Interpretable Spatial-Temporal Video Transformer for Deepfake Detection. IEEE Trans Inf Forensics Secur 18:1335\u20131348. https:\/\/doi.org\/10.1109\/tifs.2023.3239223","journal-title":"IEEE Trans Inf Forensics Secur"},{"key":"20548_CR52","doi-asserted-by":"publisher","unstructured":"Selva J, Johansen AS, Escalera S, Nasrollahi K, Moeslund TB, & Clap\u00e9s A (2023) Video Transformers: a survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1\u201320. https:\/\/doi.org\/10.1109\/tpami.2023.3243465","DOI":"10.1109\/tpami.2023.3243465"},{"key":"20548_CR53","unstructured":"Loshchilov I, Hutter F (2016) SGDR: stochastic gradient descent with warm restarts. In: 5th International Conference on Learning Representations (ILCR 2017)"},{"key":"20548_CR54","doi-asserted-by":"publisher","unstructured":"Afchar D, Nozick V, Yamagishi J, & Echizen I (2018) MesoNet: a Compact Facial Video Forgery Detection Network. 2018 IEEE Int Workshop Inform Forensic Secur (WIFS). https:\/\/doi.org\/10.1109\/wifs.2018.8630761","DOI":"10.1109\/wifs.2018.8630761"},{"key":"20548_CR55","doi-asserted-by":"publisher","unstructured":"Chollet F (2017) Xception: Deep Learning with Depthwise Separable Convolutions. 2017 IEEE Confer Comput Vis Patt Recog (CVPR). https:\/\/doi.org\/10.1109\/cvpr.2017.195","DOI":"10.1109\/cvpr.2017.195"},{"key":"20548_CR56","first-page":"6105","volume":"2019","author":"M Tan","year":"2019","unstructured":"Tan M, Le Q (2019) EfficientNet: Rethinking model scaling for convolutional neural networks. Int Conf Machine Learn 2019:6105\u20136114","journal-title":"Int Conf Machine Learn"},{"key":"20548_CR57","doi-asserted-by":"publisher","unstructured":"Dang H, Liu F, Stehouwer J, Liu X, & Jain AK (2020) On the Detection of Digital Face Manipulation. 2020 IEEE\/CVF Conf Comput Vis Patt Recog (CVPR). https:\/\/doi.org\/10.1109\/cvpr42600.2020.00582","DOI":"10.1109\/cvpr42600.2020.00582"},{"key":"20548_CR58","doi-asserted-by":"publisher","unstructured":"Bonettini N, Cannas ED, Mandelli S, Bondi L, Bestagini P, & Tubaro S (2021) Video Face Manipulation Detection Through Ensemble of CNNs. 2020 25th Int Conf Patt Recog (ICPR). https:\/\/doi.org\/10.1109\/icpr48806.2021.9412711","DOI":"10.1109\/icpr48806.2021.9412711"}],"container-title":["Multimedia Tools and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11042-024-20548-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11042-024-20548-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11042-024-20548-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,27]],"date-time":"2025-09-27T11:56:05Z","timestamp":1758974165000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11042-024-20548-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,3,25]]},"references-count":58,"journal-issue":{"issue":"33","published-online":{"date-parts":[[2025,10]]}},"alternative-id":["20548"],"URL":"https:\/\/doi.org\/10.1007\/s11042-024-20548-6","relation":{},"ISSN":["1573-7721"],"issn-type":[{"value":"1573-7721","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,3,25]]},"assertion":[{"value":"21 June 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 October 2024","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 December 2024","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 March 2025","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors have no competing interests to declare that are relevant to the content of this article.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interest"}}]}}