{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,15]],"date-time":"2026-06-15T16:03:26Z","timestamp":1781539406777,"version":"3.54.5"},"reference-count":196,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T00:00:00Z","timestamp":1759881600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T00:00:00Z","timestamp":1759881600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>In recent years, deep learning based generative models, particularly Generative Adversarial Networks (GANs), Variational Autoencoders (VAEs), and Diffusion Models (DMs), have been instrumental in generating diverse, high-quality content across various domains, such as image and video synthesis. This capability has led to widespread adoption of these models and has captured strong public interest. As they continue to advance at a rapid pace, the growing volume of research, expanding application areas, and unresolved technical challenges make it increasingly difficult to stay current. To address this need, this survey introduces a comprehensive taxonomy that organizes the literature and provides a cohesive framework for understanding the development of GANs, VAEs, and DMs, including their many variants and combined approaches. We highlight key innovations that have improved the quality, diversity, and controllability of generated outputs, reflecting the expanding potential of generative artificial intelligence. In addition to summarizing technical progress, we examine rising ethical concerns, including the risks of misuse and the broader societal impact of synthetic media. Finally, we outline persistent challenges and propose future research directions, offering a structured and forward looking perspective for researchers in this fast evolving field.<\/jats:p>","DOI":"10.1186\/s40537-025-01247-x","type":"journal-article","created":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T09:16:33Z","timestamp":1759914993000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":17,"title":["Generative AI in depth: A survey of recent advances, model variants, and real-world applications"],"prefix":"10.1186","volume":"12","author":[{"given":"Shamim","family":"Yazdani","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Akansha","family":"Singh","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Nripsuta","family":"Saxena","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Zichong","family":"Wang","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Avash","family":"Palikhe","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Deng","family":"Pan","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Umapada","family":"Pal","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jie","family":"Yang","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Wenbin","family":"Zhang","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2025,10,8]]},"reference":[{"key":"1247_CR1","unstructured":"Gozalo-Brizuela R, Garrido-Merch\u00e1n EC. A survey of generative ai applications. arXiv preprint; 2023. arXiv:2306.02781"},{"issue":"5","key":"1247_CR2","doi-asserted-by":"publisher","first-page":"839","DOI":"10.1109\/JPROC.2021.3049196","volume":"109","author":"M-Y Liu","year":"2021","unstructured":"Liu M-Y, Huang X, Yu J, Wang T-C, Mallya A. Generative adversarial networks for image and video synthesis: algorithms and applications. Proc IEEE. 2021;109(5):839\u201362.","journal-title":"Proc IEEE"},{"key":"1247_CR3","doi-asserted-by":"crossref","unstructured":"Jin H, Wei W, Wang X, Zhang W, Wu Y. Rethinking learning rate tuning in the era of large language models; 2023. arXiv preprint arXiv:2309.08859","DOI":"10.1109\/CogMI58952.2023.00025"},{"key":"1247_CR4","unstructured":"Amon A, Yin Z, Wang Z, Avash P, Zhang W. Uncertain boundaries: multidisciplinary approaches to copyright issues in generative AI; 2024. arXiv preprint arXiv:2404.08221"},{"key":"1247_CR5","unstructured":"Elgammal A, Liu B, Elhoseiny M, Mazzone M. Can: creative adversarial networks, generating\" art\" by learning about styles and deviating from style norms; 2017. arXiv preprint arXiv:1706.07068"},{"key":"1247_CR6","unstructured":"Li Y, Fang C, Yang J, Wang Z, Lu X, Yang M-H. Universal style transfer via feature transforms. Advances in neural information processing systems; 2017."},{"key":"1247_CR7","doi-asserted-by":"publisher","unstructured":"Ehtesham A, Kumar S, Singh A, Khoei TT. Movie gen: Swot analysis of meta\u2019s generative AI foundation model for transforming media generation, advertising, and entertainment industries. In: 2025 IEEE 15th annual computing and communication workshop and conference (CCWC); 2025. p. 00189\u201395 . https:\/\/doi.org\/10.1109\/CCWC62904.2025.10903780","DOI":"10.1109\/CCWC62904.2025.10903780"},{"key":"1247_CR8","doi-asserted-by":"crossref","unstructured":"Liu R, Nageotte F, Zanne P, Mathelin M, Dresp-Langley B. Deep reinforcement learning for the control of robotic manipulation: a focussed mini-review. Robotics 2021, 10, 22. s Note: MDPI stays neutral with regard to jurisdictional clai-ms in; 2021","DOI":"10.3390\/robotics10010022"},{"key":"1247_CR9","doi-asserted-by":"crossref","unstructured":"Gatys LA, Ecker AS, Bethge M. Image style transfer using convolutional neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition; 2016. p. 2414\u201323.","DOI":"10.1109\/CVPR.2016.265"},{"key":"1247_CR10","unstructured":"Pinaya WH, Graham MS, Kerfoot E, Tudosiu P-D, Dafflon J, Fernandez V, Sanchez P, Wolleb J, Costa PF, Patel A, et al. Generative AI for medical imaging: extending the monai framework; 2023. arXiv preprint arXiv:2307.15208"},{"key":"1247_CR11","unstructured":"Jing B, Erives E, Pao-Huang P, Corso G, Berger B, Jaakkola T. Eigenfold: generative protein structure prediction with diffusion models; 2023. arXiv preprint arXiv:2304.02198"},{"key":"1247_CR12","unstructured":"Shokrollahi Y, Yarmohammadtoosky S, Nikahd MM, Dong P, Li X, Gu L. A comprehensive review of generative ai in healthcare; 2023. arXiv preprint arXiv:2310.00795"},{"key":"1247_CR13","doi-asserted-by":"crossref","unstructured":"Ktena I, Wiles O, Albuquerque I, Rebuffi S-A, Tanno R, Roy AG, Azizi S, Belgrave D, Kohli P, Karthikesalingam A, et al. Generative models improve fairness of medical classifiers under distribution shifts; 2023. arXiv preprint arXiv:2304.09218","DOI":"10.21203\/rs.3.rs-2976332\/v1"},{"key":"1247_CR14","unstructured":"Perez L, Wang J. The effectiveness of data augmentation in image classification using deep learning; 2017. arXiv preprint arXiv:1712.04621"},{"key":"1247_CR15","unstructured":"Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y. Generative adversarial nets. Advances in neural information processing systems; 2014."},{"key":"1247_CR16","unstructured":"Kingma DP, Welling M. Auto-encoding variational Bayes; 2013. arXiv preprint arXiv:1312.6114"},{"key":"1247_CR17","unstructured":"Rezende DJ, Mohamed S, Wierstra D. Stochastic backpropagation and approximate inference in deep generative models. In: International conference on machine learning; 2014. p. 1278\u201386. PMLR"},{"key":"1247_CR18","doi-asserted-by":"crossref","unstructured":"Karras T, Aittala M, Lehtinen J, Hellsten J, Aila T, Laine S. Analyzing and improving the training dynamics of diffusion models; 2023. arXiv preprint arXiv:2312.02696","DOI":"10.1109\/CVPR52733.2024.02282"},{"key":"1247_CR19","unstructured":"Wu Z, Zhou P, Kawaguchi K, Zhang H. Fast diffusion model; 2023. arXiv preprint arXiv:2306.06991"},{"issue":"10","key":"1247_CR20","doi-asserted-by":"publisher","first-page":"1469","DOI":"10.3390\/e25101469","volume":"25","author":"R Yang","year":"2023","unstructured":"Yang R, Srivastava P, Mandt S. Diffusion probabilistic modeling for video generation. Entropy. 2023;25(10):1469.","journal-title":"Entropy"},{"key":"1247_CR21","doi-asserted-by":"publisher","first-page":"10850","DOI":"10.1109\/TPAMI.2023.3261988","volume":"45","author":"F-A Croitoru","year":"2023","unstructured":"Croitoru F-A, Hondru V, Ionescu RT, Shah M. Diffusion models in vision: a survey. IEEE Trans Pattern Anal Mach Intell. 2023;45:10850\u201369.","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"1247_CR22","first-page":"17695","volume":"34","author":"V De Bortoli","year":"2021","unstructured":"De Bortoli V, Thornton J, Heng J, Doucet A. Diffusion schr\u00f6dinger bridge with applications to score-based generative modeling. Adv Neural Inf Process Syst. 2021;34:17695\u2013709.","journal-title":"Adv Neural Inf Process Syst"},{"key":"1247_CR23","doi-asserted-by":"crossref","unstructured":"Bengesi S, El-Sayed H, Sarker MK, Houkpati Y, Irungu J, Oladunni T. Advancements in generative AI: a comprehensive review of gans, gpt, autoencoders, diffusion model, and transformers; 2023. arXiv preprint arXiv:2311.10242","DOI":"10.1109\/ACCESS.2024.3397775"},{"key":"1247_CR24","doi-asserted-by":"crossref","unstructured":"Ali S, DiPaola D, Breazeal C. What are gans?: introducing generative adversarial networks to middle school students. In: Proceedings of the AAAI conference on artificial intelligence, vol 35; 2021. p. 15472\u201379","DOI":"10.1609\/aaai.v35i17.17821"},{"key":"1247_CR25","doi-asserted-by":"crossref","unstructured":"Isola P, Zhu J-Y, Zhou T, Efros AA. Image-to-image translation with conditional adversarial networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition; 2017. p. 1125\u201334.","DOI":"10.1109\/CVPR.2017.632"},{"key":"1247_CR26","doi-asserted-by":"publisher","first-page":"1810","DOI":"10.1007\/s11263-020-01301-6","volume":"128","author":"Y Zhang","year":"2020","unstructured":"Zhang Y, Bai Y, Ding M, Ghanem B. Multi-task generative adversarial network for detecting small objects in the wild. Int J Comput Vision. 2020;128:1810\u201328.","journal-title":"Int J Comput Vision"},{"key":"1247_CR27","doi-asserted-by":"crossref","unstructured":"Xu W, Long C, Wang R, Wang G. Drb-gan: a dynamic resblock generative adversarial network for artistic style transfer. In: Proceedings of the IEEE\/CVF international conference on computer vision; 2021. p. 6383\u201392.","DOI":"10.1109\/ICCV48922.2021.00632"},{"key":"1247_CR28","doi-asserted-by":"publisher","first-page":"521","DOI":"10.1016\/j.neunet.2020.09.019","volume":"132","author":"Y Nishimura","year":"2020","unstructured":"Nishimura Y, Nakamura Y, Ishiguro H. Human interaction behavior modeling using generative adversarial networks. Neural Netw. 2020;132:521\u201331.","journal-title":"Neural Netw"},{"key":"1247_CR29","unstructured":"Gregor K, Danihelka I, Graves A, Rezende D, Wierstra D. Draw: a recurrent neural network for image generation. In: International conference on machine learning. PMLR; 2015. p. 1462\u201371."},{"key":"1247_CR30","unstructured":"Yan W, Zhang Y, Abbeel P, Srinivas A. Videogpt: Video generation using VQ-VAE and transformers; 2021. arXiv preprint arXiv:2104.10157"},{"key":"1247_CR31","doi-asserted-by":"publisher","first-page":"1420","DOI":"10.1109\/TETCI.2023.3298535","volume":"7","author":"S Cheng","year":"2023","unstructured":"Cheng S, Guo Y, Arcucci R. A generative model for surrogates of spatial-temporal wildfire nowcasting. IEEE Trans Emerg Top Comput Intell. 2023;7:1420\u201330.","journal-title":"IEEE Trans Emerg Top Comput Intell"},{"key":"1247_CR32","doi-asserted-by":"publisher","DOI":"10.1016\/j.media.2023.102872","volume":"88","author":"A G\u00fcng\u00f6r","year":"2023","unstructured":"G\u00fcng\u00f6r A, Dar SU, \u00d6zt\u00fcrk \u015e, Korkmaz Y, Bedel HA, Elmas G, Ozbey M, \u00c7ukur T. Adaptive diffusion priors for accelerated MRI reconstruction. Med Image Anal. 2023;88: 102872.","journal-title":"Med Image Anal"},{"key":"1247_CR33","unstructured":"Hu A, Russell L, Yeo H, Murez Z, Fedoseev G, Kendall A, Shotton J, Corrado G. Gaia-1: a generative world model for autonomous driving; 2023. arXiv preprint arXiv:2309.17080"},{"key":"1247_CR34","unstructured":"Arjovsky M, Chintala S, Bottou L. Wasserstein generative adversarial networks. In: International Conference on Machine Learning. PMLR; 2017. p. 214\u201323."},{"key":"1247_CR35","unstructured":"Dosovitskiy A, Brox T. Generating images with perceptual similarity metrics based on deep networks. Advances in neural information processing systems, vol. 29; 2016."},{"key":"1247_CR36","unstructured":"Larsen ABL, S\u00f8nderby SK, Larochelle H, Winther O. Autoencoding beyond pixels using a learned similarity metric. In: International Conference on Machine Learning. PMLR; 2016. p. 1558\u201366."},{"key":"1247_CR37","unstructured":"Radford A, Metz L, Chintala S. Unsupervised representation learning with deep convolutional generative adversarial networks; 2015. arXiv preprint arXiv:1511.06434"},{"key":"1247_CR38","unstructured":"Zhao S, Song J, Ermon S. Infovae: Information maximizing variational autoencoders; 2017. arXiv preprint arXiv:1706.02262"},{"key":"1247_CR39","unstructured":"Cao H, Wang J, Ren T, Qi X, Chen Y, Yao Y, Zhang L. Exploring vision transformers as diffusion learners; 2022. arXiv preprint arXiv:2212.13771"},{"key":"1247_CR40","doi-asserted-by":"crossref","unstructured":"Cordts M, Omran M, Ramos S, Rehfeld T, Enzweiler M, Benenson R, Franke U, Roth S, Schiele B. The cityscapes dataset for semantic urban scene understanding. In: Proceedings of the IEEE conference on computer vision and pattern recognition; 2016. p. 3213\u201323.","DOI":"10.1109\/CVPR.2016.350"},{"key":"1247_CR41","doi-asserted-by":"crossref","unstructured":"Tyle\u010dek R, \u0160\u00e1ra R. Spatial pattern templates for recognition of objects with regular structure. In: Pattern recognition: 35th German conference, GCPR 2013, Saarbr\u00fccken, Germany, September 3-6, 2013. Proceedings 35, 2013. Springer; 2013. p. 364\u201374.","DOI":"10.1007\/978-3-642-40602-7_39"},{"key":"1247_CR42","doi-asserted-by":"crossref","unstructured":"Xie S, Tu Z. Holistically-nested edge detection. In: Proceedings of the IEEE International Conference on Computer Vision, 2015; 1395\u2013403.","DOI":"10.1109\/ICCV.2015.164"},{"key":"1247_CR43","doi-asserted-by":"publisher","first-page":"211","DOI":"10.1007\/s11263-015-0816-y","volume":"115","author":"O Russakovsky","year":"2015","unstructured":"Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M, et al. Imagenet large scale visual recognition challenge. Int J Comput Vision. 2015;115:211\u201352.","journal-title":"Int J Comput Vision"},{"key":"1247_CR44","doi-asserted-by":"crossref","unstructured":"Yu A, Grauman K. Fine-grained visual comparisons with local learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition; 2014. p. 192\u20139.","DOI":"10.1109\/CVPR.2014.32"},{"key":"1247_CR45","doi-asserted-by":"crossref","unstructured":"Zhu J-Y, Kr\u00e4henb\u00fchl P, Shechtman E, Efros AA. Generative visual manipulation on the natural image manifold. In: Computer vision\u2013ECCV 2016: 14th European conference, Amsterdam, The Netherlands, October 11\u201314, 2016, proceedings, Part V 14. Springer; 2016. p. 597\u2013613.","DOI":"10.1007\/978-3-319-46454-1_36"},{"issue":"4","key":"1247_CR46","first-page":"1","volume":"31","author":"M Eitz","year":"2012","unstructured":"Eitz M, Hays J, Alexa M. How do humans sketch objects? ACM Trans Graph TOG. 2012;31(4):1\u201310.","journal-title":"ACM Trans Graph TOG"},{"issue":"4","key":"1247_CR47","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/2601097.2601101","volume":"33","author":"P-Y Laffont","year":"2014","unstructured":"Laffont P-Y, Ren Z, Tao X, Qian C, Hays J. Transient attributes for high-level understanding and editing of outdoor scenes. ACM Trans Graph TOG. 2014;33(4):1\u201311.","journal-title":"ACM Trans Graph TOG"},{"key":"1247_CR48","doi-asserted-by":"crossref","unstructured":"Lin T-Y, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Doll\u00e1r P, Zitnick CL. Microsoft coco: Common objects in context. In: Computer vision\u2013ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6\u201312, 2014, proceedings, Part V 13. Springer; 2014. p. 740\u201355.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"1247_CR49","doi-asserted-by":"crossref","unstructured":"Yang S, Luo P, Loy C-C, Tang X. Wider face: a face detection benchmark. In: Proceedings of the IEEE conference on computer vision and pattern recognition; 2016. p. 5525\u201333.","DOI":"10.1109\/CVPR.2016.596"},{"key":"1247_CR50","doi-asserted-by":"crossref","unstructured":"Lugmayr A, Danelljan M, Romero A, Yu F, Timofte R, Van\u00a0Gool L. Repaint: inpainting using denoising diffusion probabilistic models. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition; 2022. p. 11461\u201371.","DOI":"10.1109\/CVPR52688.2022.01117"},{"key":"1247_CR51","doi-asserted-by":"crossref","unstructured":"Avrahami O, Lischinski D, Fried O. Blended diffusion for text-driven editing of natural images. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition; 2022. p. 18208\u201318.","DOI":"10.1109\/CVPR52688.2022.01767"},{"key":"1247_CR52","unstructured":"Nichol A, Dhariwal P, Ramesh A, Shyam P, Mishkin P, McGrew B, Sutskever I, Chen M. Glide: towards photorealistic image generation and editing with text-guided diffusion models; 2021. arXiv preprint arXiv:2112.10741"},{"key":"1247_CR53","first-page":"27953","volume":"35","author":"W Harvey","year":"2022","unstructured":"Harvey W, Naderiparizi S, Masrani V, Weilbach C, Wood F. Flexible diffusion modeling of long videos. Adv Neural Inf Process Syst. 2022;35:27953\u201365.","journal-title":"Adv Neural Inf Process Syst"},{"key":"1247_CR54","unstructured":"Ho J, Salimans T, Gritsenko AA, Chan W, Norouzi M, Fleet DJ. Video diffusion models. In: Proceedings of DGM4HSD; 2022"},{"key":"1247_CR55","doi-asserted-by":"crossref","unstructured":"Zhou L, Du Y, Wu J. 3d shape generation and completion through point-voxel diffusion. In: Proceedings of the IEEE\/CVF international conference on computer vision; 2021. p. 5826\u201335.","DOI":"10.1109\/ICCV48922.2021.00577"},{"key":"1247_CR56","doi-asserted-by":"crossref","unstructured":"Huang N, Tang F, Dong W, Xu C. Draw your art dream: Diverse digital art synthesis with multimodal guided diffusion. In: Proceedings of the 30th ACM international conference on multimedia; 2022. p. 1085\u201394.","DOI":"10.1145\/3503161.3548282"},{"issue":"2","key":"1247_CR57","doi-asserted-by":"publisher","first-page":"64","DOI":"10.1145\/2812802","volume":"59","author":"B Thomee","year":"2016","unstructured":"Thomee B, Shamma DA, Friedland G, Elizalde B, Ni K, Poland D, Borth D, Li L-J. Yfcc100m: the new data in multimedia research. Commun ACM. 2016;59(2):64\u201373.","journal-title":"Commun ACM"},{"key":"1247_CR58","unstructured":"Huang N, Zhang Y, Tang F, Ma C, Huang H, Zhang Y, Dong W, Xu C. Diffstyler: controllable dual diffusion for text-driven image stylization; 2022. arXiv preprint arXiv:2211.10682"},{"key":"1247_CR59","doi-asserted-by":"crossref","unstructured":"Changpinyo S, Sharma P, Ding N, Soricut R. Conceptual 12m: pushing web-scale image-text pre-training to recognize long-tail visual concepts. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition; 2021. p. 3558\u201368.","DOI":"10.1109\/CVPR46437.2021.00356"},{"key":"1247_CR60","unstructured":"Crowson K. v-diffusion-pytorch; 2023. https:\/\/github.com\/crowsonkb\/v-diffusion-pytorch"},{"key":"1247_CR61","unstructured":"WikiArt volunteer team: WikiArt dataset. https:\/\/www.wikiart.org\/"},{"key":"1247_CR62","unstructured":"LeCun Y, Cortes C, Burges C. The mnist database. Courant Institute, NYU; 1998. http:\/\/yann.lecun.com\/exdb\/mnist\/"},{"key":"1247_CR63","unstructured":"Netzer Y, Wang T, Coates A, Bissacco A, Wu B, Ng AY. Reading digits in natural images with unsupervised feature learning. NIPS workshop on deep learning and unsupervised feature learning; 2011. http:\/\/ufldl.stanford.edu\/housenumbers\/"},{"key":"1247_CR64","unstructured":"Krizhevsky A, Hinton G. Learning multiple layers of features from tiny images. Technical report, Citeseer; 2009. https:\/\/www.cs.toronto.edu\/~kriz\/cifar.html"},{"key":"1247_CR65","unstructured":"Zhou B, Lapedriza A, Xiao J, Torralba A, Oliva A. Learning deep features for scene recognition using places database. Advances in neural information processing systems; 2014. p. 27."},{"key":"1247_CR66","unstructured":"Kumar A, Irsoy O, Ondruska P, Iyyer M, Bradbury J, Gulrajani I, Zhong V, Paulus R, Socher R. Ask me anything: dynamic memory networks for natural language processing. In: International conference on machine learning. PMLR; 2016. p. 1378\u201387."},{"key":"1247_CR67","unstructured":"OpenAI R. Gpt-4 technical report. arxiv 2303.08774. View in Article 2; 2023. p. 3."},{"issue":"1","key":"1247_CR68","doi-asserted-by":"publisher","first-page":"317","DOI":"10.1038\/s41597-019-0322-0","volume":"6","author":"AE Johnson","year":"2019","unstructured":"Johnson AE, Pollard TJ, Berkowitz SJ, Greenbaum NR, Lungren MP, Deng C-Y, Mark RG, Horng S. Mimic-cxr, a de-identified publicly available database of chest radiographs with free-text reports. Sci Data. 2019;6(1):317.","journal-title":"Sci Data"},{"key":"1247_CR69","unstructured":"Song J, Meng C, Ermon S. Denoising diffusion implicit models; 2020. arXiv preprint arXiv:2010.02502"},{"issue":"3","key":"1247_CR70","doi-asserted-by":"publisher","first-page":"1001779","DOI":"10.1371\/journal.pmed.1001779","volume":"12","author":"C Sudlow","year":"2015","unstructured":"Sudlow C, Gallacher J, Allen N, Beral V, Burton P, Danesh J, Downey P, Elliott P, Green J, Landray M, et al. Uk biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12(3):1001779.","journal-title":"PLoS Med"},{"key":"1247_CR71","unstructured":"Kermany D, Zhang K, Goldbaum M. Large dataset of labeled optical coherence tomography (oct) and chest x-ray images. Mendeley Data; 2018. 3(10.17632)"},{"key":"1247_CR72","unstructured":"IXI Dataset. http:\/\/brain-development.org\/ixi-dataset\/."},{"issue":"1","key":"1247_CR73","doi-asserted-by":"publisher","DOI":"10.1148\/ryai.2020190007","volume":"2","author":"F Knoll","year":"2020","unstructured":"Knoll F, Zbontar J, Sriram A, Muckley MJ, Bruno M, Defazio A, Parente M, Geras KJ, Katsnelson J, Chandarana H, et al. FASTMRI: a publicly available raw k-space and DICOM dataset of knee images for accelerated MR image reconstruction using machine learning. Radiol Artif Intell. 2020;2(1): 190007.","journal-title":"Radiol Artif Intell"},{"issue":"6","key":"1247_CR74","doi-asserted-by":"publisher","first-page":"4353","DOI":"10.1002\/pro.4353","volume":"31","author":"D Chakravarty","year":"2022","unstructured":"Chakravarty D, Porter LL. Alphafold2 fails to predict protein fold switching. Protein Sci. 2022;31(6):4353.","journal-title":"Protein Sci"},{"issue":"10","key":"1247_CR75","doi-asserted-by":"publisher","first-page":"2742","DOI":"10.1093\/bioinformatics\/btac202","volume":"38","author":"T Salda\u00f1o","year":"2022","unstructured":"Salda\u00f1o T, Escobedo N, Marchetti J, Zea DJ, Mac Donagh J, Velez Rueda AJ, Gonik E, Garc\u00eda Melani A, Novomisky Nechcoff J, Salas MN, et al. Impact of protein conformational diversity on alphafold predictions. Bioinformatics. 2022;38(10):2742\u20138.","journal-title":"Bioinformatics"},{"key":"1247_CR76","doi-asserted-by":"crossref","unstructured":"Antoniou A, Storkey A, Edwards H. Data augmentation generative adversarial networks; 2017. arXiv preprint arXiv:1711.04340","DOI":"10.1007\/978-3-030-01424-7_58"},{"issue":"6266","key":"1247_CR77","doi-asserted-by":"publisher","first-page":"1332","DOI":"10.1126\/science.aab3050","volume":"350","author":"BM Lake","year":"2015","unstructured":"Lake BM, Salakhutdinov R, Tenenbaum JB. Human-level concept learning through probabilistic program induction. Science. 2015;350(6266):1332\u20138.","journal-title":"Science"},{"key":"1247_CR78","doi-asserted-by":"crossref","unstructured":"Cohen G, Afshar S, Tapson J, Schaik A. Emnist: extending mnist to handwritten letters; 2017. arXiv preprint arXiv:1702.05373. https:\/\/www.nist.gov\/itl\/products-and-services\/emnist-dataset","DOI":"10.1109\/IJCNN.2017.7966217"},{"key":"1247_CR79","doi-asserted-by":"crossref","unstructured":"Parkhi OM, Vedaldi A, Zisserman A. Deep face recognition. British machine vision conference; 2015. http:\/\/www.robots.ox.ac.uk\/~vgg\/data\/vgg_face\/","DOI":"10.5244\/C.29.41"},{"key":"1247_CR80","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1016\/j.rse.2016.02.054","volume":"178","author":"L Giglio","year":"2016","unstructured":"Giglio L, Schroeder W, Justice CO. The collection 6 modis active fire detection algorithm and fire products. Remote Sens Environ. 2016;178:31\u201341.","journal-title":"Remote Sens Environ"},{"key":"1247_CR81","doi-asserted-by":"publisher","first-page":"85","DOI":"10.1016\/j.rse.2013.12.008","volume":"143","author":"W Schroeder","year":"2014","unstructured":"Schroeder W, Oliva P, Giglio L, Csiszar IA. The new viirs 375 m active fire detection data product: algorithm description and initial assessment. Remote Sens Environ. 2014;143:85\u201396.","journal-title":"Remote Sens Environ"},{"key":"1247_CR82","doi-asserted-by":"crossref","unstructured":"Katara P, Xian Z, Fragkiadaki K. Gen2sim: scaling up robot learning in simulation with generative models; 2023. arXiv preprint arXiv:2310.18308","DOI":"10.1109\/ICRA57147.2024.10610566"},{"key":"1247_CR83","doi-asserted-by":"crossref","unstructured":"Xiang F, Qin Y, Mo K, Xia Y, Zhu H, Liu F, Liu M, Jiang H, Yuan Y, Wang H. et al. Sapien: A simulated part-based interactive environment. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition; 2020. p. 11097\u2013107.","DOI":"10.1109\/CVPR42600.2020.01111"},{"key":"1247_CR84","doi-asserted-by":"crossref","unstructured":"Geng H, Xu H, Zhao C, Xu C, Yi L, Huang S, Wang H. Gapartnet: Cross-category domain-generalizable object perception and manipulation via generalizable and actionable parts. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition; 2023. p. 7081\u201391.","DOI":"10.1109\/CVPR52729.2023.00684"},{"key":"1247_CR85","doi-asserted-by":"crossref","unstructured":"Dong Y, Liu Y, Zhang H, Chen S, Qiao Y. Fd-gan: Generative adversarial networks with fusion-discriminator for single image dehazing. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34; 2020. p. 10729\u201336.","DOI":"10.1609\/aaai.v34i07.6701"},{"issue":"1","key":"1247_CR86","doi-asserted-by":"publisher","first-page":"492","DOI":"10.1109\/TIP.2018.2867951","volume":"28","author":"B Li","year":"2018","unstructured":"Li B, Ren W, Fu D, Tao D, Feng D, Zeng W, Wang Z. Benchmarking single-image dehazing and beyond. IEEE Trans Image Process. 2018;28(1):492\u2013505.","journal-title":"IEEE Trans Image Process"},{"key":"1247_CR87","unstructured":"Ancuti C, Ancuti CO, Timofte R. Ntire 2018 challenge on image dehazing: Methods and results. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops; 2018. p. 891\u2013901."},{"issue":"28","key":"1247_CR88","doi-asserted-by":"publisher","first-page":"71619","DOI":"10.1007\/s11042-024-18502-7","volume":"83","author":"W Yi","year":"2024","unstructured":"Yi W, Dong L, Liu M, Hui M, Kong L, Zhao Y. Sid-net: single image dehazing network using adversarial and contrastive learning. Multimedia Tools Appl. 2024;83(28):71619\u201338.","journal-title":"Multimedia Tools Appl"},{"key":"1247_CR89","doi-asserted-by":"crossref","unstructured":"Ancuti C, Ancuti CO, Timofte R, De\u00a0Vleeschouwer C. I-haze: A dehazing benchmark with real hazy and haze-free indoor images. In: Advanced concepts for intelligent vision systems: 19th international conference, ACIVS 2018, Poitiers, France, September 24\u201327, 2018, Proceedings 19,2018; Springer. p. 620\u201331.","DOI":"10.1007\/978-3-030-01449-0_52"},{"issue":"11","key":"1247_CR90","doi-asserted-by":"publisher","first-page":"3888","DOI":"10.1109\/TIP.2015.2456502","volume":"24","author":"LK Choi","year":"2015","unstructured":"Choi LK, You J, Bovik AC. Referenceless prediction of perceptual fog density and perceptual image defogging. IEEE Trans Image Process. 2015;24(11):3888\u2013901.","journal-title":"IEEE Trans Image Process"},{"key":"1247_CR91","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-019-09717-4","author":"C Yinka-Banjo","year":"2020","unstructured":"Yinka-Banjo C, Ugot O-A. A review of generative adversarial networks and its application in cybersecurity. Artif Intell Rev. 2020. https:\/\/doi.org\/10.1007\/s10462-019-09717-4.","journal-title":"Artif Intell Rev"},{"key":"1247_CR92","first-page":"6840","volume":"33","author":"J Ho","year":"2020","unstructured":"Ho J, Jain A, Abbeel P. Denoising diffusion probabilistic models. Adv Neural Inf Process Syst. 2020;33:6840\u201351.","journal-title":"Adv Neural Inf Process Syst"},{"key":"1247_CR93","doi-asserted-by":"publisher","first-page":"7327","DOI":"10.1109\/TPAMI.2021.3116668","volume":"44","author":"S Bond-Taylor","year":"2021","unstructured":"Bond-Taylor S, Leach A, Long Y, Willcocks CG. Deep generative modelling: a comparative review of vaes, gans, normalizing flows, energy-based and autoregressive models. IEEE Trans Pattern Anal Mach Intell. 2021;44:7327\u201347.","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"1247_CR94","doi-asserted-by":"crossref","unstructured":"Mao X, Li Q, Xie H, Lau RY, Wang Z, Paul\u00a0Smolley S. Least squares generative adversarial networks. In: Proceedings of the IEEE international conference on computer vision; 2017. p. 2794\u2013802.","DOI":"10.1109\/ICCV.2017.304"},{"key":"1247_CR95","unstructured":"Gulrajani I, Ahmed F, Arjovsky M, Dumoulin V, Courville AC. Improved training of wasserstein gans. Advances in neural information processing systems; 2017."},{"key":"1247_CR96","unstructured":"Miyato T, Kataoka T, Koyama M, Yoshida Y. Spectral normalization for generative adversarial networks; 2018. arXiv preprint arXiv:1802.05957"},{"key":"1247_CR97","unstructured":"Dumoulin V, Belghazi I, Poole B, Mastropietro O, Lamb A, Arjovsky M, Courville A. Adversarially learned inference; 2016. arXiv preprint arXiv:1606.00704"},{"key":"1247_CR98","unstructured":"Goodfellow I. Nips 2016 tutorial: Generative adversarial networks; 2016. arXiv preprint arXiv:1701.00160."},{"key":"1247_CR99","unstructured":"Arora S, Ge R, Liang Y, Ma T, Zhang Y. Generalization and equilibrium in generative adversarial nets (gans). In: International conference on machine learning. PMLR; 2017. p. 224\u201332."},{"key":"1247_CR100","first-page":"78","volume":"3","author":"A Sajeeda","year":"2022","unstructured":"Sajeeda A, Hossain BM. Exploring generative adversarial networks and adversarial training. Int J Cogn Comput Eng. 2022;3:78\u201389.","journal-title":"Int J Cogn Comput Eng"},{"key":"1247_CR101","doi-asserted-by":"crossref","unstructured":"Ratliff LJ, Burden SA, Sastry SS. Characterization and computation of local nash equilibria in continuous games. In: 2013 51st annual Allerton conference on communication, control, and computing (Allerton). IEEE; 2013. p. 917\u201324.","DOI":"10.1109\/Allerton.2013.6736623"},{"issue":"8","key":"1247_CR102","first-page":"1","volume":"54","author":"A Jabbar","year":"2021","unstructured":"Jabbar A, Li X, Omar B. A survey on generative adversarial networks: variants, applications, and training. ACM Comput Surv CSUR. 2021;54(8):1\u201349.","journal-title":"ACM Comput Surv CSUR"},{"key":"1247_CR103","unstructured":"Salimans T, Goodfellow I, Zaremba W, Cheung V, Radford A, Chen X. Improved techniques for training GANS. Advances in neural information processing systems; 2016."},{"key":"1247_CR104","unstructured":"Ioffe S, Szegedy C. Batch normalization: accelerating deep network training by reducing internal covariate shift. In: International conference on machine learning. PMLR; 2015. p. 448\u201356."},{"key":"1247_CR105","unstructured":"Denton EL, Chintala S, Fergus R, et al. Deep generative image models using a Laplacian pyramid of adversarial networks. Advances in neural information processing systems; 2015."},{"key":"1247_CR106","unstructured":"Mirza M, Osindero S. Conditional generative adversarial nets; 2014. arXiv preprint arXiv:1411.1784"},{"key":"1247_CR107","unstructured":"Kumar JK. School of engineering and architecture. PhD thesis, ALMA MATER STUDIORUM-UNIVERSIT\u00c0 DI BOLOGNA; 2022"},{"key":"1247_CR108","first-page":"1303","volume":"14","author":"MD Hoffman","year":"2013","unstructured":"Hoffman MD, Blei DM, Wang C, Paisley J. Stochastic variational inference. J Mach Learn Res. 2013;14:1303\u201347.","journal-title":"J Mach Learn Res"},{"key":"1247_CR109","doi-asserted-by":"crossref","unstructured":"Gretton A, Borgwardt K, Rasch MJ, Scholkopf B, Smola AJ. A kernel method for the two-sample problem; 2008. arXiv preprint arXiv:0805.2368","DOI":"10.7551\/mitpress\/7503.003.0069"},{"key":"1247_CR110","unstructured":"Li Y, Swersky K, Zemel R. Generative moment matching networks. In: International conference on machine learning. PMLR; 2015. p. 1718\u201327."},{"key":"1247_CR111","unstructured":"Dziugaite GK, Roy DM, Ghahramani Z. Training generative neural networks via maximum mean discrepancy optimization; 2015. arXiv preprint arXiv:1505.03906"},{"key":"1247_CR112","unstructured":"Maal\u00f8e L, S\u00f8nderby CK, S\u00f8nderby SK, Winther O. Auxiliary deep generative models. In: International conference on machine learning. PMLR; 2016. p. 1445\u201353."},{"key":"1247_CR113","unstructured":"Xu M, Yu L, Song Y, Shi C, Ermon S, Tang J. Geodiff: A geometric diffusion model for molecular conformation generation; 2022. arXiv preprint arXiv:2203.02923"},{"key":"1247_CR114","doi-asserted-by":"publisher","first-page":"6358","DOI":"10.1109\/TKDE.2024.3389783","volume":"36","author":"L Yang","year":"2024","unstructured":"Yang L, Huang Z, Zhang Z, Liu Z, Hong S, Zhang W, Yang W, Cui B, Zhang L. Graphusion: latent diffusion for graph generation. IEEE Trans Knowl Data Eng. 2024;36:6358\u201369.","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"1247_CR115","unstructured":"Sohl-Dickstein J, Weiss E, Maheswaranathan N, Ganguli S. Deep unsupervised learning using nonequilibrium thermodynamics. In: International conference on machine learning. PMLR; 2015. p. 2256\u201365."},{"key":"1247_CR116","unstructured":"Song Y, Sohl-Dickstein J, Kingma DP, Kumar A, Ermon S, Poole B. Score-based generative modeling through stochastic differential equations; 2020. arXiv preprint arXiv:2011.13456"},{"key":"1247_CR117","doi-asserted-by":"crossref","unstructured":"Ronneberger O, Fischer P, Brox T. U-net: Convolutional networks for biomedical image segmentation. In: Medical Image Computing and Computer-Assisted Intervention\u2013MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18. Springer; 2015. p. 234\u201341.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"1247_CR118","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN. Kaiser \u0141, Polosukhin I. Attention is all you need. Advances in neural information processing systems; 2017."},{"key":"1247_CR119","doi-asserted-by":"crossref","unstructured":"Rombach R, Blattmann A, Lorenz D, Esser P, Ommer B. High-resolution image synthesis with latent diffusion models. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition; 2022. p. 10684\u201395.","DOI":"10.1109\/CVPR52688.2022.01042"},{"key":"1247_CR120","unstructured":"Salimans T, Karpathy A, Chen X, Kingma DP. Pixelcnn++: improving the pixelcnn with discretized logistic mixture likelihood and other modifications; 2017. arXiv preprint arXiv:1701.05517"},{"key":"1247_CR121","unstructured":"Chang Z, Koulieris GA, Shum HP. On the design fundamentals of diffusion models: a survey; 2023. arXiv preprint arXiv:2306.04542"},{"key":"1247_CR122","unstructured":"Popov V, Vovk I, Gogoryan V, Sadekova T, Kudinov M. Grad-tts: A diffusion probabilistic model for text-to-speech. In: International conference on machine learning. PMLR; 2021. p. 8599\u2013608."},{"key":"1247_CR123","doi-asserted-by":"crossref","unstructured":"Bao F, Li C, Cao Y, Zhu J. All are worth words: a vit backbone for score-based diffusion models; 2022. arXiv preprint arXiv:2209.12152","DOI":"10.1109\/CVPR52729.2023.02171"},{"key":"1247_CR124","doi-asserted-by":"crossref","unstructured":"Peebles W, Xie S. Scalable diffusion models with transformers. In: Proceedings of the IEEE\/CVF international conference on computer vision; 2023. p. 4195\u2013205.","DOI":"10.1109\/ICCV51070.2023.00387"},{"key":"1247_CR125","unstructured":"Chahal P. Exploring transformer backbones for image diffusion models; 2022. arXiv preprint arXiv:2212.14678"},{"key":"1247_CR126","unstructured":"Nichol AQ, Dhariwal P. Improved denoising diffusion probabilistic models. In: International conference on machine learning. PMLR; 2021. p. 8162\u201371."},{"issue":"7","key":"1247_CR127","doi-asserted-by":"publisher","first-page":"1661","DOI":"10.1162\/NECO_a_00142","volume":"23","author":"P Vincent","year":"2011","unstructured":"Vincent P. A connection between score matching and denoising autoencoders. Neural Comput. 2011;23(7):1661\u201374.","journal-title":"Neural Comput"},{"key":"1247_CR128","first-page":"12438","volume":"33","author":"Y Song","year":"2020","unstructured":"Song Y, Ermon S. Improved techniques for training score-based generative models. Adv Neural Inf Process Syst. 2020;33:12438\u201348.","journal-title":"Adv Neural Inf Process Syst"},{"key":"1247_CR129","unstructured":"Song Y, Ermon, S. Generative modeling by estimating gradients of the data distribution. Advances in neural information processing systems; 2019."},{"key":"1247_CR130","unstructured":"Jolicoeur-Martineau A, Pich\u00e9-Taillefer R, Combes RTD, Mitliagkas I. Adversarial score matching and improved sampling for image generation; 2020. arXiv preprint arXiv:2009.05475"},{"key":"1247_CR131","unstructured":"Chen X, Duan Y, Houthooft R, Schulman J, Sutskever I, Abbeel P. Infogan: Interpretable representation learning by information maximizing generative adversarial nets. Advances in neural information processing systems; 2016."},{"key":"1247_CR132","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2023.110156","volume":"148","author":"F Ye","year":"2023","unstructured":"Ye F, Bors AG. Self-supervised adversarial variational learning. Pattern Recogn. 2023;148: 110156.","journal-title":"Pattern Recogn"},{"key":"1247_CR133","first-page":"16761","volume":"33","author":"S Gur","year":"2020","unstructured":"Gur S, Benaim S, Wolf L. Hierarchical patch VAE-GAN: generating diverse videos from a single sample. Adv Neural Inf Process Syst. 2020;33:16761\u201372.","journal-title":"Adv Neural Inf Process Syst"},{"key":"1247_CR134","unstructured":"Chinta SV, Wang Z, Yin Z, Hoang N, Gonzalez M, Quy TL, Zhang W. Fairaied: navigating fairness, bias, and ethics in educational ai applications; 2024. arXiv preprint arXiv:2407.18745"},{"issue":"5","key":"1247_CR135","doi-asserted-by":"publisher","first-page":"0000864","DOI":"10.1371\/journal.pdig.0000864","volume":"4","author":"SV Chinta","year":"2025","unstructured":"Chinta SV, Wang Z, Palikhe A, Zhang X, Kashif A, Smith MA, Liu J, Zhang W. Ai-driven healthcare: a survey on ensuring fairness and mitigating bias. PLoS Digital Health. 2025;4(5):0000864.","journal-title":"PLoS Digital Health"},{"issue":"1","key":"1247_CR136","doi-asserted-by":"publisher","first-page":"46","DOI":"10.1186\/s40537-023-00727-2","volume":"10","author":"L Alzubaidi","year":"2023","unstructured":"Alzubaidi L, Bai J, Al-Sabaawi A, Santamar\u00eda J, Albahri A, Al-dabbagh BSN, Fadhel MA, Manoufali M, Zhang J, Al-Timemy AH, et al. A survey on deep learning tools dealing with data scarcity: definitions, challenges, solutions, tips, and applications. J Big Data. 2023;10(1):46.","journal-title":"J Big Data"},{"key":"1247_CR137","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-031-43205-7_3","author":"V Raner","year":"2023","unstructured":"Raner V, Joshi A, Sawant S. Medical image synthesis using generative adversarial networks. GANs Data Augment Healthc. 2023. https:\/\/doi.org\/10.1007\/978-3-031-43205-7_3.","journal-title":"GANs Data Augment Healthc"},{"key":"1247_CR138","doi-asserted-by":"crossref","unstructured":"Park T, Liu M-Y, Wang T-C, Zhu J-Y. Semantic image synthesis with spatially-adaptive normalization. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition; 2019. p. 2337\u201346.","DOI":"10.1109\/CVPR.2019.00244"},{"key":"1247_CR139","doi-asserted-by":"crossref","unstructured":"Lin Z, Shi Y, Xue Z. Idsgan: Generative adversarial networks for attack generation against intrusion detection. In: Pacific-Asia conference on knowledge discovery and data mining. Springer; 2022. p. 79\u201391.","DOI":"10.1007\/978-3-031-05981-0_7"},{"key":"1247_CR140","unstructured":"Zhang C, Zhang C, Zhang M, Kweon IS. Text-to-image diffusion model in generative AI: a survey; 2023. arXiv preprint arXiv:2303.07909"},{"issue":"2","key":"1247_CR141","doi-asserted-by":"publisher","first-page":"4233","DOI":"10.1109\/LRA.2021.3068671","volume":"6","author":"TS Lembono","year":"2021","unstructured":"Lembono TS, Pignat E, Jankowski J, Calinon S. Learning constrained distributions of robot configurations with generative adversarial network. IEEE Robot Autom Lett. 2021;6(2):4233\u201340.","journal-title":"IEEE Robot Autom Lett"},{"key":"1247_CR142","doi-asserted-by":"crossref","unstructured":"Zhu J-Y, Park T, Isola P, Efros AA. Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE international conference on computer vision; 2017. p. 2223\u201332.","DOI":"10.1109\/ICCV.2017.244"},{"key":"1247_CR143","doi-asserted-by":"publisher","first-page":"109","DOI":"10.3389\/fevo.2016.00109","volume":"4","author":"GL Perry","year":"2016","unstructured":"Perry GL, Wainwright J, Etherington TR, Wilmshurst JM. Experimental simulation: using generative modeling and palaeoecological data to understand human-environment interactions. Front Ecol Evol. 2016;4:109.","journal-title":"Front Ecol Evol"},{"key":"1247_CR144","first-page":"111749","volume":"240","author":"D Liang","year":"2020","unstructured":"Liang D, Zhang Z, Zhu XX. Deep learning for remote sensing image understanding: a survey. Remote Sens Environ. 2020;240:111749.","journal-title":"Remote Sens Environ"},{"key":"1247_CR145","doi-asserted-by":"crossref","unstructured":"Reed S, Akata Z, Lee H, Schiele B. Learning deep representations of fine-grained visual descriptions. In: Proceedings of the IEEE conference on computer vision and pattern recognition; 2016. p. 49\u201358.","DOI":"10.1109\/CVPR.2016.13"},{"key":"1247_CR146","doi-asserted-by":"publisher","DOI":"10.4159\/9780674039919","volume-title":"The economic structure of intellectual property law","author":"WM Landes","year":"2003","unstructured":"Landes WM, Posner RA. The economic structure of intellectual property law. Harvard University Press; 2003."},{"key":"1247_CR147","doi-asserted-by":"crossref","unstructured":"Xu E, Zhang W, Xu W. Transforming digital forensics with large language models: Unlocking automation, insights, and justice. In: Proceedings of the 33rd ACM international conference on information and knowledge management; 2024. p. 5543\u20136.","DOI":"10.1145\/3627673.3679091"},{"key":"1247_CR148","unstructured":"Yin Z, Wang Z, Xu W, Zhuang J, Mozumder P, Smith A, Zhang W. Digital forensics in the age of large language models; 2025. arXiv preprint arXiv:2504.02963"},{"key":"1247_CR149","doi-asserted-by":"publisher","first-page":"17","DOI":"10.1017\/dap.2022.10","volume":"4","author":"G Franceschelli","year":"2022","unstructured":"Franceschelli G, Musolesi M. Copyright in generative deep learning. Data & Policy. 2022;4:17.","journal-title":"Data & Policy"},{"key":"1247_CR150","doi-asserted-by":"crossref","unstructured":"Sag M. Copyright safety for generative ai. Forthcoming in the Houston Law Review; 2023","DOI":"10.2139\/ssrn.4438593"},{"key":"1247_CR151","unstructured":"Poland CM. Generative ai and us intellectual property law; 2023. arXiv preprint arXiv:2311.16023"},{"key":"1247_CR152","doi-asserted-by":"crossref","unstructured":"Sixta T, Jacques\u00a0Junior JC, Buch-Cardona P, Vazquez E, Escalera S. Fairface challenge at eccv 2020: Analyzing bias in face recognition. In: Computer Vision\u2013ECCV 2020 workshops: Glasgow, UK, August 23\u201328, 2020, Proceedings, Part VI 16. Springer; 2020. p. 463\u201381.","DOI":"10.1007\/978-3-030-65414-6_32"},{"issue":"1150","key":"1247_CR153","doi-asserted-by":"publisher","first-page":"20230023","DOI":"10.1259\/bjr.20230023","volume":"96","author":"JW Gichoya","year":"2023","unstructured":"Gichoya JW, Thomas K, Celi LA, Safdar N, Banerjee I, Banja JD, Seyyed-Kalantari L, Trivedi H, Purkayastha S. Ai pitfalls and what not to do: mitigating bias in ai. Br J Radiol. 2023;96(1150):20230023.","journal-title":"Br J Radiol"},{"issue":"2","key":"1247_CR154","doi-asserted-by":"publisher","first-page":"399","DOI":"10.1007\/s11760-022-02246-8","volume":"17","author":"AH Sham","year":"2023","unstructured":"Sham AH, Aktas K, Rizhinashvili D, Kuklianov D, Alisinanoglu F, Ofodile I, Ozcinar C, Anbarjafari G. Ethical ai in facial expression analysis: racial bias. SIViP. 2023;17(2):399\u2013406.","journal-title":"SIViP"},{"key":"1247_CR155","doi-asserted-by":"publisher","first-page":"203","DOI":"10.1038\/s41592-020-01008-z","volume":"18","author":"F Isensee","year":"2021","unstructured":"Isensee F, Jaeger PF, Kohl SAA, et al. NNU-NET: a self-configuring method for deep learning-based biomedical image segmentation. Nat Methods. 2021;18:203\u201311.","journal-title":"Nat Methods"},{"issue":"3","key":"1247_CR156","doi-asserted-by":"publisher","first-page":"1001779","DOI":"10.1371\/journal.pmed.1001779","volume":"12","author":"C Sudlow","year":"2015","unstructured":"Sudlow C, Gallacher J, Allen N, Beral V, Burton P, Danesh J, et al. Uk biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12(3):1001779.","journal-title":"PLoS Med"},{"key":"1247_CR157","doi-asserted-by":"crossref","unstructured":"Feng Y, Shah C. Has ceo gender bias really been fixed? adversarial attacking and improving gender fairness in image search. In: Proceedings of the AAAI conference on artificial intelligence, vol. 36; 2022. p. 11882\u201390.","DOI":"10.1609\/aaai.v36i11.21445"},{"key":"1247_CR158","unstructured":"Yao R, Cui Z, Li X, Gu L. Improving fairness in image classification via sketching; 2022. arXiv preprint arXiv:2211.00168"},{"key":"1247_CR159","doi-asserted-by":"crossref","unstructured":"Liu Z, Luo P, Wang X, Tang X. Deep learning face attributes in the wild. In: Proceedings of international conference on computer vision (ICCV); 2015","DOI":"10.1109\/ICCV.2015.425"},{"key":"1247_CR160","unstructured":"Codella N, Rotemberg V, Tschandl P, Celebi ME, Dusza S, Gutman D, Helba B, Kalloo A, Liopyris K, Marchetti M, Kittler H, Halpern A. Skin lesion analysis toward melanoma detection 2018: a challenge hosted by the international skin imaging collaboration (ISIC); 2018."},{"key":"1247_CR161","doi-asserted-by":"crossref","unstructured":"Hwang S, Park S, Kim D, Do M, Byun H. Fairfacegan: Fairness-aware facial image-to-image translation; 2020. arXiv preprint arXiv:2012.00282","DOI":"10.5244\/C.34.54"},{"key":"1247_CR162","doi-asserted-by":"crossref","unstructured":"Booth BM, Hickman L, Subburaj SK, Tay L, Woo SE, D\u2019Mello SK. Bias and fairness in multimodal machine learning: a case study of automated video interviews. In: Proceedings of the 2021 international conference on multimodal interaction; 2021. p. 268\u201377.","DOI":"10.1145\/3462244.3479897"},{"key":"1247_CR163","doi-asserted-by":"publisher","first-page":"122677","DOI":"10.1109\/ACCESS.2023.3325891","volume":"11","author":"C Kim","year":"2023","unstructured":"Kim C, Choi J, Yoon J, Yoo D, Lee W. Fairness-aware multimodal learning in automatic video interview assessment. IEEE Access. 2023;11:122677\u201393.","journal-title":"IEEE Access"},{"key":"1247_CR164","unstructured":"Bojchevski A, Shchur O, Z\u00fcgner D, G\u00fcnnemann S. Netgan: generating graphs via random walks. In: International conference on machine learning. PMLR; 2018. p. 610\u201319."},{"key":"1247_CR165","doi-asserted-by":"crossref","unstructured":"Zhou D, Zheng L, Han J, He J. A data-driven graph generative model for temporal interaction networks. In: Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining; 2020. p. 401\u201311.","DOI":"10.1145\/3394486.3403082"},{"key":"1247_CR166","doi-asserted-by":"crossref","unstructured":"Wang Z, Wallace C, Bifet A, Yao X, Zhang W. Fairness-aware graph generative adversarial networks. In: Joint European conference on machine learning and knowledge discovery in databases. Springer; 2023. p. 259\u201375.","DOI":"10.1007\/978-3-031-43415-0_16"},{"key":"1247_CR167","unstructured":"Wang Z, Zhang W. Fdgen: A fairness-aware graph generation model. In: Proceedings of the 42nd international conference on machine learning. PMLR; 2025."},{"issue":"2","key":"1247_CR168","doi-asserted-by":"publisher","first-page":"99","DOI":"10.1145\/3715073.3715082","volume":"26","author":"Z Wang","year":"2025","unstructured":"Wang Z, Yin Z, Zhang Y, Yang L, Zhang T, Pissinou N, Cai Y, Hu S, Li Y, Zhao L, et al. FG-smote: towards fair node classification with graph neural network. ACM SIGKDD Explorations Newsl. 2025;26(2):99\u2013108.","journal-title":"ACM SIGKDD Explorations Newsl"},{"key":"1247_CR169","doi-asserted-by":"crossref","unstructured":"Wang Z, Saxena N, Yu T, Karki S, Zetty T, Haque I, Zhou S, Kc D, Stockwell I, Bifet A, et al. Preventing discriminatory decision-making in evolving data streams. In: Proceedings of the 2023 ACM conference on fairness, accountability, and transparency (FAccT); 2023","DOI":"10.1145\/3593013.3593984"},{"issue":"2","key":"1247_CR170","first-page":"58","volume":"1","author":"AS George","year":"2023","unstructured":"George AS, George AH. Deepfakes: the evolution of hyper realistic media manipulation. Partners Univ Innov Res Publ. 2023;1(2):58\u201374.","journal-title":"Partners Univ Innov Res Publ"},{"key":"1247_CR171","unstructured":"Marr B. What is Generative AI: a super-simple explanation anyone can understand. Forbes Magazine; 2023. https:\/\/www.forbes.com\/sites\/bernardmarr\/2023\/09\/19\/what-is-generative-ai-a-super-simple-explanation-anyone-can-\/understand\/"},{"key":"1247_CR172","unstructured":"Chance C. Deepfakes and legal implications: seeing is not believing; 2020. https:\/\/www.cliffordchance.com\/insights\/resources\/blogs\/talking-tech\/en\/articles\/2020\/12\/deepfakes-and-legal-implications--seeing-is-not-believing.html"},{"key":"1247_CR173","unstructured":"Baxter J. How star wars deepfake seriously improves Luke Skywalker cameo in the mandalorian; 2021. https:\/\/www.denofgeek.com\/tv\/star-wars-deepfake-luke-skywalker-mandalorian\/"},{"key":"1247_CR174","unstructured":"Goggin B. From porn to game of thrones: how deepfakes and realistic-looking fake videos hit it big. Business Insider. https:\/\/www.businessinsider.com\/deepfakes-explained-the-rise-of-fake-realistic-videos-online-2019-6"},{"key":"1247_CR175","unstructured":"Choi CQ. AI creates fake Obama. IEEE Spectrum 2022. https:\/\/spectrum.ieee.org\/ai-creates-fake-obama"},{"key":"1247_CR176","unstructured":"BasuMallick C. Deepfake types, examples, prevention; 2022. https:\/\/www.spiceworks.com\/it-security\/cyber-risk-management\/articles\/what-is-deepfake\/"},{"key":"1247_CR177","doi-asserted-by":"crossref","unstructured":"Rossler A, Cozzolino D, Verdoliva L, Riess C, Thies J, Nie\u00dfner M. Faceforensics++: learning to detect manipulated facial images. In: Proceedings of the IEEE\/CVF international conference on computer vision; 2019. p. 1\u2013 11","DOI":"10.1109\/ICCV.2019.00009"},{"key":"1247_CR178","first-page":"00065","volume":"2005","author":"D Saxena","year":"2020","unstructured":"Saxena D, Cao J. Generative adversarial networks (GANS): challenges. Solut Future Direct. 2020;2005:00065.","journal-title":"Solut Future Direct"},{"key":"1247_CR179","unstructured":"Kim T, Cha M, Kim H, Lee JK, Kim J. Learning to discover cross-domain relations with generative adversarial networks. In: International conference on machine learning. PMLR; 2017. p. 1857\u201365 . PMLR"},{"key":"1247_CR180","unstructured":"Bau D, Zhu J-Y, Strobelt H, Zhou B, Tenenbaum JB, Freeman WT, Torralba A. Gan dissection: Visualizing and understanding generative adversarial networks; 2018. arXiv preprint arXiv:1811.10597"},{"key":"1247_CR181","doi-asserted-by":"crossref","unstructured":"Park JS, O\u2019Brien J, Cai CJ, Morris MR, Liang P, Bernstein MS. Generative agents: Interactive simulacra of human behavior. In: Proceedings of the 36th annual ACM symposium on user interface software and technology; 2023. p. 1\u201322.","DOI":"10.1145\/3586183.3606763"},{"key":"1247_CR182","doi-asserted-by":"crossref","unstructured":"Rebol M, G\u00fctl C, Pietroszek K. Real-time gesture animation generation from speech for virtual human interaction. In: Extended abstracts of the 2021 CHI conference on human factors in computing systems; 2021. p. 1\u20134.","DOI":"10.1145\/3411763.3451554"},{"key":"1247_CR183","unstructured":"Hendrycks D, Gimpel K. A baseline for detecting misclassified and out-of-distribution examples in neural networks; 2016. arXiv preprint arXiv:1610.02136"},{"key":"1247_CR184","unstructured":"Hacker P. Ai regulation in Europe: from the AI act to future regulatory challenges; 2023. arXiv preprint arXiv:2310.04072"},{"key":"1247_CR185","doi-asserted-by":"publisher","first-page":"38","DOI":"10.22381\/RCP2120223","volume":"21","author":"F Bacalu","year":"2022","unstructured":"Bacalu F, et al. Biometric facial recognition technology, law enforcement algorithmic automation, and data-driven predictive policing systems in human rights protections and abuses. Rev Contemp Philos. 2022;21:38\u201354.","journal-title":"Rev Contemp Philos"},{"key":"1247_CR186","doi-asserted-by":"crossref","unstructured":"Milossi M. Remote biometric identification systems and ethical challenges: The case of facial recognition. In: 2021 6th South-East Europe design automation, computer engineering, computer networks and social media conference (SEEDA-CECNSM). IEEE; 2021. p. 1\u20136.","DOI":"10.1109\/SEEDA-CECNSM53056.2021.9566226"},{"key":"1247_CR187","unstructured":"The United States Government; 2023. https:\/\/www.whitehouse.gov\/briefing-room\/statements-releases\/2023\/10\/30\/fact-sheet-president-biden-issues-executive-order-on-safe-secure-and-\/trustworthy-artificial-intelligence\/"},{"key":"1247_CR188","unstructured":"https:\/\/www.congress.gov\/bill\/117th-congress\/house-bill\/9631?s=1&r=1"},{"key":"1247_CR189","unstructured":"Hinchey M. Hinchey Bill to ban non-consensual deepfake images signed into law; 2023. https:\/\/www.nysenate.gov\/newsroom\/press-releases\/2023\/michelle-hinchey\/hinchey-bill-ban-non-consensual-deepfake-images"},{"key":"1247_CR190","unstructured":"https:\/\/c2pa.org\/"},{"key":"1247_CR191","unstructured":"https:\/\/leica-camera.com\/en-US\/photography\/content-credentials"},{"key":"1247_CR192","doi-asserted-by":"publisher","first-page":"1955","DOI":"10.1007\/s43681-024-00583-7","volume":"5","author":"Z Wang","year":"2024","unstructured":"Wang Z, Chu Z, Doan TV, Ni S, Yang M, Zhang W. History, development, and principles of large language models-an introductory survey. AI Ethics. 2024;5:1955\u201371.","journal-title":"AI Ethics"},{"key":"1247_CR193","doi-asserted-by":"crossref","unstructured":"Tang Y, Bi J, Xu S, Song L, Liang S, Wang T, Zhang D, An J, Lin J, Zhu R, et al. Video understanding with large language models: a survey. IEEE Trans Circ Syst Video Technol. 2025.","DOI":"10.1109\/TCSVT.2025.3566695"},{"key":"1247_CR194","doi-asserted-by":"publisher","first-page":"34","DOI":"10.1145\/3682112.3682117","volume":"2024","author":"Z Chu","year":"2024","unstructured":"Chu Z, Wang Z, Zhang W. Fairness in large language models: a taxonomic survey. ACM SIGKDD Explorations Newsl. 2024;2024:34\u201348.","journal-title":"ACM SIGKDD Explorations Newsl"},{"key":"1247_CR195","unstructured":"Li Z, Wu X, Du H, Nghiem H, Shi G. Benchmark evaluations, applications, and challenges of large vision language models: a survey. arXiv preprint arXiv: 2501.02189"},{"key":"1247_CR196","unstructured":"Wang Z, Palikhe A, Zhang W. Fairness definitions in language models explained; 2024. arXiv preprint arXiv:2407.18454"}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-025-01247-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s40537-025-01247-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-025-01247-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T10:04:34Z","timestamp":1759917874000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-025-01247-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,10,8]]},"references-count":196,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,12]]}},"alternative-id":["1247"],"URL":"https:\/\/doi.org\/10.1186\/s40537-025-01247-x","relation":{},"ISSN":["2196-1115"],"issn-type":[{"value":"2196-1115","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,10,8]]},"assertion":[{"value":"29 July 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 July 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"8 October 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"230"}}