{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,9]],"date-time":"2026-07-09T04:10:38Z","timestamp":1783570238370,"version":"3.55.0"},"reference-count":236,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,1,29]],"date-time":"2021-01-29T00:00:00Z","timestamp":1611878400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,1,29]],"date-time":"2021-01-29T00:00:00Z","timestamp":1611878400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100007601","name":"Horizon 2020","doi-asserted-by":"publisher","award":["Project ID: 814225"],"award-info":[{"award-number":["Project ID: 814225"]}],"id":[{"id":"10.13039\/501100007601","id-type":"DOI","asserted-by":"publisher"}]},{"name":"ELKARTEK","award":["KK-2020\/00049 3KIA"],"award-info":[{"award-number":["KK-2020\/00049 3KIA"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Developing any computer vision application starts with acquiring images and data, followed by preprocessing and pattern recognition steps to perform a task. When the acquired images are highly imbalanced and inadequate, the desired task may not be achievable. Unfortunately, imbalance problems in acquired image datasets are inevitable in certain complex real-world problems such as anomaly detection, emotion recognition, medical image analysis, fraud detection, metallic surface defect detection, and disaster prediction. The performance of computer vision algorithms can deteriorate significantly when the training dataset is imbalanced. 
In recent years, Generative Adversarial Neural Networks (GANs) have attracted considerable attention from researchers across a variety of application domains because of their capability to model complex real-world image data. Notably, GANs can be used not only to generate synthetic images: their adversarial learning scheme has also shown good potential for restoring balance in imbalanced datasets.<\/jats:p>\n                  <jats:p>In this paper, we examine the most recent developments in GAN-based techniques for addressing imbalance problems in image data. The real-world challenges and implementations of GAN-based synthetic image generation are extensively covered in this survey. Our survey first introduces various imbalance problems in computer vision tasks and their existing solutions, and then examines key concepts such as deep generative image models and GANs. After that, we propose a taxonomy that summarizes GAN-based techniques for addressing imbalance problems in computer vision tasks into three major categories: 1. image-level imbalances in classification, 2. object-level imbalances in object detection, and 3. pixel-level imbalances in segmentation tasks. We elaborate on the imbalance problems in each category and present GAN-based solutions for each. 
Readers will understand how GAN-based techniques can handle imbalance problems and boost the performance of computer vision algorithms.<\/jats:p>","DOI":"10.1186\/s40537-021-00414-0","type":"journal-article","created":{"date-parts":[[2021,1,29]],"date-time":"2021-01-29T12:03:17Z","timestamp":1611921797000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":235,"title":["A survey on generative adversarial networks for imbalance problems in computer vision tasks"],"prefix":"10.1186","volume":"8","author":[{"given":"Vignesh","family":"Sampath","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"I\u00f1aki","family":"Maurtua","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Juan Jos\u00e9","family":"Aguilar Mart\u00edn","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Aitor","family":"Gutierrez","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2021,1,29]]},"reference":[{"key":"414_CR1","doi-asserted-by":"crossref","unstructured":"Nugraha BT, Su SF, Fahmizal. Towards self-driving car using convolutional neural network and road lane detector. Proceedings of the 2nd International Conference on Automation, Cognitive Science, Optics, Micro Electro-Mechanical System, and Information Technology, ICACOMIT 2017. 2017;2018-Janua:65\u20139.","DOI":"10.1109\/ICACOMIT.2017.8253388"},{"key":"414_CR2","doi-asserted-by":"publisher","DOI":"10.1186\/s40537-019-0276-2","author":"SS Yadav","year":"2019","unstructured":"Yadav SS, Jadhav SM. Deep convolutional neural network based medical image classification for disease diagnosis. J Big Data. 2019. 
https:\/\/doi.org\/10.1186\/s40537-019-0276-2.","journal-title":"J Big Data."},{"key":"414_CR3","doi-asserted-by":"publisher","DOI":"10.1155\/2019\/5219471","author":"A Gutierrez","year":"2019","unstructured":"Gutierrez A, Ansuategi A, Susperregi L, Tub\u00edo C, Ranki\u0107 I, Len\u017ea L. A Benchmarking of learning strategies for pest detection and identification on tomato plants for autonomous scouting robots using internal databases. J Sensors. 2019. https:\/\/doi.org\/10.1155\/2019\/5219471.","journal-title":"J Sensors"},{"key":"414_CR4","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-35990-4_12","author":"L Santos","year":"2020","unstructured":"Santos L, Santos FN, Oliveira PM, Shinde P. Deep learning applications in agriculture: a short review. Advances in intelligent systems and computing. Fourth Ibe. 2020. https:\/\/doi.org\/10.1007\/978-3-030-35990-4_12.","journal-title":"Fourth Ibe."},{"key":"414_CR5","doi-asserted-by":"publisher","first-page":"3465","DOI":"10.1007\/s00170-017-0882-0","volume":"94","author":"T Wang","year":"2018","unstructured":"Wang T, Chen Y, Qiao M, Snoussi H. A fast and robust convolutional neural network-based defect detection model in product quality control. Int J Adv Manufactur Technol. 2018;94:3465\u201371.","journal-title":"Int J Adv Manufactur Technol."},{"key":"414_CR6","doi-asserted-by":"publisher","DOI":"10.1186\/s40537-019-0263-7","author":"M Hashemi","year":"2019","unstructured":"Hashemi M. Enlarging smaller images before inputting into convolutional neural network: zero-padding vs interpolation. J Big Data. 2019. https:\/\/doi.org\/10.1186\/s40537-019-0263-7.","journal-title":"J Big Data."},{"key":"414_CR7","doi-asserted-by":"crossref","unstructured":"Lecun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proceedings of the IEEE . 1998;86:2278\u2013324. 
http:\/\/ieeexplore.ieee.org\/document\/726791\/","DOI":"10.1109\/5.726791"},{"key":"414_CR8","doi-asserted-by":"crossref","unstructured":"Girshick R, Donahue J, Darrell T, Malik J. Rich feature hierarchies for accurate object detection and semantic segmentation. 2014 IEEE Conference on Computer Vision and Pattern Recognition . IEEE; 2014. p. 580\u20137. http:\/\/ieeexplore.ieee.org\/document\/6909475\/","DOI":"10.1109\/CVPR.2014.81"},{"key":"414_CR9","doi-asserted-by":"crossref","unstructured":"Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic segmentation. 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) . IEEE; 2015. p. 3431\u201340. http:\/\/arxiv.org\/abs\/1605.06211","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"414_CR10","first-page":"1097","volume":"2","author":"A Krizhevsky","year":"2012","unstructured":"Krizhevsky A, Sutskever I, Hinton GE. ImageNet classification with deep convolutional neural networks. Adv Neural Informat Process Syst. 2012;2:1097\u2013105.","journal-title":"Adv Neural Informat Process Syst"},{"key":"414_CR11","unstructured":"Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. 3rd International Conference on Learning Representations, ICLR 2015\u2013Conference Track Proceedings. 2015;1\u201314."},{"key":"414_CR12","unstructured":"Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, et al. Going Deeper with Convolutions. CoRR . 2014; abs\/1409.4. https:\/\/arxiv.org\/abs\/1409.4842"},{"key":"414_CR13","doi-asserted-by":"crossref","unstructured":"He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. Proceedings of the IEEE computer society conference on computer vision and pattern recognition. 2016. p. 770\u20138. http:\/\/arxiv.org\/abs\/1512.03385","DOI":"10.1109\/CVPR.2016.90"},{"key":"414_CR14","doi-asserted-by":"crossref","unstructured":"Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z. 
Rethinking the inception architecture for computer vision. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) . IEEE; 2016. p. 2818\u201326. http:\/\/arxiv.org\/abs\/1512.00567","DOI":"10.1109\/CVPR.2016.308"},{"key":"414_CR15","doi-asserted-by":"crossref","unstructured":"Huang G, Liu Z, Van Der Maaten L, Weinberger KQ. Densely connected convolutional networks. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) . IEEE; 2017. p. 2261\u20139. http:\/\/arxiv.org\/abs\/1608.06993","DOI":"10.1109\/CVPR.2017.243"},{"key":"414_CR16","doi-asserted-by":"crossref","unstructured":"Buda M, Maki A, Mazurowski MA. A systematic study of the class imbalance problem in convolutional neural networks. Neural Netw. 2018;106:249\u201359. https:\/\/linkinghub.elsevier.com\/retrieve\/pii\/S0893608018302107","DOI":"10.1016\/j.neunet.2018.07.011"},{"key":"414_CR17","doi-asserted-by":"publisher","unstructured":"Al-Stouhi S, Reddy CK. Transfer learning for class imbalance problems with inadequate data. Knowl Informat Syst. 2016;48:201\u201328. https:\/\/doi.org\/10.1007\/s10115-015-0870-3","DOI":"10.1007\/s10115-015-0870-3"},{"key":"414_CR18","first-page":"176","volume":"7","author":"A Ali","year":"2015","unstructured":"Ali A, Shamsuddin SM, Ralescu AL. Classification with class imbalance problem: a review. Int J Adv Soft Comput Applicat. 2015;7:176\u2013204.","journal-title":"Int J Adv Soft Comput Applicat"},{"key":"414_CR19","unstructured":"Zhang J, Xia Y, Wu Q, Xie Y. Classification of medical images and illustrations in the biomedical literature using synergic deep learning. 2017. http:\/\/arxiv.org\/abs\/1706.09092"},{"key":"414_CR20","doi-asserted-by":"crossref","unstructured":"Dong Q, Gong S, Zhu X. Imbalanced deep learning by minority class incremental rectification. IEEE Transactions on Pattern Analysis and Machine Intelligence . 2019;41:1367\u201381. 
https:\/\/ieeexplore.ieee.org\/document\/8353718","DOI":"10.1109\/TPAMI.2018.2832629"},{"key":"414_CR21","doi-asserted-by":"crossref","unstructured":"Zhang Y, Li B, Lu H, Irie A, Ruan X. Sample-Specific SVM learning for person re-identification. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) . IEEE; 2016. p. 1278\u201387. http:\/\/ieeexplore.ieee.org\/document\/7780512\/","DOI":"10.1109\/CVPR.2016.143"},{"key":"414_CR22","doi-asserted-by":"publisher","first-page":"981","DOI":"10.1007\/s10462-018-9661-z","volume":"52","author":"MM Sawant","year":"2019","unstructured":"Sawant MM, Bhurchandi KM. Age invariant face recognition: a survey on facial aging databases, techniques and effect of aging. Artific Intell Rev. 2019;52:981\u20131008. https:\/\/doi.org\/10.1007\/s10462-018-9661-z.","journal-title":"Artific Intell Rev."},{"key":"414_CR23","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1007\/978-3-642-33783-3_2","volume-title":"Pose Invariant Approach for Face Recognition at Distance","author":"E Mostafa","year":"2012","unstructured":"Mostafa E, Ali A, Alajlan N, Farag A. Pose Invariant Approach for Face Recognition at Distance. Berlin : Springer; 2012. p. 15\u201328. https:\/\/doi.org\/10.1007\/978-3-642-33783-3_2."},{"key":"414_CR24","doi-asserted-by":"publisher","first-page":"429","DOI":"10.5555\/1293951.1293954","volume":"6","author":"N Japkowicz","year":"2002","unstructured":"Japkowicz N, Stephen S. The class imbalance problem: a systematic study. Intell Data Analy. 2002;6:429\u201349. https:\/\/doi.org\/10.5555\/1293951.1293954.","journal-title":"Intell Data Analy."},{"key":"414_CR25","doi-asserted-by":"publisher","first-page":"853","DOI":"10.1007\/0-387-25465-X_40","volume-title":"Data mining for imbalanced datasets: an overview. data mining and knowledge discovery handbook","author":"NV Chawla","year":"2009","unstructured":"Chawla NV. Data mining for imbalanced datasets: an overview. 
data mining and knowledge discovery handbook. New York : Springer-Verlag; 2009. p. 853\u201367. https:\/\/doi.org\/10.1007\/0-387-25465-X_40."},{"key":"414_CR26","doi-asserted-by":"publisher","unstructured":"Chawla NV, Japkowicz N, Kotcz A. Special Issue on Learning from Imbalanced Data Sets. ACM SIGKDD Explorations Newsletter. 2004; 6: 1\u20136. https:\/\/doi.org\/10.1145\/1007730.1007733","DOI":"10.1145\/1007730.1007733"},{"key":"414_CR27","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1613\/jair.953","volume":"16","author":"NV Chawla","year":"2011","unstructured":"Chawla N V., Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: Synthetic minority over-sampling technique. J Artific Intell Res. 2011;16:321\u201357. https:\/\/doi.org\/10.1613\/jair.953. https:\/\/arxiv.org\/abs\/1106.1813","journal-title":"J Artific Intell Res"},{"key":"414_CR28","doi-asserted-by":"crossref","unstructured":"Haibo He, Yang Bai, Garcia EA, Shutao Li. ADASYN: Adaptive synthetic sampling approach for imbalanced learning. 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence) . IEEE; 2008. p. 1322\u20138. http:\/\/ieeexplore.ieee.org\/document\/4633969\/","DOI":"10.1109\/IJCNN.2008.4633969"},{"key":"414_CR29","doi-asserted-by":"crossref","unstructured":"Puntumapon K, Rakthamamon T, Waiyamai K. Cluster-based minority over-sampling for imbalanced datasets. IEICE Transactions on Information and Systems . 2016;E99.D:3101\u20139. https:\/\/www.jstage.jst.go.jp\/article\/transinf\/E99.D\/12\/E99.D_2016EDP7130\/_article","DOI":"10.1587\/transinf.2016EDP7130"},{"key":"414_CR30","doi-asserted-by":"crossref","unstructured":"Simard PY, Steinkraus D, Platt JC. Best practices for convolutional neural networks applied to visual document analysis. Seventh International Conference on Document Analysis and Recognition, 2003 Proceedings . IEEE Comput. Soc; p. 958\u201363. 
http:\/\/ieeexplore.ieee.org\/document\/1227801\/","DOI":"10.1109\/ICDAR.2003.1227801"},{"key":"414_CR31","doi-asserted-by":"crossref","unstructured":"Lemley J, Bazrafkan S, Corcoran P. Deep Learning for Consumer Devices and Services: Pushing the limits for machine learning, artificial intelligence, and computer vision. IEEE Consumer Electronics Magazine . 2017;6:48\u201356. http:\/\/ieeexplore.ieee.org\/document\/7879402\/","DOI":"10.1109\/MCE.2016.2640698"},{"key":"414_CR32","doi-asserted-by":"publisher","first-page":"60","DOI":"10.1186\/s40537-019-0197-0","volume":"6","author":"C Shorten","year":"2019","unstructured":"Shorten C, Khoshgoftaar TM. A survey on image data augmentation for deep learning. J Big Data. 2019;6:60. https:\/\/doi.org\/10.1186\/s40537-019-0197-0.","journal-title":"J Big Data."},{"key":"414_CR33","doi-asserted-by":"crossref","unstructured":"Wu H, Prasad S. Semi-Supervised Deep Learning Using Pseudo Labels for Hyperspectral Image Classification. IEEE Transactions on Image Processing . 2018;27:1259\u201370. http:\/\/ieeexplore.ieee.org\/document\/8105856\/","DOI":"10.1109\/TIP.2017.2772836"},{"key":"414_CR34","doi-asserted-by":"publisher","first-page":"373","DOI":"10.1007\/s10994-019-05855-6","volume":"109","author":"JE van Engelen","year":"2020","unstructured":"van Engelen JE, Hoos HH. A survey on semi-supervised learning. Mach Learn. 2020;109:373\u2013440. https:\/\/doi.org\/10.1007\/s10994-019-05855-6.","journal-title":"Mach Learn"},{"key":"414_CR35","doi-asserted-by":"crossref","unstructured":"Thai-Nghe N, Gantner Z, Schmidt-Thieme L. Cost-sensitive learning methods for imbalanced data. The 2010 International Joint Conference on Neural Networks (IJCNN) . IEEE; 2010. p. 1\u20138. http:\/\/ieeexplore.ieee.org\/document\/5596486\/","DOI":"10.1109\/IJCNN.2010.5596486"},{"key":"414_CR36","doi-asserted-by":"crossref","unstructured":"Girshick R. Fast R-CNN. 2015 IEEE International Conference on Computer Vision (ICCV) . IEEE; 2015. p. 1440\u20138. 
http:\/\/ieeexplore.ieee.org\/document\/7410526\/","DOI":"10.1109\/ICCV.2015.169"},{"key":"414_CR37","doi-asserted-by":"crossref","unstructured":"Ren S, He K, Girshick R, Sun J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence . 2017;39:1137\u201349. http:\/\/ieeexplore.ieee.org\/document\/7485869\/","DOI":"10.1109\/TPAMI.2016.2577031"},{"key":"414_CR38","doi-asserted-by":"crossref","unstructured":"He K, Gkioxari G, Dollar P, Girshick R. Mask R-CNN. IEEE Transactions on pattern analysis and machine intelligence. 2020;42:386\u201397. https:\/\/ieeexplore.ieee.org\/document\/8372616\/","DOI":"10.1109\/TPAMI.2018.2844175"},{"key":"414_CR39","doi-asserted-by":"publisher","unstructured":"Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu C-Y, et al. SSD: Single Shot MultiBox Detector. In: Leibe B, Matas J, Sebe N, Welling M, editors. Cham: Springer International Publishing; 2016. p. 21\u201337. Doi: https:\/\/doi.org\/10.1007\/978-3-319-46448-0_2","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"414_CR40","unstructured":"Redmon J, Divvala S, Girshick R, Farhadi A. You Only Look Once (YOLO). CVPR. 2016."},{"key":"414_CR41","doi-asserted-by":"crossref","unstructured":"Yan X, Gong H, Jiang Y, Xia S-T, Zheng F, You X, et al. Video scene parsing: an overview of deep learning methods and datasets. Computer Vision and Image Understanding . 2020;201:103077. https:\/\/linkinghub.elsevier.com\/retrieve\/pii\/S1077314220301120","DOI":"10.1016\/j.cviu.2020.103077"},{"key":"414_CR42","doi-asserted-by":"crossref","unstructured":"Hsu Y-W, Wang T-Y, Perng J-W. Passenger flow counting in buses based on deep learning using surveillance video. Optik . 2020;202:163675. https:\/\/linkinghub.elsevier.com\/retrieve\/pii\/S0030402619315736","DOI":"10.1016\/j.ijleo.2019.163675"},{"key":"414_CR43","doi-asserted-by":"crossref","unstructured":"Singh B, Davis LS. An analysis of scale invariance in object detection\u2013SNIP. 
2018 IEEE\/CVF Conference on computer vision and pattern recognition. IEEE; 2018. p. 3578\u201387. https:\/\/ieeexplore.ieee.org\/document\/8578475\/","DOI":"10.1109\/CVPR.2018.00377"},{"key":"414_CR44","doi-asserted-by":"crossref","unstructured":"Yang F, Choi W, Lin Y. Exploit All the Layers: Fast and Accurate CNN object detector with scale dependent pooling and cascaded rejection classifiers. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) . IEEE; 2016. p. 2129\u201337. http:\/\/ieeexplore.ieee.org\/document\/7780603\/","DOI":"10.1109\/CVPR.2016.234"},{"key":"414_CR45","unstructured":"Singh B, Najibi M, Davis LS. SNIPER: Efficient Multi-Scale Training. 32nd conference on neural information processing systems. Montr\u00e9al; 2018. http:\/\/arxiv.org\/abs\/1805.09300"},{"key":"414_CR46","doi-asserted-by":"crossref","unstructured":"Lin T-Y, Dollar P, Girshick R, He K, Hariharan B, Belongie S. Feature Pyramid Networks for Object Detection. 2017 IEEE conference on computer vision and pattern recognition (CVPR). IEEE; 2017. p. 936\u201344. http:\/\/ieeexplore.ieee.org\/document\/8099589\/","DOI":"10.1109\/CVPR.2017.106"},{"key":"414_CR47","doi-asserted-by":"crossref","unstructured":"Lin T-Y, Goyal P, Girshick R, He K, Dollar P. Focal Loss for Dense Object Detection. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2020;42:318\u201327. https:\/\/ieeexplore.ieee.org\/document\/8417976\/","DOI":"10.1109\/TPAMI.2018.2858826"},{"key":"414_CR48","doi-asserted-by":"crossref","unstructured":"Dollar P, Wojek C, Schiele B, Perona P. Pedestrian detection: a benchmark. 2009 IEEE Conference on Computer Vision and Pattern Recognition . IEEE; 2009. p. 304\u201311. https:\/\/ieeexplore.ieee.org\/document\/5206631\/","DOI":"10.1109\/CVPRW.2009.5206631"},{"key":"414_CR49","unstructured":"Zhong Z, Zheng L, Kang G, Li S, Yang Y. Random Erasing Data Augmentation. 2017. 
http:\/\/arxiv.org\/abs\/1708.04896"},{"key":"414_CR50","doi-asserted-by":"crossref","unstructured":"Wang X, Shrivastava A, Gupta A. A-Fast-RCNN: Hard positive generation via adversary for object detection. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE; 2017. p. 3039\u201348. http:\/\/arxiv.org\/abs\/1704.03414","DOI":"10.1109\/CVPR.2017.324"},{"key":"414_CR51","doi-asserted-by":"crossref","unstructured":"Badrinarayanan V, Kendall A, Cipolla R. SegNet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2017;39:2481\u201395. http:\/\/arxiv.org\/abs\/1511.00561","DOI":"10.1109\/TPAMI.2016.2644615"},{"key":"414_CR52","doi-asserted-by":"crossref","unstructured":"Ronneberger O, Fischer P, Brox T. U-Net: Convolutional networks for biomedical image segmentation. 2015. p. 234\u201341. http:\/\/arxiv.org\/abs\/1505.04597","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"414_CR53","doi-asserted-by":"crossref","unstructured":"Diakogiannis FI, Waldner F, Caccetta P, Wu C. ResUNet-a: A deep learning framework for semantic segmentation of remotely sensed data. ISPRS Journal of Photogrammetry and Remote Sensing . 2020;162:94\u2013114. https:\/\/linkinghub.elsevier.com\/retrieve\/pii\/S0924271620300149","DOI":"10.1016\/j.isprsjprs.2020.01.013"},{"key":"414_CR54","unstructured":"Yurtsever E, Lambert J, Carballo A, Takeda K. A survey of autonomous driving: common practices and emerging technologies. 2019. http:\/\/arxiv.org\/abs\/1906.05113"},{"key":"414_CR55","doi-asserted-by":"crossref","unstructured":"Tabernik D, \u0160ela S, Skvar\u010d J, Sko\u010daj D. Segmentation-based deep-learning approach for surface-defect detection. 2019. http:\/\/arxiv.org\/abs\/1903.08536","DOI":"10.1007\/s10845-019-01476-x"},{"key":"414_CR56","doi-asserted-by":"crossref","unstructured":"Rizwan I Haque I, Neubert J. Deep learning approaches to biomedical image segmentation. 
Informatics in Medicine Unlocked. 2020;18:100297. https:\/\/linkinghub.elsevier.com\/retrieve\/pii\/S235291481930214X","DOI":"10.1016\/j.imu.2020.100297"},{"key":"414_CR57","doi-asserted-by":"crossref","unstructured":"Cordts M, Omran M, Ramos S, Rehfeld T, Enzweiler M, Benenson R, et al. The cityscapes dataset for semantic urban scene understanding. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2016;2016-Decem:3213\u201323.","DOI":"10.1109\/CVPR.2016.350"},{"key":"414_CR58","doi-asserted-by":"crossref","unstructured":"Menze BH, Jakab A, Bauer S, Kalpathy-Cramer J, Farahani K, Kirby J, et al. The multimodal brain tumor image segmentation benchmark (BRATS). IEEE Transac Med Imag. 2015;34:1993\u20132024. http:\/\/ieeexplore.ieee.org\/document\/6975210\/","DOI":"10.1109\/TMI.2014.2377694"},{"key":"414_CR59","volume-title":"Machine learning: a probabilistic perspective (Adaptive Computation and Machine Learning series)","author":"KP Murphy","year":"2012","unstructured":"Murphy KP. Machine learning: a probabilistic perspective (Adaptive Computation and Machine Learning series). Cambridge: The MIT Press; 2012."},{"key":"414_CR60","doi-asserted-by":"crossref","unstructured":"Milletari F, Navab N, Ahmadi S-A. V-Net: Fully convolutional neural networks for volumetric medical image segmentation. 2016 Fourth International Conference on 3D Vision (3DV) . IEEE; 2016. p. 565\u201371. http:\/\/ieeexplore.ieee.org\/document\/7785132\/","DOI":"10.1109\/3DV.2016.79"},{"key":"414_CR61","doi-asserted-by":"crossref","unstructured":"Crum WR, Camara O, Hill DLG. Generalized Overlap Measures for Evaluation and Validation in Medical Image Analysis. IEEE Transact Med Imag. 2006;25:1451\u201361. http:\/\/ieeexplore.ieee.org\/document\/1717643\/","DOI":"10.1109\/TMI.2006.880587"},{"key":"414_CR62","doi-asserted-by":"crossref","unstructured":"Salehi SSM, Erdogmus D, Gholipour A. 
Tversky loss function for image segmentation using 3D fully convolutional deep networks. 2017. p. 379\u201387. http:\/\/arxiv.org\/abs\/1706.05721","DOI":"10.1007\/978-3-319-67389-9_44"},{"key":"414_CR63","doi-asserted-by":"crossref","unstructured":"Berman M, Triki AR, Blaschko MB. The Lovasz-Softmax Loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks. 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition . IEEE; 2018. p. 4413\u201321. https:\/\/ieeexplore.ieee.org\/document\/8578562\/","DOI":"10.1109\/CVPR.2018.00464"},{"key":"414_CR64","doi-asserted-by":"crossref","unstructured":"He Z, Zuo W, Kan M, Shan S, Chen X. AttGAN: Facial attribute editing by only changing what you want. IEEE transactions on image processing . 2019;28:5464\u201378. https:\/\/ieeexplore.ieee.org\/document\/8718508\/","DOI":"10.1109\/TIP.2019.2916751"},{"key":"414_CR65","unstructured":"Perarnau G, van de Weijer J, Raducanu B, \u00c1lvarez JM. Invertible Conditional GANs for image editing. Conference on Neural Information Processing Systems . 2016. http:\/\/arxiv.org\/abs\/1611.06355"},{"key":"414_CR66","doi-asserted-by":"crossref","unstructured":"Tao R, Li Z, Tao R, Li B. ResAttr-GAN: Unpaired deep residual attributes learning for multi-domain face image translation. IEEE Access . 2019;7:132594\u2013608. https:\/\/ieeexplore.ieee.org\/document\/8836502\/","DOI":"10.1109\/ACCESS.2019.2941272"},{"key":"414_CR67","first-page":"2672","volume":"3","author":"IJ Goodfellow","year":"2014","unstructured":"Goodfellow IJ, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, et al. Generative adversarial nets. Adv Neural Inf Process Syst. 2014;3:2672\u201380.","journal-title":"Adv Neural Inf Process Syst"},{"key":"414_CR68","unstructured":"Bowles C, Chen L, Guerrero R, Bentley P, Gunn R, Hammers A, et al. GAN Augmentation: augmenting training data using generative adversarial networks. 
2018; http:\/\/arxiv.org\/abs\/1810.10863"},{"key":"414_CR69","unstructured":"Oord A van den, Kalchbrenner N, Kavukcuoglu K. Pixel recurrent neural networks. 2016; http:\/\/arxiv.org\/abs\/1601.06759"},{"key":"414_CR70","unstructured":"Hinton GE, Sejnowski TJ. Learning and relearning in Boltzmann machines. In: Jordan MI, Sejnowski TJ, editors. Graphical models: foundations of neural computation. MIT Press; 2001."},{"key":"414_CR71","unstructured":"Smolensky P. Information processing in dynamical systems: foundations of harmony theory. In: Rumelhart DE, McClelland JL, editors. Parallel distributed processing: explorations in the microstructure of cognition: foundations. MIT Press; 1987. p. 194\u2013281."},{"key":"414_CR72","doi-asserted-by":"publisher","first-page":"504","DOI":"10.1126\/science.1127647","volume":"313","author":"GE Hinton","year":"2006","unstructured":"Hinton GE, Salakhutdinov RR. Reducing the dimensionality of data with neural networks. Science. 2006;313:504\u20137.","journal-title":"Science"},{"key":"414_CR73","first-page":"448","volume":"5","author":"R Salakhutdinov","year":"2009","unstructured":"Salakhutdinov R, Hinton G. Deep Boltzmann machines. J Machine Learn Res. 2009;5:448\u201355.","journal-title":"J Machine Learn Res"},{"key":"414_CR74","doi-asserted-by":"crossref","unstructured":"Lee H, Grosse R, Ranganath R, Y. Ng A. Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations. Computer Science Department, Stanford University . 2009;8. http:\/\/robotics.stanford.edu\/~ang\/papers\/icml09-ConvolutionalDeepBeliefNetworks.pdf","DOI":"10.1145\/1553374.1553453"},{"key":"414_CR75","doi-asserted-by":"publisher","first-page":"1527","DOI":"10.1162\/neco.2006.18.7.1527","volume":"18","author":"GE Hinton","year":"2006","unstructured":"Hinton GE, Osindero S, Teh Y-W. A fast learning algorithm for deep belief nets. Neural Comput. 2006;18:1527\u201354. 
https:\/\/doi.org\/10.1162\/neco.2006.18.7.1527.","journal-title":"Neural Comput."},{"key":"414_CR76","unstructured":"Ramachandran P, Paine T Le, Khorrami P, Babaeizadeh M, Chang S, Zhang Y, et al. Fast generation for convolutional autoregressive models. 2017; http:\/\/arxiv.org\/abs\/1704.06001"},{"key":"414_CR77","doi-asserted-by":"publisher","DOI":"10.7551\/mitpress\/3348.001.0001","volume-title":"Graphical models for machine learning and digital communication","author":"BJ Frey","year":"1998","unstructured":"Frey BJ. Graphical models for machine learning and digital communication. Cambridge: MIT Press; 1998."},{"key":"414_CR78","unstructured":"Frey BJ, Hinton GE, Dayan P. Does the Wake-sleep algorithm produce good density estimators? Advances in neural information processing systems . 1996;13:661\u201370. http:\/\/www.cs.utoronto.ca\/~hinton\/absps\/wsperf.pdf%5Cnpapers2:\/\/publication\/uuid\/BCC0547E-7C14-42EC-8693-D800C5819C79"},{"key":"414_CR79","unstructured":"Uria B, C\u00f4t\u00e9 M-A, Gregor K, Murray I, Larochelle H. Neural autoregressive distribution estimation. J Mach Learn Res. 2016;17:1\u201337. http:\/\/arxiv.org\/abs\/1605.02226"},{"key":"414_CR80","doi-asserted-by":"crossref","unstructured":"Schuller B, W\u00f6llmer M, Moosmayr T, Rigoll G. Recognition of noisy speech: a comparative survey of robust model architecture and feature enhancement. EURASIP J Audio Speech Music Process. 2009;2009:942617. http:\/\/asmp.eurasipjournals.com\/content\/2009\/1\/942617","DOI":"10.1155\/2009\/942617"},{"key":"414_CR81","doi-asserted-by":"crossref","unstructured":"Yang S, Lu H, Kang S, Xue L, Xiao J, Su D, et al. On the localness modeling for the self-attention based end-to-end speech synthesis. Neural Netw. 2020;125:121\u201330. https:\/\/linkinghub.elsevier.com\/retrieve\/pii\/S0893608020300447","DOI":"10.1016\/j.neunet.2020.01.034"},{"key":"414_CR82","doi-asserted-by":"crossref","unstructured":"Ghosh R, Vamshi C, Kumar P. 
RNN based online handwritten word recognition in Devanagari and Bengali scripts using horizontal zoning. Pattern Recognit. 2019;92:203\u201318. https:\/\/linkinghub.elsevier.com\/retrieve\/pii\/S0031320319301384","DOI":"10.1016\/j.patcog.2019.03.030"},{"key":"414_CR83","doi-asserted-by":"crossref","unstructured":"Chen J, Zhuge H. Extractive summarization of documents with images based on multi-modal RNN. Future Generat Comput Syst. 2019;99:186\u201396. https:\/\/linkinghub.elsevier.com\/retrieve\/pii\/S0167739X18326876","DOI":"10.1016\/j.future.2019.04.045"},{"key":"414_CR84","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S Hochreiter","year":"1997","unstructured":"Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput. 1997;9:1735\u201380. https:\/\/doi.org\/10.1162\/neco.1997.9.8.1735.","journal-title":"Neural Comput."},{"key":"414_CR85","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, et al. Attention is all you need. arXiv . 2017; http:\/\/arxiv.org\/abs\/1706.03762"},{"key":"414_CR86","unstructured":"Theis L, Bethge M. Generative Image Modeling Using Spatial LSTMs. Proceedings of the 28th International Conference on Neural Information Processing Systems\u2013Volume 2. Cambridge: MIT Press; 2015. p. 1927\u20131935."},{"key":"414_CR87","unstructured":"Krizhevsky A. Learning multiple layers of features from tiny images . 2009. http:\/\/www.cs.toronto.edu\/~kriz\/cifar.html"},{"key":"414_CR88","doi-asserted-by":"publisher","first-page":"211","DOI":"10.1007\/s11263-015-0816-y","volume":"115","author":"O Russakovsky","year":"2015","unstructured":"Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, et al. ImageNet large scale visual recognition challenge. Int J Comput Vis. 2015;115:211\u201352. 
https:\/\/doi.org\/10.1007\/s11263-015-0816-y.","journal-title":"Int J Comput Vis."},{"key":"414_CR89","unstructured":"Oord A van den, Kalchbrenner N, Vinyals O, Espeholt L, Graves A, Kavukcuoglu K. Conditional image generation with PixelCNN Decoders. http:\/\/arxiv.org\/abs\/1606.05328"},{"key":"414_CR90","unstructured":"Salimans T, Karpathy A, Chen X, Kingma DP. PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood and other modifications. 2017; http:\/\/arxiv.org\/abs\/1701.05517"},{"key":"414_CR91","unstructured":"Chen X, Mishra N, Rohaninejad M, Abbeel P. PixelSNAIL: an improved autoregressive generative model. 2017. http:\/\/arxiv.org\/abs\/1712.09763"},{"key":"414_CR92","doi-asserted-by":"crossref","unstructured":"Vincent P, Larochelle H, Bengio Y, Manzagol P-A. Extracting and composing robust features with denoising autoencoders. Proceedings of the 25th international conference on Machine learning - ICML \u201908 . New York: ACM Press; 2008. p. 1096\u2013103. https:\/\/linkinghub.elsevier.com\/retrieve\/pii\/S0925231218306155","DOI":"10.1145\/1390156.1390294"},{"key":"414_CR93","unstructured":"Baldi P. Autoencoders, unsupervised learning, and deep architectures . PMLR; 2012. http:\/\/proceedings.mlr.press\/v27\/baldi12a.html"},{"key":"414_CR94","unstructured":"Y. Ng A. Sparse autoencoder .https:\/\/web.stanford.edu\/class\/cs294a\/sparseAutoencoder.pdf"},{"key":"414_CR95","doi-asserted-by":"publisher","unstructured":"Masci J, Meier U, Cire\u015fan D, Schmidhuber J. Stacked convolutional auto-encoders for hierarchical feature extraction. 2011. p. 52\u20139. https:\/\/doi.org\/10.1007\/978-3-642-21735-7_7","DOI":"10.1007\/978-3-642-21735-7_7"},{"key":"414_CR96","doi-asserted-by":"crossref","unstructured":"Rifai S, Vincent P, Muller X, Glorot X, Bengio Y. Contractive auto-encoders: explicit invariance during feature extraction. ICML. 2011.","DOI":"10.1007\/978-3-642-23783-6_41"},{"key":"414_CR97","unstructured":"Kingma DP, Welling M. 
Auto-encoding variational bayes. 2013; http:\/\/arxiv.org\/abs\/1312.6114"},{"key":"414_CR98","doi-asserted-by":"crossref","unstructured":"Tan S, Li B. Stacked convolutional auto-encoders for steganalysis of digital images. Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific. IEEE; 2014. p. 1\u20134.","DOI":"10.1109\/APSIPA.2014.7041565"},{"key":"414_CR99","unstructured":"Germain M, Gregor K, Murray I, Larochelle H. MADE: Masked autoencoder for distribution estimation. 2015. http:\/\/arxiv.org\/abs\/1502.03509"},{"key":"414_CR100","doi-asserted-by":"publisher","first-page":"863","DOI":"10.1162\/neco.1992.4.6.863","volume":"4","author":"J Schmidhuber","year":"1992","unstructured":"Schmidhuber J. Learning factorial codes by predictability minimization. Neural Comput. 1992;4:863\u201379. https:\/\/doi.org\/10.1162\/neco.1992.4.6.863.","journal-title":"Neural Comput."},{"key":"414_CR101","first-page":"3483","volume":"2015-Janua","author":"K Sohn","year":"2015","unstructured":"Sohn K, Yan X, Lee H. Learning structured output representation using deep conditional generative models. Adv Neural Informat Process Syst. 2015;2015-Janua:3483\u201391.","journal-title":"Adv Neural Informat Process Syst."},{"key":"414_CR102","unstructured":"Higgins I, Matthey L, Pal A, Burgess C, Glorot X, Botvinick M, et al. \u0392-VAE: Learning basic visual concepts with a constrained variational framework. 5th International Conference on Learning Representations, ICLR 2017\u2013Conference Track Proceedings. 2019;1\u201313."},{"key":"414_CR103","unstructured":"Kulkarni TD, Whitney W, Kohli P, Tenenbaum JB. Deep convolutional inverse graphics network. 2015. http:\/\/arxiv.org\/abs\/1503.03167"},{"key":"414_CR104","unstructured":"Huang C-W, Sankaran K, Dhekane E, Lacoste A, Courville A. Hierarchical Importance Weighted Autoencoders. In: Chaudhuri K, Salakhutdinov R, editors. Long Beach, California, USA: PMLR; 2019. p. 2869\u201378. 
http:\/\/proceedings.mlr.press\/v97\/huang19d.html"},{"key":"414_CR105","unstructured":"Gulrajani I, Kumar K, Ahmed F, Taiga AA, Visin F, Vazquez D, et al. PixelVAE: A latent variable model for natural images. 2016; http:\/\/arxiv.org\/abs\/1611.05013"},{"key":"414_CR106","unstructured":"Chen X, Kingma DP, Salimans T, Duan Y, Dhariwal P, Schulman J, et al. Variational Lossy Autoencoder. 2016. http:\/\/arxiv.org\/abs\/1611.02731"},{"key":"414_CR107","unstructured":"Gregor K, Danihelka I, Graves A, Rezende DJ, Wierstra D. DRAW: A recurrent neural network for image generation. 2015. http:\/\/arxiv.org\/abs\/1502.04623"},{"key":"414_CR108","unstructured":"Oord A van den, Vinyals O, Kavukcuoglu K. Neural Discrete Representation Learning. 31st Conference on Neural Information Processing Systems . Long Beach, California, USA; 2017. http:\/\/arxiv.org\/abs\/1711.00937"},{"key":"414_CR109","unstructured":"Razavi A, Oord A van den, Vinyals O. Generating diverse high-fidelity images with VQ-VAE-2. Advances in neural information processing systems 32. 2019. http:\/\/arxiv.org\/abs\/1906.00446"},{"key":"414_CR110","unstructured":"Husz\u00e1r F. How (not) to Train your generative model: scheduled sampling, likelihood, adversary? 2015. http:\/\/arxiv.org\/abs\/1511.05101"},{"key":"414_CR111","unstructured":"Lotter W, Kreiman G, Cox D. Deep Predictive coding networks for video prediction and unsupervised learning. 2016. http:\/\/arxiv.org\/abs\/1605.08104"},{"key":"414_CR112","unstructured":"Radford A, Metz L, Chintala S. Unsupervised representation learning with deep convolutional generative adversarial networks. 2015. http:\/\/arxiv.org\/abs\/1511.06434"},{"key":"414_CR113","unstructured":"Makhzani A, Shlens J, Jaitly N, Goodfellow I, Frey B. Adversarial Autoencoders. 2015; Available from: http:\/\/arxiv.org\/abs\/1511.05644"},{"key":"414_CR114","unstructured":"Dumoulin V, Belghazi I, Poole B, Mastropietro O, Lamb A, Arjovsky M, et al. Adversarially Learned Inference. 2016. 
http:\/\/arxiv.org\/abs\/1606.00704"},{"key":"414_CR115","unstructured":"Larsen ABL, S\u00f8nderby SK, Larochelle H, Winther O. Autoencoding beyond pixels using a learned similarity metric. 2015. http:\/\/arxiv.org\/abs\/1512.09300"},{"key":"414_CR116","unstructured":"Zhong G, Gao W, Liu Y, Yang Y. Generative Adversarial networks with decoder-encoder output noise. 2018. http:\/\/arxiv.org\/abs\/1807.03923"},{"key":"414_CR117","unstructured":"Srivastava A, Valkov L, Russell C, Gutmann MU, Sutton C. VEEGAN: Reducing Mode Collapse in GANs using implicit variational learning. 2017. http:\/\/arxiv.org\/abs\/1705.07761"},{"key":"414_CR118","unstructured":"Mirza M, Osindero S. Conditional generative adversarial nets. 2014. http:\/\/arxiv.org\/abs\/1411.1784"},{"key":"414_CR119","unstructured":"Odena A, Olah C, Shlens J. Conditional image synthesis with auxiliary classifier GANs. 2016. http:\/\/arxiv.org\/abs\/1610.09585"},{"key":"414_CR120","unstructured":"Bazrafkan S, Corcoran P. Versatile auxiliary classifier with generative adversarial network (VAC+GAN), Multi Class Scenarios. 2018. http:\/\/arxiv.org\/abs\/1806.07751"},{"key":"414_CR121","unstructured":"Chen X, Duan Y, Houthooft R, Schulman J, Sutskever I, Abbeel P. InfoGAN: Interpretable representation learning by information maximizing generative adversarial nets. 2016. http:\/\/arxiv.org\/abs\/1606.03657"},{"key":"414_CR122","doi-asserted-by":"crossref","unstructured":"Li X, Chen L, Wang L, Wu P, Tong W. SCGAN: disentangled representation learning by adding similarity constraint on generative adversarial nets. IEEE Access . 2019;7:147928\u201338. https:\/\/ieeexplore.ieee.org\/document\/8476290\/","DOI":"10.1109\/ACCESS.2018.2872695"},{"key":"414_CR123","unstructured":"Arjovsky M, Chintala S, Bottou L. Wasserstein GAN. 2017. http:\/\/arxiv.org\/abs\/1701.07875"},{"key":"414_CR124","unstructured":"Gulrajani I, Ahmed F, Arjovsky M, Dumoulin V, Courville A. Improved training of Wasserstein GANs. 2017. 
http:\/\/arxiv.org\/abs\/1704.00028"},{"key":"414_CR125","unstructured":"Petzka H, Fischer A, Lukovnicov D. On the regularization of Wasserstein GANs. 2017. http:\/\/arxiv.org\/abs\/1709.08894"},{"key":"414_CR126","doi-asserted-by":"crossref","unstructured":"Mao X, Li Q, Xie H, Lau RYK, Wang Z, Smolley SP. Least squares generative adversarial networks. 2016. http:\/\/arxiv.org\/abs\/1611.04076","DOI":"10.1109\/ICCV.2017.304"},{"key":"414_CR127","unstructured":"Zhao J, Mathieu M, LeCun Y. Energy-based Generative Adversarial Network. 2016. http:\/\/arxiv.org\/abs\/1609.03126"},{"key":"414_CR128","unstructured":"Berthelot D, Schumm T, Metz L. BEGAN: Boundary Equilibrium Generative Adversarial Networks. 2017. http:\/\/arxiv.org\/abs\/1703.10717"},{"key":"414_CR129","unstructured":"Wang R, Cully A, Chang HJ, Demiris Y. MAGAN: Margin adaptation for generative adversarial networks. 2017. http:\/\/arxiv.org\/abs\/1704.03817"},{"key":"414_CR130","first-page":"66","volume":"2017","author":"J Zhao","year":"2017","unstructured":"Zhao J, Xiong L, Jayashree K, Li J, Zhao F, Wang Z, et al. Dual-agent GANs for photorealistic and identity preserving profile face synthesis. Advan Neural Informat Process Syst. 2017;2017:66\u201376.","journal-title":"Advan Neural Informat Process Syst."},{"key":"414_CR131","unstructured":"Karras T, Aila T, Laine S, Lehtinen J. Progressive growing of GANs for improved quality, stability, and variation. 2017; http:\/\/arxiv.org\/abs\/1710.10196"},{"key":"414_CR132","unstructured":"Denton E, Chintala S, Szlam A, Fergus R. Deep generative image models using a laplacian pyramid of adversarial networks. Advances in Neural Information Processing Systems 28 . 2015. http:\/\/arxiv.org\/abs\/1506.05751"},{"key":"414_CR133","unstructured":"Im DJ, Kim CD, Jiang H, Memisevic R. Generating images with recurrent adversarial networks. 2016; http:\/\/arxiv.org\/abs\/1602.05110"},{"key":"414_CR134","unstructured":"Nguyen TD, Le T, Vu H, Phung D. 
Dual discriminator generative adversarial Nets. 2017; http:\/\/arxiv.org\/abs\/1709.03831"},{"key":"414_CR135","doi-asserted-by":"crossref","unstructured":"Ghosh A, Kulharia V, Namboodiri V, Torr PHS, Dokania PK. Multi-agent diverse generative adversarial networks. 2017. http:\/\/arxiv.org\/abs\/1704.02906","DOI":"10.1109\/CVPR.2018.00888"},{"key":"414_CR136","unstructured":"Liu M-Y, Tuzel O. Coupled generative adversarial networks. conference on neural information processing systems. 2016. http:\/\/arxiv.org\/abs\/1606.07536"},{"key":"414_CR137","unstructured":"Kim T, Cha M, Kim H, Lee JK, Kim J. Learning to discover cross-domain relations with generative adversarial networks. 2017. http:\/\/arxiv.org\/abs\/1703.05192"},{"key":"414_CR138","doi-asserted-by":"crossref","unstructured":"Zhu J-Y, Park T, Isola P, Efros AA. Unpaired Image-to-image translation using cycle-consistent adversarial networks. 2017 IEEE International Conference on Computer Vision (ICCV) . IEEE; 2017. p. 2242\u201351. http:\/\/arxiv.org\/abs\/1703.10593","DOI":"10.1109\/ICCV.2017.244"},{"key":"414_CR139","doi-asserted-by":"crossref","unstructured":"Ledig C, Theis L, Huszar F, Caballero J, Cunningham A, Acosta A, et al. Photo-realistic single image super-resolution using a generative adversarial network. 2016; http:\/\/arxiv.org\/abs\/1609.04802","DOI":"10.1109\/CVPR.2017.19"},{"key":"414_CR140","unstructured":"Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. 2014; http:\/\/arxiv.org\/abs\/1409.1556"},{"key":"414_CR141","unstructured":"Zhang H, Goodfellow I, Metaxas D, Odena A. Self-Attention Generative Adversarial Networks. 2018; http:\/\/arxiv.org\/abs\/1805.08318"},{"key":"414_CR142","doi-asserted-by":"crossref","unstructured":"Isola P, Zhu J-Y, Zhou T, Efros AA. Image-to-image translation with conditional adversarial networks. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE; 2017. p. 5967\u201376. 
http:\/\/ieeexplore.ieee.org\/document\/8100115\/","DOI":"10.1109\/CVPR.2017.632"},{"key":"414_CR143","doi-asserted-by":"crossref","unstructured":"Wang T-C, Liu M-Y, Zhu J-Y, Tao A, Kautz J, Catanzaro B. High-resolution image synthesis and semantic manipulation with conditional GANs. 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition . IEEE; 2018. p. 8798\u2013807. https:\/\/ieeexplore.ieee.org\/document\/8579015\/","DOI":"10.1109\/CVPR.2018.00917"},{"key":"414_CR144","unstructured":"Bellemare MG, Danihelka I, Dabney W, Mohamed S, Lakshminarayanan B, Hoyer S, et al. The cramer distance as a solution to biased wasserstein gradients. 2017. http:\/\/arxiv.org\/abs\/1705.10743"},{"key":"414_CR145","unstructured":"Mroueh Y, Sercu T, Goel V. McGan: mean and covariance feature matching GAN. 2017. http:\/\/arxiv.org\/abs\/1702.08398"},{"key":"414_CR146","unstructured":"Li C-L, Chang W-C, Cheng Y, Yang Y, P\u00f3czos B. MMD GAN: towards deeper understanding of moment matching network. 2017. http:\/\/arxiv.org\/abs\/1705.08584"},{"key":"414_CR147","unstructured":"Mroueh Y, Sercu T. Fisher GAN. 2017. http:\/\/arxiv.org\/abs\/1705.09675"},{"key":"414_CR148","unstructured":"Salimans T, Goodfellow I, Zaremba W, Cheung V, Radford A, Chen X. Improved techniques for training GANs. 2016. http:\/\/arxiv.org\/abs\/1606.03498"},{"key":"414_CR149","unstructured":"S\u00f8nderby CK, Caballero J, Theis L, Shi W, Husz\u00e1r F. Amortised MAP inference for image super-resolution. 2016. http:\/\/arxiv.org\/abs\/1610.04490"},{"key":"414_CR150","unstructured":"Heusel M, Ramsauer H, Unterthiner T, Nessler B, Hochreiter S. GANs trained by a two time-scale update rule converge to a local nash equilibrium. 2017. http:\/\/arxiv.org\/abs\/1706.08500"},{"key":"414_CR151","unstructured":"Miyato T, Kataoka T, Koyama M, Yoshida Y. Spectral normalization for generative adversarial networks. 2018. 
http:\/\/arxiv.org\/abs\/1802.05957"},{"key":"414_CR152","unstructured":"Heath M, Bowyer K, Kopans D, Moore R, Kegelmeyer WP. Digital database for screening mammography . https:\/\/www.mammoimage.org\/databases\/"},{"key":"414_CR153","first-page":"1079","volume":"20","author":"LM Shoohi","year":"2020","unstructured":"Shoohi LM, Saud JH. Dcgan for handling imbalanced malaria dataset based on over-sampling technique and using cnn. Medico-Legal Update. 2020;20:1079\u201385.","journal-title":"Medico-Legal Update."},{"key":"414_CR154","doi-asserted-by":"crossref","unstructured":"Niu S, Li B, Wang X, Lin H. Defect image sample generation With GAN for Improving defect recognition. IEEE Transactions on Automation Science and Engineering . 2020;1\u201312. https:\/\/ieeexplore.ieee.org\/document\/9000806\/","DOI":"10.1109\/TASE.2020.2967415"},{"key":"414_CR155","unstructured":"Mariani G, Scheidegger F, Istrate R, Bekas C, Malossi C. BAGAN: Data Augmentation with Balancing GAN. 2018; http:\/\/arxiv.org\/abs\/1803.09655"},{"key":"414_CR156","doi-asserted-by":"publisher","unstructured":"Wu E, Wu K, Cox D, Lotter W. Conditional infilling GANs for data augmentation in mammogram classification. 2018. p. 98\u2013106. Doi: https:\/\/doi.org\/10.1007\/978-3-030-00946-5_11","DOI":"10.1007\/978-3-030-00946-5_11"},{"key":"414_CR157","doi-asserted-by":"crossref","unstructured":"Muramatsu C, Nishio M, Goto T, Oiwa M, Morita T, Yakami M, et al. Improving breast mass classification by shared data with domain transformation using a generative adversarial network. Comput Biol Med. 2020;119:103698. https:\/\/linkinghub.elsevier.com\/retrieve\/pii\/S001048252030086X","DOI":"10.1016\/j.compbiomed.2020.103698"},{"key":"414_CR158","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1117\/1.JMI.6.3.031411.full","volume":"6","author":"S Guan","year":"2019","unstructured":"Guan S. 
Breast cancer detection using synthetic mammograms from generative adversarial networks in convolutional neural networks. J Med Imag. 2019;6:1. https:\/\/doi.org\/10.1117\/1.JMI.6.3.031411.full.","journal-title":"J Med Imag."},{"key":"414_CR159","doi-asserted-by":"crossref","unstructured":"Waheed A, Goyal M, Gupta D, Khanna A, Al-Turjman F, Pinheiro PR. CovidGAN: Data augmentation using auxiliary classifier GAN for improved Covid-19 detection. IEEE Access . 2020;8:91916\u201323. https:\/\/ieeexplore.ieee.org\/document\/9093842\/","DOI":"10.1109\/ACCESS.2020.2994762"},{"key":"414_CR160","unstructured":"COVID-19 Chest X-Ray dataset initiative. https:\/\/github.com\/agchung\/Figure1-COVID-chestxray-dataset"},{"key":"414_CR161","doi-asserted-by":"crossref","unstructured":"Cohen JP, Morrison P, Dao L, Roth K, Duong TQ, Ghassemi M. COVID-19 Image data collection: prospective predictions are the future. 2020. http:\/\/arxiv.org\/abs\/2006.11988","DOI":"10.59275\/j.melba.2020-48g7"},{"key":"414_CR162","unstructured":"Covid19 radiography database. https:\/\/www.kaggle.com\/tawsifurrahman\/covid19-radiography-database"},{"key":"414_CR163","doi-asserted-by":"publisher","unstructured":"Hase N, Ito S, Kanaeko N, Sumi K. Data augmentation for intra-class imbalance with generative adversarial network. In: Cudel C, Bazeille S, Verrier N, editors. Fourteenth International Conference on Quality Control by Artificial Vision . SPIE; 2019. p. 56. Available from: https:\/\/www.spiedigitallibrary.org\/conference-proceedings-of-spie\/11172\/2521692\/Data-augmentation-for-intra-class-imbalance-with-generative-adversarial-network\/https:\/\/doi.org\/10.1117\/12.2521692.full","DOI":"10.1117\/12.2521692.full"},{"key":"414_CR164","unstructured":"Donahue C, Lipton ZC, Balsubramani A, McAuley J. Semantically Decomposing the Latent Spaces of Generative Adversarial Networks. 
2017; http:\/\/arxiv.org\/abs\/1705.07904"},{"key":"414_CR165","doi-asserted-by":"publisher","unstructured":"Wang Y, Gong D, Zhou Z, Ji X, Wang H, Li Z, et al. Orthogonal deep features decomposition for age-invariant face recognition. 2018. p. 764\u201379. https:\/\/doi.org\/10.1007\/978-3-030-01267-0_45","DOI":"10.1007\/978-3-030-01267-0_45"},{"key":"414_CR166","doi-asserted-by":"crossref","unstructured":"Gong D, Li Z, Lin D, Liu J, Tang X. Hidden factor analysis for age invariant face recognition. 2013 IEEE International Conference on Computer Vision. IEEE; 2013. p. 2872\u20139. http:\/\/ieeexplore.ieee.org\/document\/6751468\/","DOI":"10.1109\/ICCV.2013.357"},{"key":"414_CR167","doi-asserted-by":"crossref","unstructured":"Yin X, Liu X. Multi-task convolutional neural network for pose-invariant face recognition. IEEE Transactions on Image Processing. 2018;27:964\u201375. http:\/\/ieeexplore.ieee.org\/document\/8080244\/","DOI":"10.1109\/TIP.2017.2765830"},{"key":"414_CR168","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1186\/s13640-015-0089-y","volume":"2015","author":"P Carcagn\u00ec","year":"2015","unstructured":"Carcagn\u00ec P, Del CM, Cazzato D, Leo M, Distante C. A study on different experimental configurations for age, race, and gender estimation problems. EURASIP J Image Video Process. 2015;2015:37. https:\/\/doi.org\/10.1186\/s13640-015-0089-y.","journal-title":"EURASIP J Image Video Process."},{"key":"414_CR169","unstructured":"Ziwei L, Ping L, Xiaogang W, Tang X. Large-scale CelebFaces attributes (CelebA) Dataset. 2018. http:\/\/mmlab.ie.cuhk.edu.hk\/projects\/CelebA.html"},{"key":"414_CR170","doi-asserted-by":"crossref","unstructured":"Zhang J, Li A, Liu Y, Wang M. Adversarially Regularized U-Net-based GANs for facial attribute modification and generation. IEEE Access . 2019;7:86453\u201362. 
https:\/\/ieeexplore.ieee.org\/document\/8754728\/","DOI":"10.1109\/ACCESS.2019.2926633"},{"key":"414_CR171","doi-asserted-by":"publisher","unstructured":"Zhang G, Kan M, Shan S, Chen X. Generative adversarial network with spatial attention for face attribute editing. 2018. p. 422\u201337. https:\/\/doi.org\/10.1007\/978-3-030-01231-1_26","DOI":"10.1007\/978-3-030-01231-1_26"},{"key":"414_CR172","doi-asserted-by":"crossref","unstructured":"Zheng Z, Yang X, Yu Z, Zheng L, Yang Y, Kautz J. joint discriminative and generative learning for person re-identification. 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) . IEEE; 2019. p. 2133\u201342. https:\/\/ieeexplore.ieee.org\/document\/8954292\/","DOI":"10.1109\/CVPR.2019.00224"},{"key":"414_CR173","doi-asserted-by":"crossref","unstructured":"Zhang X, Gao Y. Face recognition across pose: a review. pattern recognition . 2009;42:2876\u201396. https:\/\/linkinghub.elsevier.com\/retrieve\/pii\/S0031320309001538","DOI":"10.1016\/j.patcog.2009.04.017"},{"key":"414_CR174","doi-asserted-by":"crossref","unstructured":"Tan X, Chen S, Zhou Z-H, Zhang F. Face recognition from a single image per person: a survey. pattern recognition. 2006;39:1725\u201345. https:\/\/linkinghub.elsevier.com\/retrieve\/pii\/S0031320306001270","DOI":"10.1016\/j.patcog.2006.03.013"},{"key":"414_CR175","doi-asserted-by":"crossref","unstructured":"Zhao W, Chellappa R, Phillips PJ, Rosenfeld A. Face recognition. ACM computing surveys. 2003;35:399\u2013458. http:\/\/portal.acm.org\/citation.cfm?doid=954339.954342","DOI":"10.1145\/954339.954342"},{"key":"414_CR176","doi-asserted-by":"publisher","unstructured":"Qian X, Fu Y, Xiang T, Wang W, Qiu J, Wu Y, et al. Pose-Normalized Image Generation for Person Re-identification. 2018. p. 661\u201378. https:\/\/doi.org\/10.1007\/978-3-030-01240-3_40","DOI":"10.1007\/978-3-030-01240-3_40"},{"key":"414_CR177","doi-asserted-by":"crossref","unstructured":"Wei L, Zhang S, Gao W, Tian Q. 
Person Transfer GAN to bridge domain gap for person re-identification. 2018 IEEE\/CVF conference on computer vision and pattern recognition . IEEE; 2018. p. 79\u201388. https:\/\/ieeexplore.ieee.org\/document\/8578114\/","DOI":"10.1109\/CVPR.2018.00016"},{"key":"414_CR178","doi-asserted-by":"crossref","unstructured":"Zhong Z, Zheng L, Zheng Z, Li S, Yang Y. Camera style adaptation for person re-identification. 2018 IEEE\/CVF conference on computer vision and pattern recognition. IEEE; 2018. p. 5157\u201366. https:\/\/ieeexplore.ieee.org\/document\/8578639\/","DOI":"10.1109\/CVPR.2018.00541"},{"key":"414_CR179","unstructured":"Deng W, Zheng L, Ye Q, Yang Y, Jiao J. Similarity-preserving image-image domain adaptation for person re-identification. 2018; http:\/\/arxiv.org\/abs\/1811.10551"},{"key":"414_CR180","first-page":"1222","volume":"2018","author":"Y Ge","year":"2018","unstructured":"Ge Y, Li Z, Zhao H, Yin G, Yi S, Wang X, et al. FD-GAN: Pose-guided Feature Distilling GAN for robust person re-identification. Adv Neural Informat Process Syst. 2018;2018:1222\u201333.","journal-title":"Adv Neural Informat Process Syst."},{"key":"414_CR181","unstructured":"Zheng A, Lin X, Li C, He R, Tang J. Attributes guided feature learning for vehicle re-identification. 2019; http:\/\/arxiv.org\/abs\/1905.08997"},{"key":"414_CR182","doi-asserted-by":"crossref","unstructured":"Zhou Y, Shao L. Cross-View GAN Based Vehicle Generation for Re-identification. Proceedings of the British Machine Vision Conference 2017. British Machine Vision Association; 2017. http:\/\/www.bmva.org\/bmvc\/2017\/papers\/paper186\/index.html","DOI":"10.5244\/C.31.186"},{"key":"414_CR183","doi-asserted-by":"crossref","unstructured":"Wu F, Yan S, Smith JS, Zhang B. Vehicle re-identification in still images: application of semi-supervised learning and re-ranking. Signal Processing: Image Communication . 2019;76:261\u201371. 
https:\/\/linkinghub.elsevier.com\/retrieve\/pii\/S0923596518305800","DOI":"10.1016\/j.image.2019.04.021"},{"key":"414_CR184","doi-asserted-by":"crossref","unstructured":"Fu Y, Li X, Ye Y. A multi-task learning model with adversarial data augmentation for classification of fine-grained images. Neurocomputing . 2020;377:122\u20139. https:\/\/linkinghub.elsevier.com\/retrieve\/pii\/S0925231219313748","DOI":"10.1016\/j.neucom.2019.10.002"},{"key":"414_CR185","doi-asserted-by":"crossref","unstructured":"Ge Z, Bewley A, McCool C, Corke P, Upcroft B, Sanderson C. Fine-grained classification via mixture of deep convolutional neural networks. 2016 IEEE Winter Conference on Applications of Computer Vision (WACV) . IEEE; 2016. p. 1\u20136. http:\/\/ieeexplore.ieee.org\/document\/7477700\/","DOI":"10.1109\/WACV.2016.7477700"},{"key":"414_CR186","unstructured":"Khosla A, Jayadevaprakash N, Yao B, Fei-Fei L. Novel dataset for fine-grained image categorization. Proc IEEE Conf Comput Vision and Pattern Recognition. 2011"},{"key":"414_CR187","unstructured":"Welinder P, Branson S, Mita T, Wah C, Schroff F. Caltech-ucsd Birds 200. Caltech-UCSD Technical Report . 2010;200:1\u201315. http:\/\/www.flickr.com\/"},{"key":"414_CR188","doi-asserted-by":"crossref","unstructured":"Wang C, Yu Z, Zheng H, Wang N, Zheng B. CGAN-plankton: Towards large-scale imbalanced class generation and fine-grained classification. 2017 IEEE International Conference on Image Processing (ICIP) . IEEE; 2017. p. 855\u20139. http:\/\/ieeexplore.ieee.org\/document\/8296402\/","DOI":"10.1109\/ICIP.2017.8296402"},{"key":"414_CR189","unstructured":"Orenstein EC, Beijbom O, Peacock EE, Sosik HM. WHOI-Plankton-a large scale fine grained visual recognition benchmark dataset for plankton classification. 2015; http:\/\/arxiv.org\/abs\/1510.00745"},{"key":"414_CR190","unstructured":"Koga T, Nonaka N, Sakuma J, Seita J. General-to-Detailed GAN for infrequent class medical images. 
2018; http:\/\/arxiv.org\/abs\/1812.01690"},{"key":"414_CR191","unstructured":"Zhu X, Liu Y, Qin Z, Li J. Data Augmentation in emotion classification using generative adversarial networks. 2017; http:\/\/arxiv.org\/abs\/1711.00648"},{"key":"414_CR192","doi-asserted-by":"crossref","unstructured":"Haseeb Nazki, Jaehwan Lee, Sook Yoon DSP. Image-to-image translation with GAN for Synthetic Data augmentation in plant disease datasets. Smart Media J. 2019;8:46\u201357. http:\/\/kism.or.kr\/file\/memoir\/8_2_6.pdf","DOI":"10.30693\/SMJ.2019.8.2.46"},{"key":"414_CR193","doi-asserted-by":"crossref","unstructured":"Salehinejad H, Valaee S, Dowdell T, Colak E, Barfett J. Generalization of deep neural networks for chest pathology classification in X-Rays using generative adversarial networks. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing\u2013Proceedings. 2018;2018-April:990\u20134.","DOI":"10.1109\/ICASSP.2018.8461430"},{"key":"414_CR194","doi-asserted-by":"crossref","unstructured":"Lu Y-W, Liu K-L, Hsu C-Y. Conditional Generative Adversarial Network for Defect Classification with Class Imbalance. 2019 IEEE International Conference on Smart Manufacturing, Industrial & Logistics Engineering (SMILE) . IEEE; 2019. p. 146\u20139. https:\/\/ieeexplore.ieee.org\/document\/8965320\/","DOI":"10.1109\/SMILE45626.2019.8965320"},{"key":"414_CR195","doi-asserted-by":"crossref","unstructured":"Shuo Wang, Xin Yao. Multiclass imbalance problems: analysis and potential solutions. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) . 2012;42:1119\u201330. http:\/\/ieeexplore.ieee.org\/document\/6170916\/","DOI":"10.1109\/TSMCB.2012.2187280"},{"key":"414_CR196","doi-asserted-by":"publisher","first-page":"1119","DOI":"10.1109\/TSMCB.2012.2187280","volume":"42","author":"W Shuo","year":"2012","unstructured":"Shuo W, Xin Y. Multiclass Imbalance Problems: Analysis and Potential Solutions. IEEE Transact Syst Man Cybernet Part B. 
2012;42:1119\u201330.","journal-title":"IEEE Transact Syst Man Cybernet Part B."},{"key":"414_CR197","doi-asserted-by":"crossref","unstructured":"Zhu X, Liu Y, Qin Z, Li J. Data augmentation in emotion classification using generative adversarial networks. 2017.","DOI":"10.1007\/978-3-319-93040-4_28"},{"key":"414_CR198","doi-asserted-by":"crossref","unstructured":"Li Z, Jin Y, Li Y, Lin Z, Wang S. imbalanced adversarial learning for weather image generation and classification. 2018 14th IEEE International Conference on Signal Processing (ICSP) . IEEE; 2018. p. 1093\u20137. https:\/\/ieeexplore.ieee.org\/document\/8652272\/","DOI":"10.1109\/ICSP.2018.8652272"},{"key":"414_CR199","doi-asserted-by":"crossref","unstructured":"Huang Y, Jin Y, Li Y, Lin Z. Towards imbalanced image classification: a generative adversarial network ensemble learning method. IEEE Access . 2020;8:88399\u2013409. https:\/\/ieeexplore.ieee.org\/document\/9086504\/","DOI":"10.1109\/ACCESS.2020.2992683"},{"key":"414_CR200","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1016\/j.neucom.2018.09.013","volume":"321","author":"M Frid-Adar","year":"2018","unstructured":"Frid-Adar M, Diamant I, Klang E, Amitai M, Goldberger J, Greenspan H. GAN-based synthetic medical image augmentation for increased CNN performance in liver lesion classification. Neurocomputing. 2018;321:321\u201331.","journal-title":"Neurocomputing."},{"key":"414_CR201","doi-asserted-by":"crossref","unstructured":"Rashid H, Tanveer MA, Aqeel Khan H. Skin lesion classification using GAN based data augmentation. 2019 41st annual international conference of the IEEE engineering in medicine and biology society (EMBC). IEEE; 2019. p. 916\u20139. https:\/\/ieeexplore.ieee.org\/document\/8857905\/","DOI":"10.1109\/EMBC.2019.8857905"},{"key":"414_CR202","doi-asserted-by":"crossref","unstructured":"Tschandl P, Rosendahl C, Kittler H. 
The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Scientific Data . 2018;5:180161. http:\/\/www.nature.com\/articles\/sdata2018161","DOI":"10.1038\/sdata.2018.161"},{"key":"414_CR203","unstructured":"Bhatia S, Dahyot R. Using WGAN for improving imbalanced classification performance. AICS 2019. 2019."},{"key":"414_CR204","unstructured":"Xiao H, Rasul K, Vollgraf R. Fashion-MNIST: a novel image dataset for benchmarking machine learning algorithms. 2017;1\u20136. http:\/\/arxiv.org\/abs\/1708.07747"},{"key":"414_CR205","doi-asserted-by":"crossref","unstructured":"Fanny, Cenggoro TW. Deep learning for imbalance data classification using class expert generative adversarial network. Procedia Comput Sci. 2018;135:60\u20137.","DOI":"10.1016\/j.procs.2018.08.150"},{"key":"414_CR206","doi-asserted-by":"crossref","unstructured":"Lin TY, Maire M, Belongie S, Hays J, Perona P, Ramanan D, et al. Microsoft COCO: Common objects in context. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2014;8693 LNCS:740\u201355.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"414_CR207","doi-asserted-by":"crossref","unstructured":"Bai H, Wen S, Chan SHG. Crowd counting on images with scale variation and isolated clusters. Proceedings\u20132019 International Conference on Computer Vision Workshop, ICCVW 2019. 2019;18\u201327.","DOI":"10.1109\/ICCVW.2019.00009"},{"key":"414_CR208","doi-asserted-by":"crossref","unstructured":"Li J, Liang X, Wei Y, Xu T, Feng J, Yan S. Perceptual generative adversarial networks for small object detection. 2017 IEEE conference on computer vision and pattern recognition (CVPR) . IEEE; 2017. p. 1951\u20139. http:\/\/ieeexplore.ieee.org\/document\/8099694\/","DOI":"10.1109\/CVPR.2017.211"},{"key":"414_CR209","doi-asserted-by":"crossref","unstructured":"Liu L, Muelly M, Deng J, Pfister T, Li LJ. 
Generative modeling for small-data object detection. Proceedings of the IEEE International Conference on Computer Vision. 2019; 2019-Octob: 6072\u201380.","DOI":"10.1109\/ICCV.2019.00617"},{"key":"414_CR210","doi-asserted-by":"crossref","unstructured":"Zhu Z, Liang D, Zhang S, Huang X, Li B, Hu S. Traffic-Sign Detection and Classification in the Wild. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) . IEEE; 2016. p. 2110\u20138. http:\/\/ieeexplore.ieee.org\/document\/7780601\/","DOI":"10.1109\/CVPR.2016.232"},{"key":"414_CR211","doi-asserted-by":"publisher","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","volume":"88","author":"M Everingham","year":"2010","unstructured":"Everingham M, Van Gool L, Williams CKI, Winn J, Zisserman A. The pascal visual object classes (VOC) challenge. Int J Comput Vision. 2010;88:303\u201338. https:\/\/doi.org\/10.1007\/s11263-009-0275-4.","journal-title":"Int J Comput Vision."},{"key":"414_CR212","doi-asserted-by":"crossref","unstructured":"Dollar P, Wojek C, Schiele B, Perona P. Pedestrian detection: an evaluation of the state of the art. IEEE transactions on pattern analysis and machine intelligence . 2012;34:743\u201361. http:\/\/ieeexplore.ieee.org\/document\/5975165\/","DOI":"10.1109\/TPAMI.2011.155"},{"key":"414_CR213","doi-asserted-by":"crossref","unstructured":"Bai Y, Zhang Y, Ding M, Ghanem B. SOD-MTGAN: Small object detection via multi-task generative adversarial network. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2018;11217 LNCS:210\u201326.","DOI":"10.1007\/978-3-030-01261-8_13"},{"key":"414_CR214","doi-asserted-by":"crossref","unstructured":"He K, Gkioxari G, Dollar P, Girshick R. Mask R-CNN. 2017 IEEE International Conference on Computer Vision (ICCV) . IEEE; 2017. p. 2980\u20138. 
http:\/\/ieeexplore.ieee.org\/document\/8237584\/","DOI":"10.1109\/ICCV.2017.322"},{"key":"414_CR215","doi-asserted-by":"publisher","unstructured":"B SC, Koznek N, Ismail A, Adam G, Narayan V, Schulze M. Computer Vision\u2013ECCV 2018 Workshops . European Conference on Computer Vision 2018. 2019. https:\/\/doi.org\/10.1007\/978-3-030-11021-5","DOI":"10.1007\/978-3-030-11021-5"},{"key":"414_CR216","doi-asserted-by":"crossref","unstructured":"Wang X, Shrivastava A, Gupta A. A-Fast-RCNN: Hard positive generation via adversary for object detection. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) . IEEE; 2017. p. 3039\u201348. http:\/\/ieeexplore.ieee.org\/document\/8099807\/","DOI":"10.1109\/CVPR.2017.324"},{"key":"414_CR217","unstructured":"Chen Y, Song L, He R. Adversarial occlusion-aware face detection. 2017; http:\/\/arxiv.org\/abs\/1709.05188"},{"key":"414_CR218","doi-asserted-by":"crossref","unstructured":"Dwibedi D, Misra I, Hebert M. Cut, Paste and learn: surprisingly easy synthesis for instance detection. 2017 IEEE International conference on computer vision (ICCV) . IEEE; 2017. p. 1310\u20139. http:\/\/ieeexplore.ieee.org\/document\/8237408\/","DOI":"10.1109\/ICCV.2017.146"},{"key":"414_CR219","doi-asserted-by":"crossref","unstructured":"Tripathi S, Chandra S, Agrawal A, Tyagi A, Rehg JM, Chari V. Learning to generate synthetic data via compositing. 2019 IEEE\/CVF Conference on computer vision and pattern recognition (CVPR) . IEEE; 2019. p. 461\u201370. https:\/\/ieeexplore.ieee.org\/document\/8953554\/","DOI":"10.1109\/CVPR.2019.00055"},{"key":"414_CR220","unstructured":"Wang H, Wang Q, Yang F, Zhang W, Zuo W. Data augmentation for object detection via progressive and selective instance-switching. 2019; http:\/\/arxiv.org\/abs\/1906.00358"},{"key":"414_CR221","doi-asserted-by":"crossref","unstructured":"Zhou S, Xiao T, Yang Y, Feng D, He Q, He W. GeneGAN: Learning object transfiguration and object subspace from unpaired data. 
Proceedings of the British Machine Vision Conference 2017. British Machine Vision Association; 2017. http:\/\/www.bmva.org\/bmvc\/2017\/papers\/paper111\/index.html","DOI":"10.5244\/C.31.111"},{"key":"414_CR222","doi-asserted-by":"crossref","unstructured":"Liu S, Zhang J, Chen Y, Liu Y, Qin Z, Wan T. Pixel Level Data Augmentation for Semantic Image segmentation using generative adversarial networks. ICASSP 2019\u20132019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) . IEEE; 2019. p. 1902\u20136. https:\/\/ieeexplore.ieee.org\/document\/8683590\/","DOI":"10.1109\/ICASSP.2019.8683590"},{"key":"414_CR223","doi-asserted-by":"crossref","unstructured":"Nguyen V, Vicente TFY, Zhao M, Hoai M, Samaras D. Shadow detection with conditional generative adversarial networks. 2017 IEEE International Conference on Computer Vision (ICCV). IEEE; 2017. p. 4520\u20138. http:\/\/ieeexplore.ieee.org\/document\/8237745\/","DOI":"10.1109\/ICCV.2017.483"},{"key":"414_CR224","doi-asserted-by":"crossref","unstructured":"Zhu J, Samuel KGG, Masood SZ, Tappen MF. Learning to recognize shadows in monochromatic natural images. 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition . IEEE; 2010. p. 223\u201330. http:\/\/ieeexplore.ieee.org\/document\/5540209\/","DOI":"10.1109\/CVPR.2010.5540209"},{"key":"414_CR225","doi-asserted-by":"publisher","unstructured":"Vicente TFY, Hou L, Yu C-P, Hoai M, Samaras D. Large-Scale Training of Shadow Detectors with Noisily-Annotated Shadow Examples. 2016. p. 816\u201332. https:\/\/doi.org\/10.1007\/978-3-319-46466-4_49","DOI":"10.1007\/978-3-319-46466-4_49"},{"key":"414_CR226","doi-asserted-by":"publisher","unstructured":"Rezaei M, Yang H, Meinel C. voxel-GAN: adversarial framework for learning imbalanced brain tumor segmentation. 2019. p. 321\u201333. 
https:\/\/doi.org\/10.1007\/978-3-030-11726-9_29","DOI":"10.1007\/978-3-030-11726-9_29"},{"key":"414_CR227","doi-asserted-by":"publisher","first-page":"15329","DOI":"10.1007\/s11042-019-7305-1","volume":"79","author":"M Rezaei","year":"2020","unstructured":"Rezaei M, Yang H, Meinel C. Recurrent generative adversarial network for learning imbalanced medical image semantic segmentation. Multimedia Tools Applications. 2020;79:15329\u201348. https:\/\/doi.org\/10.1007\/s11042-019-7305-1.","journal-title":"Multimedia Tools Applications."},{"key":"414_CR228","doi-asserted-by":"crossref","unstructured":"Rezaei M, Yang H, Meinel C. Conditional generative refinement adversarial networks for unbalanced medical image semantic segmentation. 2018; http:\/\/arxiv.org\/abs\/1810.03871","DOI":"10.1109\/WACV.2019.00200"},{"key":"414_CR229","doi-asserted-by":"publisher","first-page":"8","DOI":"10.1016\/j.compag.2015.05.021","volume":"116","author":"A Gongal","year":"2015","unstructured":"Gongal A, Amatya S, Karkee M, Zhang Q, Lewis K. Sensors and systems for fruit detection and localization: a review. Comput Electron Agric. 2015;116:8\u201319.","journal-title":"Comput Electron Agric."},{"key":"414_CR230","doi-asserted-by":"crossref","unstructured":"Sa I, Ge Z, Dayoub F, Upcroft B, Perez T, McCool C. DeepFruits: a fruit detection system using deep neural networks. Sensors . 2016;16:1222. http:\/\/www.mdpi.com\/1424-8220\/16\/8\/1222","DOI":"10.3390\/s16081222"},{"key":"414_CR231","doi-asserted-by":"crossref","unstructured":"Ehsani K, Mottaghi R, Farhadi A. SeGAN: Segmenting and Generating the Invisible. 2018 IEEE\/CVF conference on computer vision and pattern recognition . IEEE; 2018. p. 6144\u201353. https:\/\/ieeexplore.ieee.org\/document\/8578741\/","DOI":"10.1109\/CVPR.2018.00643"},{"key":"414_CR232","doi-asserted-by":"crossref","unstructured":"Dong J, Zhang L, Zhang H, Liu W. Occlusion-Aware GAN for Face De-Occlusion in the Wild. 
2020 IEEE international conference on multimedia and expo (ICME) . IEEE; 2020. p. 1\u20136. https:\/\/ieeexplore.ieee.org\/document\/9102788\/","DOI":"10.1109\/ICME46284.2020.9102788"},{"key":"414_CR233","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1117\/1.JMI.6.3.031411","volume":"6","author":"S Guan","year":"2019","unstructured":"Guan S. Breast cancer detection using synthetic mammograms from generative adversarial networks in convolutional neural networks. J Med Imag. 2019;6:1.","journal-title":"J Med Imag"},{"key":"414_CR234","unstructured":"Donahue C, Lipton ZC, Balsubramani A, McAuley J. Semantically decomposing the latent spaces of generative adversarial networks. 2017;"},{"key":"414_CR235","doi-asserted-by":"crossref","unstructured":"Wang W, Hong W, Wang F, Yu J. GAN-Knowledge distillation for one-stage object detection. IEEE Access . 2020;8:60719\u201327. https:\/\/ieeexplore.ieee.org\/document\/9046859\/","DOI":"10.1109\/ACCESS.2020.2983174"},{"key":"414_CR236","doi-asserted-by":"publisher","first-page":"014021","DOI":"10.1103\/PhysRevD.97.014021","volume":"97","author":"M Paganini","year":"2018","unstructured":"Paganini M, de Oliveira L, Nachman B. CaloGAN: Simulating 3D high energy particle showers in multilayer electromagnetic calorimeters with generative adversarial networks. Phys Rev D. 2018;97:014021. 
https:\/\/doi.org\/10.1103\/PhysRevD.97.014021.","journal-title":"Phys Rev D."}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-021-00414-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s40537-021-00414-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-021-00414-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,23]],"date-time":"2024-08-23T02:22:42Z","timestamp":1724379762000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-021-00414-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,1,29]]},"references-count":236,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["414"],"URL":"https:\/\/doi.org\/10.1186\/s40537-021-00414-0","relation":{"has-preprint":[{"id-type":"doi","id":"10.21203\/rs.3.rs-45616\/v1","asserted-by":"object"},{"id-type":"doi","id":"10.21203\/rs.3.rs-45616\/v4","asserted-by":"object"},{"id-type":"doi","id":"10.21203\/rs.3.rs-45616\/v3","asserted-by":"object"},{"id-type":"doi","id":"10.21203\/rs.3.rs-45616\/v2","asserted-by":"object"}]},"ISSN":["2196-1115"],"issn-type":[{"value":"2196-1115","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,1,29]]},"assertion":[{"value":"30 July 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 January 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"29 January 
2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"27"}}