{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,11]],"date-time":"2026-05-11T01:12:35Z","timestamp":1778461955412,"version":"3.51.4"},"reference-count":110,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,3,23]],"date-time":"2024-03-23T00:00:00Z","timestamp":1711152000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,3,23]],"date-time":"2024-03-23T00:00:00Z","timestamp":1711152000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100000038","name":"Natural Sciences and Engineering Research Council of Canada","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100000038","id-type":"DOI","asserted-by":"publisher"}]},{"name":"British Columbia Cancer Foundation"},{"name":"Collaborative Health Research Projects"},{"DOI":"10.13039\/501100004326","name":"Simon Fraser University","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100004326","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Modern deep learning training procedures rely on model regularization techniques such as data augmentation methods, which generate training samples that increase the diversity of data and richness of label information. A popular recent method,<jats:italic>mixup<\/jats:italic>, uses convex combinations of pairs of original samples to generate new samples. However, as we show in our experiments,<jats:italic>mixup<\/jats:italic>\u00a0can produce undesirable synthetic samples, where the data is sampled off the manifold and can contain incorrect labels. We propose<jats:inline-formula><jats:alternatives><jats:tex-math>$$\\zeta $$<\/jats:tex-math><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"><mml:mi>\u03b6<\/mml:mi><\/mml:math><\/jats:alternatives><\/jats:inline-formula>-<jats:italic>mixup<\/jats:italic>, a generalization of<jats:italic>mixup<\/jats:italic>\u00a0with provably and demonstrably desirable properties that allows convex combinations of<jats:inline-formula><jats:alternatives><jats:tex-math>$${T} \\ge 2$$<\/jats:tex-math><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"><mml:mrow><mml:mi>T<\/mml:mi><mml:mo>\u2265<\/mml:mo><mml:mn>2<\/mml:mn><\/mml:mrow><\/mml:math><\/jats:alternatives><\/jats:inline-formula>samples, leading to more realistic and diverse outputs that incorporate information from<jats:inline-formula><jats:alternatives><jats:tex-math>$${T}$$<\/jats:tex-math><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"><mml:mi>T<\/mml:mi><\/mml:math><\/jats:alternatives><\/jats:inline-formula>original samples by using a<jats:italic>p<\/jats:italic>-series interpolant. We show that, compared to<jats:italic>mixup<\/jats:italic>,<jats:inline-formula><jats:alternatives><jats:tex-math>$$\\zeta $$<\/jats:tex-math><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"><mml:mi>\u03b6<\/mml:mi><\/mml:math><\/jats:alternatives><\/jats:inline-formula>-<jats:italic>mixup<\/jats:italic>\u00a0better preserves the intrinsic dimensionality of the original datasets, which is a desirable property for training generalizable models. Furthermore, we show that our implementation of<jats:inline-formula><jats:alternatives><jats:tex-math>$$\\zeta $$<\/jats:tex-math><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"><mml:mi>\u03b6<\/mml:mi><\/mml:math><\/jats:alternatives><\/jats:inline-formula>-<jats:italic>mixup<\/jats:italic>\u00a0is faster than<jats:italic>mixup<\/jats:italic>, and extensive evaluation on controlled synthetic and 26 diverse real-world natural and medical image classification datasets shows that<jats:inline-formula><jats:alternatives><jats:tex-math>$$\\zeta $$<\/jats:tex-math><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"><mml:mi>\u03b6<\/mml:mi><\/mml:math><\/jats:alternatives><\/jats:inline-formula>-<jats:italic>mixup<\/jats:italic>\u00a0outperforms<jats:italic>mixup<\/jats:italic>, CutMix,\u00a0and traditional data augmentation techniques. The code will be released at<jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/kakumarabhishek\/zeta-mixup\">https:\/\/github.com\/kakumarabhishek\/zeta-mixup<\/jats:ext-link>.<\/jats:p>","DOI":"10.1186\/s40537-024-00898-6","type":"journal-article","created":{"date-parts":[[2024,3,23]],"date-time":"2024-03-23T12:01:52Z","timestamp":1711195312000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Multi-sample $$\\zeta $$-mixup: richer, more realistic synthetic samples from a p-series interpolant"],"prefix":"10.1186","volume":"11","author":[{"given":"Kumar","family":"Abhishek","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Colin J.","family":"Brown","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ghassan","family":"Hamarneh","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2024,3,23]]},"reference":[{"key":"898_CR1","doi-asserted-by":"publisher","first-page":"85","DOI":"10.1016\/j.neunet.2014.09.003","volume":"61","author":"J Schmidhuber","year":"2015","unstructured":"Schmidhuber J. Deep learning in neural networks: an overview. Neural Netw. 2015;61:85\u2013117.","journal-title":"Neural Netw"},{"issue":"7553","key":"898_CR2","doi-asserted-by":"publisher","first-page":"436","DOI":"10.1038\/nature14539","volume":"521","author":"Y LeCun","year":"2015","unstructured":"LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521(7553):436\u201344.","journal-title":"Nature"},{"key":"898_CR3","unstructured":"Alom MZ, Taha TM, Yakopcic C, Westberg S, Sidike P, Nasrin MS, Van\u00a0Esesn BC, Awwal AAS, Asari VK. The history began from AlexNet: A comprehensive survey on deep learning approaches. arXiv preprint arXiv:1803.01164. 2018."},{"issue":"1","key":"898_CR4","doi-asserted-by":"publisher","first-page":"4","DOI":"10.1109\/TNNLS.2020.2978386","volume":"32","author":"Z Wu","year":"2020","unstructured":"Wu Z, Pan S, Chen F, Long G, Zhang C, Philip SY. A comprehensive survey on graph neural networks. IEEE Trans Neural Netw Learn Syst. 2020;32(1):4\u201324.","journal-title":"IEEE Trans Neural Netw Learn Syst"},{"key":"898_CR5","doi-asserted-by":"crossref","unstructured":"Deng J, Dong W, Socher R, Li L-J, Li K, Fei-Fei L. ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009; pp. 248\u2013255. IEEE.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"898_CR6","first-page":"740","volume-title":"European Conference on Computer Vision (ECCV)","author":"T-Y Lin","year":"2014","unstructured":"Lin T-Y, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Doll\u00e1r P, Zitnick CL. Microsoft COCO: common objects in context. In: European Conference on Computer Vision (ECCV). Berlin: Springer; 2014. p. 740\u201355."},{"key":"898_CR7","doi-asserted-by":"crossref","unstructured":"Cordts M, Omran M, Ramos S, Rehfeld T, Enzweiler M, Benenson R, Franke U, Roth S, Schiele B. The Cityscapes dataset for semantic urban scene understanding. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2016; pp. 3213\u20133223.","DOI":"10.1109\/CVPR.2016.350"},{"issue":"6088","key":"898_CR8","doi-asserted-by":"publisher","first-page":"533","DOI":"10.1038\/323533a0","volume":"323","author":"DE Rumelhart","year":"1986","unstructured":"Rumelhart DE, Hinton GE, Williams RJ. Learning representations by back-propagating errors. Nature. 1986;323(6088):533\u20136.","journal-title":"Nature"},{"issue":"2","key":"898_CR9","doi-asserted-by":"publisher","first-page":"99","DOI":"10.1162\/106365602320169811","volume":"10","author":"KO Stanley","year":"2002","unstructured":"Stanley KO, Miikkulainen R. Evolving neural networks through augmenting topologies. Evol Comput. 2002;10(2):99\u2013127.","journal-title":"Evol Comput"},{"key":"898_CR10","doi-asserted-by":"crossref","unstructured":"Steinkraus D, Buck I, Simard P. Using GPUs for machine learning algorithms. In: Eighth International Conference on Document Analysis and Recognition (ICDAR\u201905). 2005; pp. 1115\u20131120. IEEE.","DOI":"10.1109\/ICDAR.2005.251"},{"key":"898_CR11","unstructured":"Chellapilla K, Puri S, Simard P. High performance convolutional neural networks for document processing. In: Tenth International Workshop on Frontiers in Handwriting Recognition (IWFHR). 2006. Suvisoft."},{"key":"898_CR12","doi-asserted-by":"crossref","unstructured":"Raina R, Madhavan A, Ng AY. Large-scale deep unsupervised learning using graphics processors. In: Proceedings of the 26th Annual International Conference on Machine Learning (ICML). 2009. pp. 873\u2013880","DOI":"10.1145\/1553374.1553486"},{"issue":"12","key":"898_CR13","doi-asserted-by":"publisher","first-page":"3207","DOI":"10.1162\/NECO_a_00052","volume":"22","author":"DC Cire\u015fan","year":"2010","unstructured":"Cire\u015fan DC, Meier U, Gambardella LM, Schmidhuber J. Deep, big, simple neural nets for handwritten digit recognition. Neural Comput. 2010;22(12):3207\u201320.","journal-title":"Neural Comput"},{"key":"898_CR14","unstructured":"Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556. 2014."},{"key":"898_CR15","first-page":"234","volume-title":"International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI)","author":"O Ronneberger","year":"2015","unstructured":"Ronneberger O, Fischer P, Brox T. U-Net: convolutional networks for biomedical image segmentation. In: International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI). Berlin: Springer; 2015. p. 234\u201341."},{"key":"898_CR16","doi-asserted-by":"crossref","unstructured":"He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2016:770\u2013778.","DOI":"10.1109\/CVPR.2016.90"},{"key":"898_CR17","unstructured":"Duchi J, Hazan E, Singer Y. Adaptive subgradient methods for online learning and stochastic optimization. J Mach Learn Res (JMLR). 2011; 12(7)."},{"key":"898_CR18","unstructured":"Zeiler MD. ADADELTA: an adaptive learning rate method. arXiv preprint arXiv:1212.5701 2012."},{"key":"898_CR19","unstructured":"Kingma DP, Ba J. Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980. 2014."},{"key":"898_CR20","unstructured":"Dozat T. Incorporating Nesterov momentum into Adam. International Conference on Learning Representations (ICLR) Workshop. 2016."},{"issue":"1","key":"898_CR21","first-page":"1929","volume":"15","author":"N Srivastava","year":"2014","unstructured":"Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R. Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res (JMLR). 2014;15(1):1929\u201358.","journal-title":"J Mach Learn Res (JMLR)"},{"key":"898_CR22","unstructured":"Ioffe S, Szegedy C. Batch normalization: accelerating deep network training by reducing internal covariate shift. In: International Conference on Machine Learning (ICML). 2015. pp. 448\u2013456. PMLR."},{"key":"898_CR23","unstructured":"Krogh A, Hertz J. A simple weight decay can improve generalization. Adv Neural Informat Process Syst (NeurIPS). 1991;4."},{"key":"898_CR24","first-page":"646","volume-title":"European Conference on Computer Vision (ECCV)","author":"G Huang","year":"2016","unstructured":"Huang G, Sun Y, Liu Z, Sedra D, Weinberger KQ. Deep networks with stochastic depth. In: European Conference on Computer Vision (ECCV). Berlin: Springer; 2016. p. 646\u201361."},{"key":"898_CR25","doi-asserted-by":"publisher","first-page":"3714","DOI":"10.1609\/aaai.v33i01.33013714","volume":"33","author":"H Guo","year":"2019","unstructured":"Guo H, Mao Y, Zhang R. MixUp as locally linear out-of-manifold regularization. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI). 2019;33:3714\u201322.","journal-title":"Proceedings of the AAAI Conference on Artificial Intelligence (AAAI)"},{"issue":"11","key":"898_CR26","doi-asserted-by":"publisher","first-page":"2278","DOI":"10.1109\/5.726791","volume":"86","author":"Y LeCun","year":"1998","unstructured":"LeCun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proc IEEE. 1998;86(11):2278\u2013324.","journal-title":"Proc IEEE"},{"key":"898_CR27","doi-asserted-by":"crossref","unstructured":"Cire\u015fan D, Meier U, Schmidhuber J. Multi-column deep neural networks for image classification. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2012. pp. 3642\u20133649. IEEE.","DOI":"10.1109\/CVPR.2012.6248110"},{"key":"898_CR28","unstructured":"Krizhevsky A, Sutskever I, Hinton GE. ImageNet classification with deep convolutional neural networks. Adv Neural Informat Process Syst (NeurIPS). 2012; 25."},{"key":"898_CR29","first-page":"818","volume-title":"European Conference on Computer Vision (ECCV)","author":"MD Zeiler","year":"2014","unstructured":"Zeiler MD, Fergus R. Visualizing and understanding convolutional networks. In: European Conference on Computer Vision (ECCV). Berlin: Springer; 2014. p. 818\u201333."},{"key":"898_CR30","unstructured":"Goodfellow IJ, Shlens J, Szegedy C. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 (2014)"},{"key":"898_CR31","doi-asserted-by":"crossref","unstructured":"Bai T, Luo J, Zhao J, Wen B, Wang Q. Recent advances in adversarial training for adversarial robustness. arXiv preprint arXiv:2102.01356. 2021.","DOI":"10.24963\/ijcai.2021\/591"},{"key":"898_CR32","doi-asserted-by":"publisher","DOI":"10.1007\/978-0-387-84858-7","volume-title":"The elements of statistical learning: data mining, inference, and prediction","author":"T Hastie","year":"2009","unstructured":"Hastie T, Tibshirani R, Friedman JH, Friedman JH. The elements of statistical learning: data mining, inference, and prediction, vol. 2. Berlin: Springer; 2009."},{"key":"898_CR33","first-page":"95","volume-title":"International Conference on Artificial Neural Networks (ICANN)","author":"A Hern\u00e1ndez-Garc\u00eda","year":"2018","unstructured":"Hern\u00e1ndez-Garc\u00eda A, K\u00f6nig P. Further advantages of data augmentation on convolutional neural networks. In: International Conference on Artificial Neural Networks (ICANN). Berlin: Springer; 2018. p. 95\u2013103."},{"key":"898_CR34","unstructured":"Hern\u00e1ndez-Garc\u00eda A, K\u00f6nig P. Data augmentation instead of explicit regularization. arXiv preprint arXiv:1806.03852. 2018."},{"issue":"1","key":"898_CR35","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40537-019-0197-0","volume":"6","author":"C Shorten","year":"2019","unstructured":"Shorten C, Khoshgoftaar TM. A survey on image data augmentation for deep learning. J Big Data. 2019;6(1):1\u201348.","journal-title":"J Big Data"},{"key":"898_CR36","unstructured":"Zhang H, Cisse M, Dauphin YN, Lopez-Paz D. mixup: Beyond empirical risk minimization. In: International Conference on Learning Representations (ICLR). 2018."},{"key":"898_CR37","doi-asserted-by":"crossref","unstructured":"Tokozume Y, Ushiku Y, Harada T. Between-class learning for image classification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2018. pp. 5486\u20135494.","DOI":"10.1109\/CVPR.2018.00575"},{"key":"898_CR38","unstructured":"Inoue H. Data augmentation by pairing samples for images classification. arXiv preprint arXiv:1801.02929. 2018."},{"key":"898_CR39","doi-asserted-by":"crossref","unstructured":"Yun S, Han D, Oh SJ, Chun S, Choe J, Yoo Y. CutMix: regularization strategy to train strong classifiers with localizable features. In: Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV). 2019. pp. 6023\u20136032.","DOI":"10.1109\/ICCV.2019.00612"},{"key":"898_CR40","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2020.107594","volume":"109","author":"K Baek","year":"2021","unstructured":"Baek K, Bang D, Shim H. GridMix: strong regularization through local context mapping. Pattern Recogn. 2021;109: 107594.","journal-title":"Pattern Recogn"},{"key":"898_CR41","unstructured":"Verma V, Lamb A, Beckham C, Najafi A, Mitliagkas I, Lopez-Paz D, Bengio Y. Manifold mixup: Better representations by interpolating hidden states. In: International Conference on Machine Learning (ICML), 2019. pp. 6438\u20136447. PMLR."},{"key":"898_CR42","unstructured":"Yaguchi Y, Shiratani F, Iwaki H. MixFeat: Mix Feature in Latent Space Learns Discriminative Space (2019). https:\/\/openreview.net\/forum?id=HygT9oRqFX"},{"key":"898_CR43","first-page":"633","volume-title":"European Conference on Computer Vision (ECCV)","author":"J Zhu","year":"2020","unstructured":"Zhu J, Shi L, Yan J, Zha H. AutoMix: mixup networks for sample interpolation via cooperative barycenter learning. In: European Conference on Computer Vision (ECCV). Cham: Springer; 2020. p. 633\u201349."},{"key":"898_CR44","doi-asserted-by":"crossref","unstructured":"Liu Z, Li S, Wu D, Chen Z, Wu L, Guo J, Li SZ. Unveiling the power of mixup for stronger classifiers. arXiv preprint arXiv:2103.13027 (2021)","DOI":"10.1007\/978-3-031-20053-3_26"},{"key":"898_CR45","unstructured":"Pope P, Zhu C, Abdelkader A, Goldblum M, Goldstein T. The intrinsic dimension of images and its impact on learning. In: International Conference on Learning Representations (ICLR). 2021. https:\/\/openreview.net\/forum?id=XJk19XzGq2J."},{"issue":"3","key":"898_CR46","doi-asserted-by":"publisher","first-page":"217","DOI":"10.1016\/0010-0277(95)00669-X","volume":"57","author":"JM Beale","year":"1995","unstructured":"Beale JM, Keil FC. Categorical effects in the perception of faces. Cognition. 1995;57(3):217\u201339.","journal-title":"Cognition"},{"issue":"2","key":"898_CR47","doi-asserted-by":"publisher","first-page":"113","DOI":"10.1016\/S0010-0277(02)00104-X","volume":"85","author":"FN Newell","year":"2002","unstructured":"Newell FN, B\u00fclthoff HH. Categorical perception of familiar objects. Cognition. 2002;85(2):113\u201343.","journal-title":"Cognition"},{"issue":"1","key":"898_CR48","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1002\/wcs.26","volume":"1","author":"RL Goldstone","year":"2010","unstructured":"Goldstone RL, Hendrickson AT. Categorical perception. Wiley Interdiscip Rev Cogn Sci. 2010;1(1):69\u201378.","journal-title":"Wiley Interdiscip Rev Cogn Sci"},{"issue":"4","key":"898_CR49","doi-asserted-by":"publisher","first-page":"814","DOI":"10.1093\/cercor\/bhs067","volume":"23","author":"JR Folstein","year":"2013","unstructured":"Folstein JR, Palmeri TJ, Gauthier I. Category learning increases discriminability of relevant object dimensions in visual cortex. Cerebral Cortex. 2013;23(4):814\u201323.","journal-title":"Cerebral Cortex"},{"key":"898_CR50","volume-title":"The nature of statistical learning theory","author":"V Vapnik","year":"1999","unstructured":"Vapnik V. The nature of statistical learning theory. Berlin: Springer; 1999."},{"key":"898_CR51","unstructured":"Chapelle O, Weston J, Bottou L, Vapnik V. Vicinal risk minimization. Advances in Neural Information Processing Systems (NeurIPS). 2001; 416\u2013422."},{"key":"898_CR52","unstructured":"Zhang C, Hsieh M-H, Tao D. Generalization bounds for vicinal risk minimization principle. arXiv preprint arXiv:1811.04351 (2018)"},{"issue":"1\u201317","key":"898_CR53","first-page":"1","volume":"12","author":"L Cayton","year":"2005","unstructured":"Cayton L. Algorithms for manifold learning. University of California at San Diego Technical Report. 2005;12(1\u201317):1.","journal-title":"University of California at San Diego Technical Report"},{"issue":"4","key":"898_CR54","doi-asserted-by":"publisher","first-page":"983","DOI":"10.1090\/jams\/852","volume":"29","author":"C Fefferman","year":"2016","unstructured":"Fefferman C, Mitter S, Narayanan H. Testing the manifold hypothesis. Journal of the American Mathematical Society. 2016;29(4):983\u20131049.","journal-title":"Journal of the American Mathematical Society"},{"key":"898_CR55","doi-asserted-by":"crossref","unstructured":"Wood E, Baltru\u0161aitis T, Hewitt C, Dziadzio S, Cashman TJ, Shotton J. Fake it till you make it: Face analysis in the wild using synthetic data alone. In: Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), 2021. pp. 3681\u20133691.","DOI":"10.1109\/ICCV48922.2021.00366"},{"key":"898_CR56","unstructured":"Wood E. Synthetic data with digital humans. Microsoft Sponsor Session, CVPR 2021 (2021). https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/09\/2019-10-01-Synthetic-Data-with-Digital-Humans.pdf."},{"key":"898_CR57","doi-asserted-by":"crossref","unstructured":"Brown CJ, Miller SP, Booth BG, Poskitt KJ, Chau V, Synnes AR, Zwicker JG, Grunau RE, Hamarneh G. Prediction of motor function in very preterm infants using connectome features and local synthetic instances. In: International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2015:69\u201376. Springer","DOI":"10.1007\/978-3-319-24553-9_9"},{"issue":"145\u2013155","key":"898_CR58","first-page":"2","volume":"2","author":"B Riemann","year":"1859","unstructured":"Riemann B. Ueber die anzahl der primzahlen unter einer gegebenen grosse. Ges Math Werke Wissenschaftlicher Nachla\u00df. 1859;2(145\u2013155):2.","journal-title":"Ges Math Werke Wissenschaftlicher Nachla\u00df"},{"key":"898_CR59","volume-title":"Deep Learning","author":"I Goodfellow","year":"2016","unstructured":"Goodfellow I, Bengio Y, Courville A. Deep Learning. USA: MIT Press; 2016."},{"key":"898_CR60","doi-asserted-by":"crossref","unstructured":"Morra L, Piano L, Lamberti F, Tommasi T. Bridging the gap between natural and medical images through deep colorization. In: 2020 25th International Conference on Pattern Recognition (ICPR), 2021:835\u2013842. IEEE","DOI":"10.1109\/ICPR48806.2021.9412444"},{"issue":"3","key":"898_CR61","doi-asserted-by":"publisher","first-page":"1224","DOI":"10.3390\/su13031224","volume":"13","author":"X Liu","year":"2021","unstructured":"Liu X, Song L, Liu S, Zhang Y. A review of deep-learning-based medical image segmentation methods. Sustainability. 2021;13(3):1224.","journal-title":"Sustainability"},{"issue":"1","key":"898_CR62","doi-asserted-by":"publisher","first-page":"137","DOI":"10.1007\/s10462-020-09854-1","volume":"54","author":"S Asgari Taghanaki","year":"2021","unstructured":"Asgari Taghanaki S, Abhishek K, Cohen JP, Cohen-Adad J, Hamarneh G. Deep semantic segmentation of natural and medical images: a review. Artificial Intelligence Review. 2021;54(1):137\u201378.","journal-title":"Artificial Intelligence Review"},{"key":"898_CR63","unstructured":"Krizhevsky A, Hinton G. Learning multiple layers of features from tiny images. Technical Report. 2009."},{"key":"898_CR64","unstructured":"Xiao H, Rasul K, Vollgraf R. Fashion-MNIST: a novel image dataset for benchmarking machine learning algorithms. arXiv preprint arXiv:1708.07747. 2017."},{"key":"898_CR65","unstructured":"Coates A, Ng A, Lee H. An analysis of single-layer networks in unsupervised feature learning. In: Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS), 2011:215\u2013223. JMLR Workshop and Conference Proceedings."},{"key":"898_CR66","unstructured":"Howard J. imagenette. GitHub. [Online. Accessed February 18, 2022] (2019)"},{"key":"898_CR67","doi-asserted-by":"crossref","unstructured":"Sandler M, Howard A, Zhu M, Zhmoginov A, Chen L-C. MobileNetV2: Inverted residuals and linear bottlenecks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018:4510\u20134520","DOI":"10.1109\/CVPR.2018.00474"},{"key":"898_CR68","unstructured":"Tan M, Le Q. EfficientNet: Rethinking model scaling for convolutional neural networks. In: International Conference on Machine Learning, 2019:6105\u20136114. PMLR"},{"key":"898_CR69","volume-title":"Dermoscopy: An Atlas","author":"SW Menzies","year":"2009","unstructured":"Menzies SW, Crotty KA, Ingvar C, McCarthy W. Dermoscopy: An Atlas. 3rd ed. Maidenhead, England: McGraw-Hill Education; 2009.","edition":"3"},{"issue":"3","key":"898_CR70","doi-asserted-by":"publisher","first-page":"159","DOI":"10.1016\/S1470-2045(02)00679-4","volume":"3","author":"H Kittler","year":"2002","unstructured":"Kittler H, Pehamberger H, Wolff K, Binder M. Diagnostic accuracy of dermoscopy. The Lancet Oncology. 2002;3(3):159\u201365.","journal-title":"The Lancet Oncology"},{"key":"898_CR71","unstructured":"Gutman D, Codella NC, Celebi E, Helba B, Marchetti M, Mishra N, Halpern A. Skin lesion analysis toward melanoma detection: a challenge at the International Symposium on Biomedical Imaging (ISBI) 2016, hosted by the International Skin Imaging Collaboration (ISIC). arXiv preprint arXiv:1605.01397. 2016."},{"key":"898_CR72","doi-asserted-by":"crossref","unstructured":"Codella NC, Gutman D, Celebi ME, Helba B, Marchetti MA, Dusza SW, Kalloo A, Liopyris K, Mishra N, Kittler H. et al.: Skin lesion analysis toward melanoma detection: a challenge at the 2017 international symposium on biomedical imaging (ISBI), hosted by the international skin imaging collaboration (ISIC). In: 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018), 2018:168\u2013172. IEEE.","DOI":"10.1109\/ISBI.2018.8363547"},{"key":"898_CR73","unstructured":"Codella N, Rotemberg V, Tschandl P, Celebi ME, Dusza S, Gutman D, Helba B, Kalloo A, Liopyris K, Marchetti M, et al. Skin lesion analysis toward melanoma detection 2018: a challenge hosted by the International Skin Imaging Collaboration (ISIC). arXiv preprint arXiv:1902.03368. 2019."},{"issue":"1","key":"898_CR74","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/sdata.2018.161","volume":"5","author":"P Tschandl","year":"2018","unstructured":"Tschandl P, Rosendahl C, Kittler H. The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Sci Data. 2018;5(1):1\u20139.","journal-title":"Sci Data"},{"key":"898_CR75","unstructured":"International Skin Imaging Collaboration (ISIC): Melanoma Project-ISIC Archive. https:\/\/www.isic-archive.com\/. [Online. Accessed February 18, 2022]. 2016."},{"key":"898_CR76","doi-asserted-by":"publisher","first-page":"63","DOI":"10.1007\/978-94-007-5389-1_4","volume-title":"Color medical image analysis","author":"L Ballerini","year":"2013","unstructured":"Ballerini L, Fisher RB, Aldridge B, Rees J. A color and texture based hierarchical K-NN approach to the classification of non-melanoma skin lesions. In: Color medical image analysis. Berlin: Springer; 2013. p. 63\u201386."},{"issue":"2","key":"898_CR77","doi-asserted-by":"publisher","first-page":"538","DOI":"10.1109\/JBHI.2018.2824327","volume":"23","author":"J Kawahara","year":"2018","unstructured":"Kawahara J, Daneshvar S, Argenziano G, Hamarneh G. Seven-point checklist and skin lesion classification using multitask multimodal neural nets. IEEE J Biomed Health Informat. 2018;23(2):538\u201346.","journal-title":"IEEE J Biomed Health Informat"},{"key":"898_CR78","doi-asserted-by":"crossref","unstructured":"Mendon\u00e7a T, Ferreira PM, Marques JS, Marcal AR, Rozeira J. PH2\u2014a dermoscopic image database for research and benchmarking. In: 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2013:5437\u20135440. IEEE.","DOI":"10.1109\/EMBC.2013.6610779"},{"issue":"19","key":"898_CR79","doi-asserted-by":"publisher","first-page":"6578","DOI":"10.1016\/j.eswa.2015.04.034","volume":"42","author":"I Giotis","year":"2015","unstructured":"Giotis I, Molders N, Land S, Biehl M, Jonkman MF, Petkov N. MED-NODE: a computer-assisted melanoma diagnosis system using non-dermoscopic images. Expert Syst Appl. 2015;42(19):6578\u201385.","journal-title":"Expert Syst Appl"},{"key":"898_CR80","unstructured":"Mosley L. A balanced approach to the multi-class imbalance problem. PhD thesis, Iowa State University. 2013."},{"key":"898_CR81","doi-asserted-by":"crossref","unstructured":"Coppola D, Lee HK, Guan C. Interpreting mechanisms of prediction for skin cancer diagnosis using multi-task learning. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2020:734\u2013735.","DOI":"10.1109\/CVPRW50498.2020.00375"},{"issue":"1","key":"898_CR82","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41598-021-87064-7","volume":"11","author":"K Abhishek","year":"2021","unstructured":"Abhishek K, Kawahara J, Hamarneh G. Predicting the clinical management of skin lesions using deep learning. Sci Rep. 2021;11(1):1\u201314.","journal-title":"Sci Rep"},{"key":"898_CR83","doi-asserted-by":"crossref","unstructured":"Yang J, Shi R, Ni B. MedMNIST classification decathlon: a lightweight autoML benchmark for medical image analysis. In: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), 2021:191\u2013195. IEEE.","DOI":"10.1109\/ISBI48211.2021.9434062"},{"issue":"1","key":"898_CR84","doi-asserted-by":"publisher","first-page":"1002730","DOI":"10.1371\/journal.pmed.1002730","volume":"16","author":"JN Kather","year":"2019","unstructured":"Kather JN, Krisam J, Charoentong P, Luedde T, Herpel E, Weis C-A, Gaiser T, Marx A, Valous NA, Ferber D, et al. Predicting survival from colorectal cancer histology slides using deep learning: a retrospective multicenter study. PLoS Med. 2019;16(1):1002730.","journal-title":"PLoS Med"},{"issue":"5","key":"898_CR85","doi-asserted-by":"publisher","first-page":"1122","DOI":"10.1016\/j.cell.2018.02.010","volume":"172","author":"DS Kermany","year":"2018","unstructured":"Kermany DS, Goldbaum M, Cai W, Valentim CC, Liang H, Baxter SL, McKeown A, Yang G, Wu X, Yan F, et al. Identifying medical diagnoses and treatable diseases by image-based deep learning. Cell. 2018;172(5):1122\u201331.","journal-title":"Cell"},{"key":"898_CR86","doi-asserted-by":"crossref","unstructured":"Acevedo A, Merino A, Alf\u00e9rez S, Molina \u00c1, Bold\u00fa L, Rodellar J. A dataset of microscopic peripheral blood cell images for development of automatic recognition systems. Data Brief 2020;30.","DOI":"10.1016\/j.dib.2020.105474"},{"issue":"7","key":"898_CR87","doi-asserted-by":"publisher","first-page":"637","DOI":"10.1038\/nmeth.2083","volume":"9","author":"V Ljosa","year":"2012","unstructured":"Ljosa V, Sokolnicki KL, Carpenter AE. Annotated high-throughput microscopy image sets for validation. Nat Methods. 2012;9(7):637\u2013637. https:\/\/doi.org\/10.1038\/nmeth.2083.","journal-title":"Nat Methods"},{"key":"898_CR88","doi-asserted-by":"publisher","DOI":"10.1016\/j.media.2022.102680","volume":"84","author":"P Bilic","year":"2023","unstructured":"Bilic P, Christ P, Li HB, Vorontsov E, Ben-Cohen A, Kaissis G, Szeskin A, Jacobs C, Mamani GEH, Chartrand G, et al. The liver tumor segmentation benchmark (LiTS). Med Image Anal. 2023;84: 102680.","journal-title":"Med Image Anal"},{"issue":"8","key":"898_CR89","doi-asserted-by":"publisher","first-page":"1885","DOI":"10.1109\/TMI.2019.2894854","volume":"38","author":"X Xu","year":"2019","unstructured":"Xu X, Zhou F, Liu B, Fu D, Bai X. Efficient multiple organ localization in CT image using 3D region proposal network. IEEE Trans Med Imag. 2019;38(8):1885\u201398.","journal-title":"IEEE Trans Med Imag"},{"key":"898_CR90","unstructured":"Collins KM, Bhatt U, Liu W, Piratla V, Love B, Weller A. Web-based elicitation of human perception on mixup data. arXiv preprint arXiv:2211.01202. 2022."},{"issue":"4","key":"898_CR91","doi-asserted-by":"publisher","first-page":"517","DOI":"10.1088\/0954-898X_5_4_006","volume":"5","author":"DL Ruderman","year":"1994","unstructured":"Ruderman DL. The statistics of natural images. Netw Comput Neural Syst. 1994;5(4):517.","journal-title":"Netw Comput Neural Syst"},{"issue":"5786","key":"898_CR92","doi-asserted-by":"publisher","first-page":"504","DOI":"10.1126\/science.1127647","volume":"313","author":"GE Hinton","year":"2006","unstructured":"Hinton GE, Salakhutdinov RR. Reducing the dimensionality of data with neural networks. Science. 2006;313(5786):504\u20137.","journal-title":"Science"},{"key":"898_CR93","unstructured":"Vincent P, Larochelle H, Lajoie I, Bengio Y, Manzagol P-A, Bottou L. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. J Mach Learn Res (JMLR). 2010;11(12)."},{"issue":"10","key":"898_CR94","doi-asserted-by":"publisher","first-page":"1368","DOI":"10.3390\/e23101368","volume":"23","author":"J Bac","year":"2021","unstructured":"Bac J, Mirkes EM, Gorban AN, Tyukin I, Zinovyev A. scikit-dimension: a Python package for intrinsic dimension estimation. Entropy. 2021;23(10):1368.","journal-title":"Entropy"},{"issue":"2","key":"898_CR95","doi-asserted-by":"publisher","first-page":"176","DOI":"10.1109\/T-C.1971.223208","volume":"100","author":"K Fukunaga","year":"1971","unstructured":"Fukunaga K, Olsen DR. An algorithm for finding intrinsic dimensionality of data. IEEE Trans Comput. 1971;100(2):176\u201383.","journal-title":"IEEE Trans Comput"},{"key":"898_CR96","unstructured":"Kim J-H, Choo W, Jeong H, Song HO. Co-Mixup: saliency guided joint mixup with supermodular diversity. arXiv preprint arXiv:2102.03065. 2021."},{"key":"898_CR97","unstructured":"Cho K. Manifold mixup: Degeneracy? https:\/\/kyunghyuncho.me\/manifold-mixup-degeneracy\/. [Online. Accessed February 18, 2022]. 2021."},{"key":"898_CR98","unstructured":"Hendrycks D, Mu N, Cubuk ED, Zoph B, Gilmer J, Lakshminarayanan B. AugMix: a simple data processing method to improve robustness and uncertainty. In: International Conference on Learning Representations (ICLR). 2019."},{"key":"898_CR99","doi-asserted-by":"crossref","unstructured":"Dabouei A, Soleymani S, Taherkhani F, Nasrabadi NM. SuperMix: supervising the mixing data augmentation. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, 2021:13794\u201313803.","DOI":"10.1109\/CVPR46437.2021.01358"},{"key":"898_CR100","unstructured":"Chen J, Sinha S, Kyrillidis A. StackMix: a complementary mix algorithm. In: Uncertainty in Artificial Intelligence, 2022:326\u2013335. PMLR."},{"issue":"86","key":"898_CR101","first-page":"2579","volume":"9","author":"L van der Maaten","year":"2008","unstructured":"van der Maaten L, Hinton G. Visualizing data using t-SNE. J Mach Learn Res. 2008;9(86):2579\u2013605.","journal-title":"J Mach Learn Res"},{"key":"898_CR102","doi-asserted-by":"publisher","unstructured":"Poli\u010dar PG, Stra\u017ear M, Zupan B. openTSNE: a modular Python library for t-SNE dimensionality reduction and embedding. bioRxiv. 2019. https:\/\/doi.org\/10.1101\/731877. https:\/\/www.biorxiv.org\/content\/early\/2019\/08\/13\/731877.full.pdf.","DOI":"10.1101\/731877"},{"key":"898_CR103","unstructured":"Biewald L. Experiment Tracking with Weights and Biases. Software available from wandb.com.2020. https:\/\/www.wandb.com\/."},{"key":"898_CR104","doi-asserted-by":"crossref","unstructured":"Mo\u010dkus J. On Bayesian methods for seeking the extremum. In: Optimization Techniques IFIP Technical Conference: Novosibirsk, July 1\u20137, 1974, pp. 400\u2013404 (1975). Springer.","DOI":"10.1007\/3-540-07165-2_55"},{"key":"898_CR105","doi-asserted-by":"publisher","DOI":"10.1007\/978-94-009-0909-0","volume-title":"Bayesian approach to global optimization: theory and applications","author":"J Mo\u010dkus","year":"1989","unstructured":"Mo\u010dkus J. Bayesian approach to global optimization: theory and applications. Berlin: Springer; 1989. https:\/\/doi.org\/10.1007\/978-94-009-0909-0."},{"key":"898_CR106","doi-asserted-by":"publisher","first-page":"455","DOI":"10.1023\/A:1008306431147","volume":"13","author":"DR Jones","year":"1998","unstructured":"Jones DR, Schonlau M, Welch WJ. Efficient global optimization of expensive black-box functions. J Global Optim. 1998;13:455\u201392.","journal-title":"J Global Optim"},{"key":"898_CR107","unstructured":"Brochu E, Cora VM, De\u00a0Freitas N. A tutorial on Bayesian optimization of expensive cost functions, with application to active user modeling and hierarchical reinforcement learning. arXiv preprint arXiv:1012.2599 (2010)"},{"key":"898_CR108","unstructured":"Inselberg A, Dimsdale B. Parallel coordinates: a tool for visualizing multi-dimensional geometry. In: Proceedings of the First IEEE Conference on Visualization: Visualization \u201990, 1990; pp. 361\u2013378. IEEE."},{"key":"898_CR109","doi-asserted-by":"crossref","unstructured":"Inselberg A. Multidimensional detective. In: Proceedings of VIZ\u201997: Visualization Conference, Information Visualization Symposium and Parallel Rendering Symposium. 1997. pp. 100\u2013107. IEEE.","DOI":"10.1109\/INFVIS.1997.636793"},{"key":"898_CR110","unstructured":"Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, Killeen T, Lin Z, Gimelshein N, Antiga L, et al. PyTorch: an imperative style, high-performance deep learning library. Adv Neural Informat Process Syst (NeurIPS). 2019;32."}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-024-00898-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s40537-024-00898-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-024-00898-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,11,14]],"date-time":"2024-11-14T22:52:31Z","timestamp":1731624751000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-024-00898-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,3,23]]},"references-count":110,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["898"],"URL":"https:\/\/doi.org\/10.1186\/s40537-024-00898-6","relation":{},"ISSN":["2196-1115"],"issn-type":[{"value":"2196-1115","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,3,23]]},"assertion":[{"value":"20 February 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 February 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 March 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"This research does not include studies involving human participants.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"43"}}