{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,10]],"date-time":"2026-01-10T11:37:37Z","timestamp":1768045057789,"version":"3.49.0"},"reference-count":56,"publisher":"Springer Science and Business Media LLC","issue":"5","license":[{"start":{"date-parts":[[2023,2,27]],"date-time":"2023-02-27T00:00:00Z","timestamp":1677456000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,2,27]],"date-time":"2023-02-27T00:00:00Z","timestamp":1677456000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001659","name":"Deutsche Forschungsgemeinschaft","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100001659","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003542","name":"Ministerium f\u00fcr Wissenschaft, Forschung und Kunst Baden-W\u00fcrttemberg","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100003542","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["The VLDB Journal"],"published-print":{"date-parts":[[2023,9]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Real-world data of multi-class classification tasks often show complex data characteristics that lead to a reduced classification performance. Major analytical challenges are a high degree of multi-class imbalance within data and a heterogeneous feature space, which increases the number and complexity of class patterns. Existing solutions to classification or data pre-processing only address one of these two challenges in isolation. We propose a novel classification approach that explicitly addresses both challenges of multi-class imbalance and heterogeneous feature space together. As main contribution, this approach exploits domain knowledge in terms of a taxonomy to systematically prepare the training data. Based on an experimental evaluation on both real-world data and several synthetically generated data sets, we show that our approach outperforms any other classification technique in terms of accuracy. Furthermore, it entails considerable practical benefits in real-world use cases, e.g., it reduces rework required in the area of product quality control.<\/jats:p>","DOI":"10.1007\/s00778-023-00780-6","type":"journal-article","created":{"date-parts":[[2023,2,27]],"date-time":"2023-02-27T13:03:03Z","timestamp":1677502983000},"page":"1037-1064","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Exploiting domain knowledge to address class imbalance and a heterogeneous feature space in multi-class classification"],"prefix":"10.1007","volume":"32","author":[{"given":"Vitali","family":"Hirsch","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Peter","family":"Reimann","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Dennis","family":"Treder-Tschechlov","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Holger","family":"Schwarz","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bernhard","family":"Mitschang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2023,2,27]]},"reference":[{"issue":"15","key":"780_CR1","doi-asserted-by":"publisher","first-page":"2955","DOI":"10.1080\/00207540410001691929","volume":"42","author":"B Agard","year":"2004","unstructured":"Agard, B., Kusiak, A.: Data-mining-based methodology for the design of product families. Int. J. Prod. Res. 42(15), 2955\u20132969 (2004). https:\/\/doi.org\/10.1080\/00207540410001691929","journal-title":"Int. J. Prod. Res."},{"key":"780_CR2","doi-asserted-by":"publisher","unstructured":"Akhand, M.A.H., Murase, K.: Neural network ensemble training by sequential interaction. In: Proceedings of the 17th International Conference on Artificial Neural Networks, LNCS, pp. 98\u2013108. Springer, Porto, Portugal (2007). https:\/\/doi.org\/10.1007\/978-3-540-74690-4_11","DOI":"10.1007\/978-3-540-74690-4_11"},{"issue":"2\u20133","key":"780_CR3","first-page":"255","volume":"17","author":"J Alcal\u00e1-Fdez","year":"2011","unstructured":"Alcal\u00e1-Fdez, J., Fern\u00e1ndez, A., Luengo, J., Derrac, J., Garc\u00eda, S.: KEEL data-mining software tool: data set repository, integration of algorithms and experimental analysis framework. Multiple-Valued Logic Soft Comput. 17(2\u20133), 255\u2013287 (2011)","journal-title":"Multiple-Valued Logic Soft Comput."},{"key":"780_CR4","doi-asserted-by":"publisher","unstructured":"Bach, S.H., Rodriguez, D., Liu, Y., Luo, C., Shao, H., Xia, C., Sen, S., Ratner, A., Hancock, B., Alborzi, H., Kuchhal, R., R\u00e9, C., Malkin, R.: Snorkel Drybell: A case study in deploying weak supervision at industrial scale. In: Proceedings of the 2019 International Conference on Management of Data (SIGMOD), pp. 362\u2013375. Amsterdam, The Netherlands (2019). https:\/\/doi.org\/10.1145\/3299869.3314036","DOI":"10.1145\/3299869.3314036"},{"issue":"4","key":"780_CR5","doi-asserted-by":"publisher","first-page":"713","DOI":"10.1515\/cclm-2012-0849","volume":"51","author":"G Baggio","year":"2013","unstructured":"Baggio, G., Corsini, A., Floreani, A., Giannini, S., Zagonel, V.: Gender medicine: a task for the third millennium. Clin Chem Lab Med 51(4), 713\u2013727 (2013). https:\/\/doi.org\/10.1515\/cclm-2012-0849","journal-title":"Clin Chem Lab Med"},{"issue":"1","key":"780_CR6","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1023\/A:1010933404324","volume":"45","author":"L Breiman","year":"2001","unstructured":"Breiman, L.: Random forests. Mach Learn 45(1), 5\u201332 (2001). https:\/\/doi.org\/10.1023\/A:1010933404324","journal-title":"Mach Learn"},{"issue":"3","key":"780_CR7","doi-asserted-by":"publisher","first-page":"365","DOI":"10.1007\/s13555-020-00372-0","volume":"10","author":"S Chan","year":"2020","unstructured":"Chan, S., Reddy, V., Myers, B., Thibodeaux, Q., Brownstone, N., Liao, W.: Machine learning in dermatology: current applications, opportunities, and limitations. Dermatol Therapy 10(3), 365\u2013386 (2020). https:\/\/doi.org\/10.1007\/s13555-020-00372-0","journal-title":"Dermatol Therapy"},{"key":"780_CR8","doi-asserted-by":"publisher","first-page":"66","DOI":"10.1016\/j.jii.2017.08.001","volume":"9","author":"Y Cheng","year":"2017","unstructured":"Cheng, Y., Chen, K., Sun, H., Zhang, Y., Tao, F.: Data and knowledge mining with big data towards smart production. J. Ind. Inf. Integr. 9, 66 (2017). https:\/\/doi.org\/10.1016\/j.jii.2017.08.001","journal-title":"J. Ind. Inf. Integr."},{"key":"780_CR9","doi-asserted-by":"publisher","unstructured":"Cowell, F.: Measuring Inequality, 3rd edn. Oxford Academic (2011). https:\/\/doi.org\/10.1093\/acprof:osobl\/9780199594030.001.0001","DOI":"10.1093\/acprof:osobl\/9780199594030.001.0001"},{"key":"780_CR10","doi-asserted-by":"publisher","first-page":"97","DOI":"10.1016\/j.knosys.2013.01.018","volume":"42","author":"A Fern\u00e1ndez","year":"2013","unstructured":"Fern\u00e1ndez, A., L\u00f3pez, V., Galar, M., del Jesus, M.J., Herrera, F.: Analysing the classification of imbalanced data-sets with multiple classes: binarization techniques and ad-hoc approaches. Knowl. Based Syst. 42, 97\u2013110 (2013). https:\/\/doi.org\/10.1016\/j.knosys.2013.01.018","journal-title":"Knowl. Based Syst."},{"issue":"6","key":"780_CR11","doi-asserted-by":"publisher","first-page":"869","DOI":"10.1001\/archderm.1988.016700600150084","volume":"124","author":"TB Fitzpatrick","year":"1988","unstructured":"Fitzpatrick, T.B.: The validity and practicality of sun-reactive skin types I through VI. Arch. Dermatol. 124(6), 869\u2013871 (1988). https:\/\/doi.org\/10.1001\/archderm.1988.016700600150084","journal-title":"Arch. Dermatol."},{"issue":"8","key":"780_CR12","doi-asserted-by":"publisher","first-page":"1761","DOI":"10.1016\/j.patcog.2011.01.017","volume":"44","author":"M Galar","year":"2011","unstructured":"Galar, M., Fern\u00e1ndez, A., Barrenechea, E., Bustince, H., Herrera, F.: An overview of ensemble methods for binary classifiers in multi-class problems: experimental study on one-vs-one and one-vs-all schemes. Pattern Recognit. 44(8), 1761\u20131776 (2011). https:\/\/doi.org\/10.1016\/j.patcog.2011.01.017","journal-title":"Pattern Recognit."},{"key":"780_CR13","doi-asserted-by":"publisher","unstructured":"Galar, M., Fern\u00e1ndez, A., Barrenechea, E., Bustince, H., Herrera, F.: A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches. IEEE Trans. Syst. Man Cybern. C Appl. Rev. 42(4), 463\u2013484 (2012). https:\/\/doi.org\/10.1109\/TSMCC.2011.2161285","DOI":"10.1109\/TSMCC.2011.2161285"},{"key":"780_CR14","doi-asserted-by":"publisher","unstructured":"Gerling, A., Schreier, U., Hess, A., Saleh, A., Ziekow, H., Ould\u00a0Abdeslam, D.: A reference process model for machine learning aided production quality management. In: Proceedings of the 22nd International Conference on Enterprise Information Systems (ICEIS 2020), pp. 515\u2013523. Prague, Czechia (2020). https:\/\/doi.org\/10.5220\/0009379705150523","DOI":"10.5220\/0009379705150523"},{"issue":"121","key":"780_CR15","doi-asserted-by":"publisher","first-page":"124","DOI":"10.2307\/2223319","volume":"31","author":"C Gini","year":"1921","unstructured":"Gini, C.: Measurement of inequality of incomes. Econ J 31(121), 124\u2013126 (1921). https:\/\/doi.org\/10.2307\/2223319","journal-title":"Econ J"},{"key":"780_CR16","doi-asserted-by":"publisher","first-page":"220","DOI":"10.1016\/j.eswa.2016.12.035","volume":"73","author":"G Haixiang","year":"2017","unstructured":"Haixiang, G., Yijing, L., Shang, J., Mingyun, G., Yuanyue, H., Bing, G.: Learning from class-imbalanced data: review of methods and applications. Expert Syst. Appl. 73, 220\u2013239 (2017). https:\/\/doi.org\/10.1016\/j.eswa.2016.12.035","journal-title":"Expert Syst. Appl."},{"issue":"9","key":"780_CR17","doi-asserted-by":"publisher","first-page":"1263","DOI":"10.1109\/TKDE.2008.239","volume":"21","author":"H He","year":"2009","unstructured":"He, H., Garcia, E.A.: Learning from imbalanced data. IEEE Trans. Knowl. Data Eng. 21(9), 1263\u20131284 (2009). https:\/\/doi.org\/10.1109\/TKDE.2008.239","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"780_CR18","doi-asserted-by":"publisher","first-page":"1333","DOI":"10.1016\/j.procir.2018.03.024","volume":"72","author":"V Hirsch","year":"2018","unstructured":"Hirsch, V., Reimann, P., Kirn, O., Mitschang, B.: Analytical approach to support fault diagnosis and quality control in end-of-line testing. Procedia CIRP 72, 1333\u20131338 (2018). https:\/\/doi.org\/10.1016\/j.procir.2018.03.024","journal-title":"Procedia CIRP"},{"key":"780_CR19","doi-asserted-by":"publisher","unstructured":"Hirsch, V., Reimann, P., Mitschang, B.: Data-driven fault diagnosis in end-of-line testing of complex products. In: Proceedings of the 6th IEEE International Conference on Data Science and Advanced Analytics (DSAA), pp. 492\u2013503. IEEE (2019). https:\/\/doi.org\/10.1109\/DSAA.2019.00064","DOI":"10.1109\/DSAA.2019.00064"},{"key":"780_CR20","doi-asserted-by":"publisher","first-page":"747","DOI":"10.1016\/j.procir.2020.03.026","volume":"74","author":"V Hirsch","year":"2020","unstructured":"Hirsch, V., Reimann, P., Mitschang, B.: Approach to incorporate cost aspects into the ordering of a data-driven recommendation list for end-of-line testing. Procedia CIRP 74, 747\u2013752 (2020). https:\/\/doi.org\/10.1016\/j.procir.2020.03.026","journal-title":"Procedia CIRP"},{"issue":"12","key":"780_CR21","doi-asserted-by":"publisher","first-page":"3258","DOI":"10.14778\/3415478.3415549","volume":"13","author":"V Hirsch","year":"2020","unstructured":"Hirsch, V., Reimann, P., Mitschang, B.: Exploiting domain knowledge to address multi-class imbalance and a heterogeneous feature space in classification tasks for manufacturing data. PVLDB 13(12), 3258\u20133271 (2020). https:\/\/doi.org\/10.14778\/3415478.3415549","journal-title":"PVLDB"},{"issue":"1","key":"780_CR22","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1016\/j.cirp.2008.03.138","volume":"57","author":"S Hu","year":"2008","unstructured":"Hu, S., Zhu, X., Wang, H., Koren, Y.: Product variety and manufacturing complexity in assembly systems and supply chains. CIRP Ann. 57(1), 45\u201348 (2008). https:\/\/doi.org\/10.1016\/j.cirp.2008.03.138","journal-title":"CIRP Ann."},{"issue":"6","key":"780_CR23","doi-asserted-by":"publisher","first-page":"401","DOI":"10.2471\/BLT.12.020612","volume":"90","author":"G Humphreys","year":"2012","unstructured":"Humphreys, G.: Coming together to combat rare diseases. Bull. World Health Organ. 90(6), 401\u2013476 (2012). https:\/\/doi.org\/10.2471\/BLT.12.020612","journal-title":"Bull. World Health Organ."},{"key":"780_CR24","doi-asserted-by":"publisher","first-page":"585","DOI":"10.1146\/annurev.anthro.33.070203.143955","volume":"33","author":"N Jablonski","year":"2004","unstructured":"Jablonski, N.: The evolution of human skin and skin color. Ann. Rev. Anthropol. 33, 585\u2013623 (2004). https:\/\/doi.org\/10.1146\/annurev.anthro.33.070203.143955","journal-title":"Ann. Rev. Anthropol."},{"key":"780_CR25","doi-asserted-by":"publisher","unstructured":"Kassner, L., Mitschang, B.: Exploring text classification for messy data: an industry use case for domain-specific analytics technology. In: Proceedings of the 19th International Conference on Extending Database Technology (EDBT), pp. 491\u2013502. Bordeaux, France (2016). https:\/\/doi.org\/10.5441\/002\/edbt.2016.47","DOI":"10.5441\/002\/edbt.2016.47"},{"key":"780_CR26","doi-asserted-by":"publisher","unstructured":"Kiefer, C., Reimann, P., Mitschang, B.: A hybrid information extraction approach exploiting structured data within a text mining process. In: Proceedings of the 18th Conference on Datenbanksysteme f\u00fcr Business, Technologie und Web (BTW), pp. 149\u2013168. Rostock, Germany (2019). https:\/\/doi.org\/10.18420\/btw2019-10","DOI":"10.18420\/btw2019-10"},{"issue":"10","key":"780_CR27","doi-asserted-by":"publisher","first-page":"13448","DOI":"10.1016\/j.eswa.2011.04.063","volume":"38","author":"G K\u00f6ksal","year":"2011","unstructured":"K\u00f6ksal, G., Batmaz, I., Testik, M.C.: A review of data mining applications for quality improvement in manufacturing industry. Expert Syst Appl. 38(10), 13448\u201313467 (2011). https:\/\/doi.org\/10.1016\/j.eswa.2011.04.063","journal-title":"Expert Syst Appl."},{"issue":"11","key":"780_CR28","doi-asserted-by":"publisher","first-page":"66","DOI":"10.18637\/jss.v036.i11","volume":"36","author":"MB Kursa","year":"2010","unstructured":"Kursa, M.B., Rudnicki, W.R.: Feature selection with the Boruta package. J. Stat. Softw. 36(11), 66 (2010). https:\/\/doi.org\/10.18637\/jss.v036.i11","journal-title":"J. Stat. Softw."},{"issue":"42","key":"780_CR29","doi-asserted-by":"publisher","first-page":"66","DOI":"10.1186\/s40537-018-0151-6","volume":"5","author":"JL Leevy","year":"2018","unstructured":"Leevy, J.L., Khoshgoftaar, T.M., Bauder, R.A., Seliya, N.: A survey on addressing high-class imbalance in big data. J. Big Data 5(42), 66 (2018). https:\/\/doi.org\/10.1186\/s40537-018-0151-6","journal-title":"J. Big Data"},{"key":"780_CR30","doi-asserted-by":"publisher","unstructured":"Liu, Y., Jin, R., Jain, A.: BoostCluster: boosting clustering by pairwise constraints. In: Proceedings of the 13th International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 450\u2013459. San Jose, CA, USA (2007). https:\/\/doi.org\/10.1145\/1281192.1281242","DOI":"10.1145\/1281192.1281242"},{"key":"780_CR31","doi-asserted-by":"publisher","first-page":"66","DOI":"10.1145\/3457607","volume":"54","author":"N Mehrabi","year":"2021","unstructured":"Mehrabi, N., Morstatter, F., Saxena, N., Lerman, K., Galstyan, A.: A survey on bias and fairness in machine learning. ACM Comput. Surv. 54, 66 (2021). https:\/\/doi.org\/10.1145\/3457607","journal-title":"ACM Comput. Surv."},{"issue":"18","key":"780_CR32","doi-asserted-by":"publisher","first-page":"66","DOI":"10.3390\/app9183865","volume":"9","author":"M Mehrpouya","year":"2019","unstructured":"Mehrpouya, M., Dehghanghadikolaei, A., Fotovvati, B., Vosooghnia, A., Emamian, S.S., Gisario, A.: The potential of additive manufacturing in the smart factory industrial 4.0: a review. Appl. Sci. 9(18), 66 (2019). https:\/\/doi.org\/10.3390\/app9183865","journal-title":"Appl. Sci."},{"issue":"1","key":"780_CR33","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1016\/j.artmed.2011.11.006","volume":"55","author":"L Nanni","year":"2012","unstructured":"Nanni, L., Lumini, A., Brahnam, S.: A classifier ensemble approach for the missing feature problem. Artif. Intell. Med. 55(1), 37\u201350 (2012). https:\/\/doi.org\/10.1016\/j.artmed.2011.11.006","journal-title":"Artif. Intell. Med."},{"issue":"3","key":"780_CR34","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1109\/MCAS.2006.1688199","volume":"6","author":"R Polikar","year":"2006","unstructured":"Polikar, R.: Ensemble based systems in decision making. IEEE Circuits Syst. Mag. 6(3), 21\u201345 (2006). https:\/\/doi.org\/10.1109\/MCAS.2006.1688199","journal-title":"IEEE Circuits Syst. Mag."},{"key":"780_CR35","doi-asserted-by":"publisher","unstructured":"Polikar, R., DePasquale, J., Syed Mohammed, H., Brown, G., Kuncheva, L.I.: Learn++.MF: a random subspace approach for the missing feature problem. Pattern Recognit. 43(11), 3817\u20133832 (2010). https:\/\/doi.org\/10.1016\/j.patcog.2010.05.028","DOI":"10.1016\/j.patcog.2010.05.028"},{"key":"780_CR36","doi-asserted-by":"publisher","first-page":"410","DOI":"10.1002\/bs.3830120511","volume":"12","author":"R Quillian","year":"1967","unstructured":"Quillian, R.: Word concepts. A theory and simulation of some basic semantic capabilities. Behav. Sci. 12, 410\u2013430 (1967). https:\/\/doi.org\/10.1002\/bs.3830120511","journal-title":"Behav. Sci."},{"key":"780_CR37","doi-asserted-by":"publisher","first-page":"709","DOI":"10.1007\/s00778-019-00552-1","volume":"29","author":"A Ratner","year":"2020","unstructured":"Ratner, A., Bach, S.H., Ehrenberg, H., Fries, J., Wu, S., R\u00e9, C.: Snorkel: rapid training data creation with weak supervision. VLDB J. 29, 709\u2013730 (2020). https:\/\/doi.org\/10.1007\/s00778-019-00552-1","journal-title":"VLDB J."},{"issue":"1\u20132","key":"780_CR38","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s10462-009-9124-7","volume":"33","author":"L Rokach","year":"2010","unstructured":"Rokach, L.: Ensemble-based classifiers. Artif. Intell. Rev. 33(1\u20132), 1\u201339 (2010). https:\/\/doi.org\/10.1007\/s10462-009-9124-7","journal-title":"Artif. Intell. Rev."},{"issue":"1\u20132","key":"780_CR39","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1007\/s10618-010-0175-9","volume":"22","author":"CN Silla","year":"2011","unstructured":"Silla, C.N., Freitas, A.A.: A survey of hierarchical classification across different application domains. Data Min Knowl Discov 22(1\u20132), 31\u201372 (2011). https:\/\/doi.org\/10.1007\/s10618-010-0175-9","journal-title":"Data Min Knowl Discov"},{"key":"780_CR40","unstructured":"Sowa, J.F.: Principles of Semantic Networks. Explorations in the Representation of Knowledge. Representation and Reasoning. Morgan Kaufmann (1991)"},{"issue":"13","key":"780_CR41","doi-asserted-by":"publisher","first-page":"1529","DOI":"10.14778\/2733004.2733024","volume":"7","author":"C Sun","year":"2014","unstructured":"Sun, C., Rampalli, N., Yang, F., Doan, A.: Chimera: large-scale classification using machine learning, rules, and crowdsourcing. PVLDB 7(13), 1529\u20131540 (2014). https:\/\/doi.org\/10.14778\/2733004.2733024","journal-title":"PVLDB"},{"issue":"04","key":"780_CR42","doi-asserted-by":"publisher","first-page":"687","DOI":"10.1142\/S0218001409007326","volume":"23","author":"Y Sun","year":"2009","unstructured":"Sun, Y., Wong, A., Kamel, M.: Classification of imbalanced data: a review. Int. J. Pattern Recognit. Artif. Intell. 23(04), 687\u2013719 (2009). https:\/\/doi.org\/10.1142\/S0218001409007326","journal-title":"Int. J. Pattern Recognit. Artif. Intell."},{"issue":"5","key":"780_CR43","doi-asserted-by":"publisher","first-page":"1623","DOI":"10.1016\/j.patcog.2014.11.014","volume":"48","author":"Z Sun","year":"2015","unstructured":"Sun, Z., Song, Q., Zhu, X., Sun, H., Xu, B., Zhou, Y.: A novel ensemble method for classifying imbalanced data. Pattern Recognit. 48(5), 1623\u20131637 (2015). https:\/\/doi.org\/10.1016\/j.patcog.2014.11.014","journal-title":"Pattern Recognit."},{"key":"780_CR44","doi-asserted-by":"publisher","unstructured":"Suresh, H., Guttag, J.: A framework for understanding sources of harm throughout the machine learning life cycle. In: Proceedings of the 1st ACM Conference on Equity and Access in Algorithms, Mechanisms, and Optimization (EAAMO) (2021). https:\/\/doi.org\/10.1145\/3465416.3483305","DOI":"10.1145\/3465416.3483305"},{"key":"780_CR45","unstructured":"Thalmann, S., Gursch, H.G., Suschnigg, J., Gashi, M., Ennsbrunner, H., Fuchs, A.K., Schreck, T., Mutlu, B., Mangler, J., Kappl, G., Huemer, C., Lindstaedt, S.: Cognitive decision support for industrial product life cycles: a position paper. In: Proceedings of the 11th International Conference on Advanced Cognitive Technologies and Applications (COGNITIVE). IARIA, Venice, Italy (2019)"},{"key":"780_CR46","unstructured":"Treder-Tschechlov, D., Reimann, P., Schwarz, H., Mitschang, B.: Approach to synthetic data generation for imbalanced multi-class problems with heterogeneous groups. In: Proceedings of the 20th Conference on Datenbanksysteme f\u00fcr Business, Technologie und Web (BTW). Dresden, Germany (2023)"},{"issue":"8","key":"780_CR47","doi-asserted-by":"publisher","first-page":"902","DOI":"10.1016\/j.jprocont.2010.06.001","volume":"20","author":"S Verron","year":"2010","unstructured":"Verron, S., Li, J., Tiplica, T.: Fault detection and isolation of faults in a multivariate process with Bayesian network. J. Process Control 20(8), 902\u2013911 (2010). https:\/\/doi.org\/10.1016\/j.jprocont.2010.06.001","journal-title":"J. Process Control"},{"key":"780_CR48","unstructured":"Wagstaff, K., Cardie, C., Rogers, S., Schr\u00f6dl, S.: Constrained K-means clustering with background knowledge. In: Proceedings of the 18th International Conference on Machine Learning (ICML), pp. 577\u2013584. Williamstown, MA, USA (2001)"},{"issue":"10","key":"780_CR49","doi-asserted-by":"publisher","first-page":"4802","DOI":"10.1109\/TNNLS.2017.2771290","volume":"29","author":"S Wang","year":"2018","unstructured":"Wang, S., Minku, L.L., Yao, X.: A systematic study of online class imbalance learning with concept drift. IEEE Trans. Neural Netw. Learn. Syst. 29(10), 4802\u20134821 (2018). https:\/\/doi.org\/10.1109\/TNNLS.2017.2771290","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"780_CR50","doi-asserted-by":"publisher","unstructured":"Wang, S., Yao, X.: Multiclass imbalance problems: analysis and potential solutions. IEEE Trans. Syst. Man Cybernet. B Cybernet. 42(4), 1119\u20131130 (2012). https:\/\/doi.org\/10.1109\/TSMCB.2012.2187280","DOI":"10.1109\/TSMCB.2012.2187280"},{"key":"780_CR51","doi-asserted-by":"publisher","unstructured":"Weber, C., Hirmer, P., Reimann, P.: A model management platform for industry 4.0\u2014 enabling management of machine learning models in manufacturing environments. In: Proceedings of the 23rd International Conference on Business Information Systems (BIS), pp. 403\u2013417 (2020). https:\/\/doi.org\/10.1007\/978-3-030-53337-3_30","DOI":"10.1007\/978-3-030-53337-3_30"},{"key":"780_CR52","doi-asserted-by":"publisher","unstructured":"Whitley, H.P., Smith, W.D.: Sex-based differences in medications for heart failure. The Lancet 394(10205), 1210\u20131212 (2019). https:\/\/doi.org\/10.1016\/S0140-6736(19)31812-4","DOI":"10.1016\/S0140-6736(19)31812-4"},{"key":"780_CR53","doi-asserted-by":"publisher","unstructured":"Wilhelm, Y., Schreier, U., Reimann, P., Mitschang, B., Ziekow, H.: Data science approaches to quality control in manufacturing: a review of problems, challenges and architecture. In: Proceedings of the 14th Symposium on Service-Oriented Computing (SummerSOC), Communications in Computer and Information Science (CCIS), pp. 45\u201365. Springer (2020). https:\/\/doi.org\/10.1007\/978-3-030-64846-6_4","DOI":"10.1007\/978-3-030-64846-6_4"},{"key":"780_CR54","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1016\/j.inffus.2013.04.006","volume":"16","author":"M Wo\u017aniak","year":"2014","unstructured":"Wo\u017aniak, M., Gra\u00f1a, M., Corchado, E.: A survey of multiple classifier systems as hybrid systems. Inf. Fusion 16, 3\u201317 (2014). https:\/\/doi.org\/10.1016\/j.inffus.2013.04.006","journal-title":"Inf. Fusion"},{"issue":"1","key":"780_CR55","doi-asserted-by":"publisher","first-page":"23","DOI":"10.1080\/21693277.2016.1192517","volume":"4","author":"T Wuest","year":"2016","unstructured":"Wuest, T., Weimer, D., Irgens, C., Thoben, K.D.: Machine learning in manufacturing: advantages, challenges, and applications. Prod. Manuf. Res. 4(1), 23\u201345 (2016). https:\/\/doi.org\/10.1080\/21693277.2016.1192517","journal-title":"Prod. Manuf. Res."},{"key":"780_CR56","unstructured":"Zhou, Z.H., Liu, X.Y.: On multi-class cost-sensitive learning. In: Proceedings of the 21st National Conference on Artificial Intelligence\u2014Vol. 1 (AAAI\u201906), pp. 567\u2013572. AAAI Press, Boston, MA, USA (2006)"}],"container-title":["The VLDB Journal"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00778-023-00780-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s00778-023-00780-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00778-023-00780-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,15]],"date-time":"2023-08-15T14:11:41Z","timestamp":1692108701000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s00778-023-00780-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,2,27]]},"references-count":56,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2023,9]]}},"alternative-id":["780"],"URL":"https:\/\/doi.org\/10.1007\/s00778-023-00780-6","relation":{},"ISSN":["1066-8888","0949-877X"],"issn-type":[{"value":"1066-8888","type":"print"},{"value":"0949-877X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,2,27]]},"assertion":[{"value":"16 February 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 December 2022","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 January 2023","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 February 2023","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}