{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,14]],"date-time":"2026-04-14T14:19:32Z","timestamp":1776176372043,"version":"3.50.1"},"reference-count":53,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,2,3]],"date-time":"2023-02-03T00:00:00Z","timestamp":1675382400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,2,3]],"date-time":"2023-02-03T00:00:00Z","timestamp":1675382400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>The recent evolution of machine learning\u00a0(ML) algorithms and the high level of expertise required to use them have fuelled the demand for non-experts solutions. The selection of an appropriate algorithm and the configuration of its hyperparameters is among the most complicated tasks while applying ML to new problems. It necessitates well awareness and knowledge of ML algorithms. The algorithm selection problem\u00a0(ASP) is defined as the process of identifying the algorithm\u00a0(s) that can deliver top performance for a particular problem, task, and evaluation measure. In this context, <jats:italic>meta-learning<\/jats:italic> is one of the approaches to achieve this objective by using prior learning experiences to assist the learning process on unseen problems and tasks. As a data-driven approach, appropriate data characterization is of vital importance for the meta-learning. Nonetheless, the recent literature witness a variety of data characterization techniques including simple, statistical and information theory based measures. However, their quality still needs to be improved. In this paper, a new Autoencoder-kNN (AeKNN) based meta-model with built-in latent features extraction is proposed. The approach is aimed to extract new characterizations of the data, with lower dimensionality but more significant and meaningful features. AeKNN internally uses a deep autoencoder as a latent features extractor from a set of existing meta-features induced from the dataset. From this new features vectors the computed distances are more significant, thus providing a way to accurately recommending top-performing pipelines for previously unseen datasets. In an application on a large-scale hyperparameters optimization task for 400 real world datasets with varying schemas as a meta-learning task, we show that AeKNN offers considerable improvements of the classical kNN as well as traditional meta-models in terms of performance.<\/jats:p>","DOI":"10.1186\/s40537-023-00687-7","type":"journal-article","created":{"date-parts":[[2023,2,3]],"date-time":"2023-02-03T16:04:30Z","timestamp":1675440270000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":15,"title":["Autoencoder-kNN meta-model based data characterization approach for an automated selection of AI algorithms"],"prefix":"10.1186","volume":"10","author":[{"given":"Moncef","family":"Garouani","sequence":"first","affiliation":[]},{"given":"Adeel","family":"Ahmad","sequence":"additional","affiliation":[]},{"given":"Mourad","family":"Bouneffa","sequence":"additional","affiliation":[]},{"given":"Mohamed","family":"Hamlich","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,2,3]]},"reference":[{"key":"687_CR1","doi-asserted-by":"publisher","DOI":"10.1186\/s40537-022-00612-4","author":"M Garouani","year":"2022","unstructured":"Garouani M, Ahmad A, Bouneffa M, Hamlich M, Bourguin G, Lewandowski A. Using meta-learning for automated algorithms selection and configuration: an experimental framework for industrial big data. J Big Data. 2022. https:\/\/doi.org\/10.1186\/s40537-022-00612-4.","journal-title":"J Big Data."},{"issue":"1","key":"687_CR2","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1186\/s40537-021-00419-9","volume":"8","author":"A Adadi","year":"2021","unstructured":"Adadi A. A survey on data-efficient algorithms in big data era. J Big Data. 2021;8(1):24. https:\/\/doi.org\/10.1186\/s40537-021-00419-9.","journal-title":"J Big Data"},{"issue":"1","key":"687_CR3","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1186\/s40537-020-00398-3","volume":"8","author":"M Rostami","year":"2020","unstructured":"Rostami M, Berahmand K, Forouzandeh S. A novel community detection based genetic algorithm for feature selection. J Big Data. 2020;8(1):2. https:\/\/doi.org\/10.1186\/s40537-020-00398-3.","journal-title":"J Big Data"},{"key":"687_CR4","doi-asserted-by":"publisher","DOI":"10.1016\/j.softx.2021.100919","volume":"17","author":"M Garouani","year":"2022","unstructured":"Garouani M, Ahmad A, Bouneffa M, Hamlich M. AMLBID: An auto-explained automated machine learning tool for big industrial data. SoftwareX. 2022;17: 100919. https:\/\/doi.org\/10.1016\/j.softx.2021.100919.","journal-title":"SoftwareX"},{"key":"687_CR5","unstructured":"Feurer M, Klein A, Eggensperger K, Springenberg JT, Blum M, Hutter F. Efficient and robust automated machine learning. In: Proceedings of the 28th International Conference on Neural Information Processing Systems - Volume 2. NIPS\u201915, pp. 2755\u20132763. MIT Press."},{"issue":"1\u20132","key":"687_CR6","doi-asserted-by":"publisher","first-page":"1169","DOI":"10.1007\/s00170-022-08761-9","volume":"120","author":"M Garouani","year":"2022","unstructured":"Garouani M, Ahmad A, Bouneffa M, Hamlich M, Bourguin G, Lewandowski A. Towards big industrial data mining through explainable automated machine learning. Int J Advan Manuf Technol. 2022;120(1\u20132):1169\u201388. https:\/\/doi.org\/10.1007\/s00170-022-08761-9.","journal-title":"Int J Advan Manuf Technol"},{"key":"687_CR7","doi-asserted-by":"crossref","unstructured":"Garouani M, Ahmad A, Bouneffa M, Hamlich M. Scalable Meta-Bayesian Based Hyperparameters Optimization for Machine Learning. In: Hamlich M, Bellatreche L, Siadat A, Ventura S, editor. Smart Applications and Data Analysis. SADASC 2022. Communications in Computer and Information Science, vol 1677. Cham: Springer; 2022. https:\/\/doi.org\/10.1007\/978-3-031-20490-6_14","DOI":"10.1007\/978-3-031-20490-6_14"},{"key":"687_CR8","doi-asserted-by":"publisher","unstructured":"Garouani M, Ahmad A, Bouneffa M, Lewandowski A, Bourguin G, Hamlich M. Towards the Automation of Industrial Data Science: A Meta-learning based Approach. In: 23rd International Conference on Enterprise Information Systems, pp. 709\u2013716. https:\/\/doi.org\/10.5220\/0010457107090716.","DOI":"10.5220\/0010457107090716"},{"key":"687_CR9","unstructured":"Laadan D, Vainshtein R, Curiel Y, Katz G, Rokach L. RankML: a Meta Learning-Based Approach for Pre-Ranking Machine Learning Pipelines. 2019. 1911.00108."},{"key":"687_CR10","doi-asserted-by":"crossref","unstructured":"Garouani M, Zaysa K. Leveraging the Automated Machine Learning for Arabic Opinion Mining: A Preliminary Study on AutoML Tools and Comparison to Human Performance. In: Motahhir S, Bossoufi B, editor. Digital Technologies and Applications. ICDTA 2022. Lecture Notes in Networks and Systems, vol 455. Cham: Springer; 2022. https:\/\/doi.org\/10.1007\/978-3-031-02447-4_17","DOI":"10.1007\/978-3-031-02447-4_17"},{"key":"687_CR11","doi-asserted-by":"publisher","unstructured":"Thornton C, Hutter F, Hoos HH, Leyton-Brown K. Auto-WEKA: Combined selection and hyperparameter optimization of classification algorithms. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. KDD \u201913, pp. 847\u2013855. Association for Computing Machinery. https:\/\/doi.org\/10.1145\/2487575.2487629.","DOI":"10.1145\/2487575.2487629"},{"key":"687_CR12","doi-asserted-by":"publisher","unstructured":"Olson RS, Moore JH. TPOT: A Tree-Based Pipeline Optimization Tool for Automating Machine Learning. In: Hutter F, Kotthoff L, Vanschoren J (eds.) Automated Machine Learning: Methods, Systems, Challenges. The Springer Series on Challenges in Machine Learning, pp. 151\u2013160. Springer International Publishing. https:\/\/doi.org\/10.1007\/978-3-030-05318-5_8.","DOI":"10.1007\/978-3-030-05318-5_8"},{"key":"687_CR13","doi-asserted-by":"publisher","unstructured":"Garouani M, Hamlich M, Ahmad A, Bouneffa M, Bourguin G, Lewandowski A. Toward an\u00a0automatic assistance framework for\u00a0the\u00a0selection and\u00a0configuration of\u00a0machine learning based data analytics solutions in\u00a0industry 4.0. In: Proceedings of the 5th International Conference on Big Data and Internet of Things, pp. 3\u201315. Springer. https:\/\/doi.org\/10.1007\/978-3-031-07969-6_1.","DOI":"10.1007\/978-3-031-07969-6_1"},{"key":"687_CR14","doi-asserted-by":"publisher","unstructured":"Nural MV, Peng H, Miller JA. Using meta-learning for model type selection in predictive big data analytics. In: 2017 IEEE International Conference on Big Data (Big Data), pp. 2027\u20132036. https:\/\/doi.org\/10.1109\/BigData.2017.8258149.","DOI":"10.1109\/BigData.2017.8258149"},{"key":"687_CR15","doi-asserted-by":"publisher","unstructured":"Garouani M, Ahmad A, Bouneffa M, Hamlich M, Bourguin G, Lewandowski A. Towards meta-learning based data analytics to\u00a0better assist the\u00a0domain experts in\u00a0industry 4.0. In: Artificial Intelligence in Data and Big Data Processing, pp. 265\u2013277. Springer. https:\/\/doi.org\/10.1007\/978-3-030-97610-1_22.","DOI":"10.1007\/978-3-030-97610-1_22"},{"key":"687_CR16","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"publisher","first-page":"141","DOI":"10.1007\/3-540-36182-0_14","volume-title":"Discovery Science","author":"Y Peng","year":"2002","unstructured":"Peng Y, Flach PA, Soares C, Brazdil P. Improved Dataset Characterisation for Meta-learning. In: Lange S, Satoh K, Smith CH, editors. Discovery Science. Lecture Notes in Computer Science. Springer: Berlin; 2002. p. 141\u201352. https:\/\/doi.org\/10.1007\/3-540-36182-0_14."},{"key":"687_CR17","unstructured":"Vanschoren J. Meta-Learning: A Survey. arxiv:1810.03548."},{"key":"687_CR18","doi-asserted-by":"publisher","unstructured":"Matejka J, Fitzmaurice G. Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing. In: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, pp. 1290\u20131294. Association for Computing Machinery. https:\/\/doi.org\/10.1145\/3025453.3025912.","DOI":"10.1145\/3025453.3025912"},{"key":"687_CR19","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"publisher","first-page":"222","DOI":"10.1007\/3-540-45357-1_26","volume-title":"Advances in Knowledge Discovery and Data Mining","author":"A Kalousis","year":"2001","unstructured":"Kalousis A, Hilario M. Feature Selection for Meta-learning. In: Cheung D, Williams GJ, Li Q, editors. Advances in Knowledge Discovery and Data Mining. Lecture Notes in Computer Science. Berlin: Springer; 2001. p. 222\u201333. https:\/\/doi.org\/10.1007\/3-540-45357-1_26."},{"key":"687_CR20","first-page":"78","volume":"111","author":"Y Pavel","year":"2002","unstructured":"Pavel Y, Soares BC. Decision tree-based data characterization for meta-learning. IDDM. 2002;111:78.","journal-title":"IDDM"},{"key":"687_CR21","unstructured":"Meskhi MM, Rivolli A, Mantovani RG, Vilalta R. Learning Abstract Task Representations. arxiv:2101.07852."},{"issue":"1","key":"687_CR22","doi-asserted-by":"publisher","first-page":"53","DOI":"10.1186\/s40537-021-00444-8","volume":"8","author":"L Alzubaidi","year":"2021","unstructured":"Alzubaidi L, Zhang J, Humaidi AJ, Al-Dujaili A, Duan Y, Al-Shamma O, Santamar\u00eda J, Fadhel MA, Al-Amidie M, Farhan L. Review of deep learning: Concepts, CNN architectures, challenges, applications, future directions. J Big Data. 2021;8(1):53. https:\/\/doi.org\/10.1186\/s40537-021-00444-8.","journal-title":"J Big Data."},{"issue":"34","key":"687_CR23","doi-asserted-by":"publisher","first-page":"197","DOI":"10.1561\/2000000039","volume":"7","author":"L Deng","year":"2014","unstructured":"Deng L, Yu D. Deep learning: methods and applications. Found Trends Signal Processing. 2014;7(34):197\u2013387. https:\/\/doi.org\/10.1561\/2000000039.","journal-title":"Found Trends Signal Processing"},{"key":"687_CR24","doi-asserted-by":"publisher","unstructured":"Gosztolya G, Busa-Fekete R, Gr\u00f3sz T, T\u00f3th L. DNN-Based Feature Extraction and Classifier Combination for Child-Directed Speech, Cold and Snoring Identification. In: Interspeech 2017, pp. 3522\u20133526. ISCA. https:\/\/doi.org\/10.21437\/Interspeech.2017-905.","DOI":"10.21437\/Interspeech.2017-905"},{"key":"687_CR25","doi-asserted-by":"publisher","unstructured":"Wang W, Huang Y, Wang Y, Wang L. Generalized Autoencoder: A Neural Network Framework for Dimensionality Reduction. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 496\u2013503. https:\/\/doi.org\/10.1109\/CVPRW.2014.79.","DOI":"10.1109\/CVPRW.2014.79"},{"issue":"1","key":"687_CR26","doi-asserted-by":"publisher","first-page":"159","DOI":"10.1007\/s10115-018-1156-3","volume":"57","author":"V Bhatia","year":"2018","unstructured":"Bhatia V, Rani R. DFuzzy: A deep learning-based fuzzy clustering model for large graphs. Knowl Inform Syst. 2018;57(1):159\u201381. https:\/\/doi.org\/10.1007\/s10115-018-1156-3.","journal-title":"Knowl Inform Syst"},{"key":"687_CR27","doi-asserted-by":"publisher","unstructured":"Vincent P, Larochelle H, Bengio Y, Manzagol P-A. Extracting and composing robust features with denoising autoencoders. In: Proceedings of the 25th International Conference on Machine Learning. ICML \u201908, pp. 1096\u20131103. Association for Computing Machinery. https:\/\/doi.org\/10.1145\/1390156.1390294.","DOI":"10.1145\/1390156.1390294"},{"issue":"1","key":"687_CR28","doi-asserted-by":"publisher","first-page":"436","DOI":"10.2991\/ijcis.2018.125905686","volume":"12","author":"FJ Pulgar","year":"2018","unstructured":"Pulgar FJ, Charte F, Rivera AJ, del Jesus MJ. AEkNN: An AutoEncoder kNN-Based Classifier With Built-in Dimensionality Reduction. Int J Comput Intell Syst. 2018;12(1):436\u201352. https:\/\/doi.org\/10.2991\/ijcis.2018.125905686.","journal-title":"Int J Comput Intell Syst"},{"key":"687_CR29","doi-asserted-by":"publisher","first-page":"224","DOI":"10.1016\/j.ins.2015.05.010","volume":"317","author":"MA Mu\u00f1oz","year":"2015","unstructured":"Mu\u00f1oz MA, Sun Y, Kirley M, Halgamuge SK. Algorithm selection for black-box continuous optimization problems: A survey on methods and challenges. Information Sci. 2015;317:224\u201345. https:\/\/doi.org\/10.1016\/j.ins.2015.05.010.","journal-title":"Information Sci"},{"key":"687_CR30","doi-asserted-by":"crossref","unstructured":"Feurer M, Springenberg J, Hutter F. Initializing Bayesian Hyperparameter Optimization via Meta-Learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 29. 2015. https:\/\/ojs.aaai.org\/index.php\/AAAI\/article\/view\/9354.","DOI":"10.1609\/aaai.v29i1.9354"},{"key":"687_CR31","unstructured":"Drori I, Krishnamurthy Y, Louren\u00e7o R, Rampin R, Cho K, Silva C, Freire J. Automatic Machine Learning by Pipeline Synthesis Using Model-Based Reinforcement Learning and a Grammar. arxiv:1905.10345"},{"key":"687_CR32","unstructured":"Li L, Jamieson KG, DeSalvo G, Rostamizadeh A, Talwalkar A. Efficient hyperparameter optimization and infinitely many armed bandits. CoRR abs\/1603.06560 (2016). arxiv:1603.06560."},{"key":"687_CR33","doi-asserted-by":"publisher","unstructured":"das D\u00f4res SN, Alves L, Ruiz DD, Barros RC. A meta-learning framework for algorithm recommendation in software fault prediction. In: Proceedings of the 31st Annual ACM Symposium on Applied Computing. SAC \u201916, pp. 1486\u20131491. Association for Computing Machinery. https:\/\/doi.org\/10.1145\/2851613.2851788.","DOI":"10.1145\/2851613.2851788"},{"key":"687_CR34","doi-asserted-by":"publisher","unstructured":"Cohen-Shapira N, Rokach L, Shapira B, Katz G, Vainshtein R. AutoGRD: Model Recommendation Through Graphical Dataset Representation. In: Proceedings of the 28th ACM International Conference on Information and Knowledge Management. CIKM \u201919, pp. 821\u2013830. https:\/\/doi.org\/10.1145\/3357384.3357896.","DOI":"10.1145\/3357384.3357896"},{"key":"687_CR35","doi-asserted-by":"publisher","DOI":"10.1007\/s10044-012-0280-z","author":"M Reif","year":"2012","unstructured":"Reif M, Shafait F, Goldstein M, Breuel T, Dengel A. Automatic classifier selection for non-experts. Pattern Anal Appl. 2012. https:\/\/doi.org\/10.1007\/s10044-012-0280-z.","journal-title":"Pattern Anal Appl"},{"key":"687_CR36","doi-asserted-by":"publisher","unstructured":"Pinto F, Soares C, Mendes-Moreira Ja. Towards Automatic Generation of Metafeatures. In: PAKDD. https:\/\/doi.org\/10.1007\/978-3-319-31753-3_18.","DOI":"10.1007\/978-3-319-31753-3_18"},{"key":"687_CR37","doi-asserted-by":"publisher","unstructured":"Katz G, Shin EC, Song D. ExploreKit: Automatic Feature Generation and Selection. 2016 IEEE 16th International Conference on Data Mining (ICDM). https:\/\/doi.org\/10.1109\/ICDM.2016.0123.","DOI":"10.1109\/ICDM.2016.0123"},{"key":"687_CR38","doi-asserted-by":"publisher","unstructured":"Vainshtein R, Greenstein-Messica A, Katz G, Shapira B, Rokach L. A Hybrid Approach for Automatic Model Recommendation. In: Proceedings of the 27th ACM International Conference on Information and Knowledge Management. CIKM \u201918, pp. 1623\u20131626. https:\/\/doi.org\/10.1145\/3269206.3269299.","DOI":"10.1145\/3269206.3269299"},{"issue":"2","key":"687_CR39","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1023\/A:1019956318069","volume":"18","author":"R Vilalta","year":"2002","unstructured":"Vilalta R, Drissi Y. A Perspective View and Survey of Meta-Learning. Artif Intell Rev. 2002;18(2):77\u201395. https:\/\/doi.org\/10.1023\/A:1019956318069.","journal-title":"Artif Intell Rev"},{"issue":"4","key":"687_CR40","doi-asserted-by":"publisher","first-page":"1249","DOI":"10.1002\/widm.1249","volume":"8","author":"O Sagi","year":"2018","unstructured":"Sagi O, Rokach L. Ensemble learning: A survey. WIREs Data Mining and Knowledge Discovery. 2018;8(4):1249. https:\/\/doi.org\/10.1002\/widm.1249.","journal-title":"WIREs Data Mining and Knowledge Discovery"},{"key":"687_CR41","unstructured":"Santoro A, Bartunov S, Botvinick M, Wierstra D, Lillicrap T. Meta-learning with memory-augmented neural networks. In: Proceedings of the 33rd International Conference on International Conference on Machine Learning - Volume 48. ICML\u201916, pp. 1842\u20131850. JMLR.org."},{"key":"687_CR42","doi-asserted-by":"publisher","first-page":"203","DOI":"10.1016\/j.ins.2018.10.043","volume":"477","author":"A Bruno","year":"2018","unstructured":"Bruno A. Pimentel and Andr\u00e9 Carlos Ponce de Leon Ferreira de Carvalho: A new data characterization for selecting clustering algorithms using meta-learning. Inform Sci. 2018;477:203\u201319. https:\/\/doi.org\/10.1016\/j.ins.2018.10.043.","journal-title":"Inform Sci"},{"key":"687_CR43","unstructured":"Rendell L, Seshu R, Tcheng D. Layered concept-learning and dynamically-variable bias management. In: Proceedings of IJCAI-87, pp. 308\u2013314. Morgan Kaufmann."},{"key":"687_CR44","unstructured":"Pfahringer B. Tell me who can learn you and I can tell you who you are: Landmarking Various Learning Algorithms. https:\/\/www.semanticscholar.org\/paper\/Tell-me-who-can-learn-you-and-I-can-tell-you-who-Pfahringer-Bensusan\/78e71a6a649dd6778bb1c0923f626d6573cc2b06."},{"key":"687_CR45","unstructured":"Michie D, Spiegelhalter DJ, Taylor CC, Campbell J (eds). Machine Learning, Neural and Statistical Classification. Ellis Horwood. 1995."},{"key":"687_CR46","doi-asserted-by":"publisher","unstructured":"Souza BF. Meta-aprendizagem Aplicada \u00e0 Classifica\u00e7\u00e3o de Dados de Express\u00e3o G\u00eanica. https:\/\/doi.org\/10.11606\/T.55.2010.tde-04012011-142551.","DOI":"10.11606\/T.55.2010.tde-04012011-142551"},{"key":"687_CR47","doi-asserted-by":"publisher","first-page":"181","DOI":"10.1016\/j.ins.2014.12.044","volume":"301","author":"DG Ferrari","year":"2015","unstructured":"Ferrari DG, de Castro LN. Clustering algorithm selection by meta-learning systems: A new distance-based problem characterization and ranking combination methods. Inform Sci. 2015;301:181\u201394. https:\/\/doi.org\/10.1016\/j.ins.2014.12.044.","journal-title":"Inform Sci"},{"issue":"1","key":"687_CR48","doi-asserted-by":"publisher","first-page":"4547","DOI":"10.1038\/srep04547","volume":"4","author":"ON Yaveroglu","year":"2014","unstructured":"Yaveroglu ON, Malod-Dognin N, Davis D, Levnajic Z, Janjic V, Karapandza R, Stojmirovic A, Pr\u017eulj N. Revealing the Hidden Language of Complex Networks. Sci Rep. 2014;4(1):4547. https:\/\/doi.org\/10.1038\/srep04547.","journal-title":"Sci Rep"},{"issue":"4","key":"687_CR49","doi-asserted-by":"publisher","first-page":"697","DOI":"10.1515\/amcs-2017-0048","volume":"27","author":"B Bilalli","year":"2017","unstructured":"Bilalli B, Abello A, Aluja-Banet T. On the predictive power of meta-features in OpenML. Int J Appl Math Computer Sci. 2017;27(4):697\u2013712. https:\/\/doi.org\/10.1515\/amcs-2017-0048.","journal-title":"Int J Appl Math Computer Sci"},{"issue":"6","key":"687_CR50","doi-asserted-by":"publisher","first-page":"417","DOI":"10.1037\/h0071325","volume":"24","author":"H Hotelling","year":"1993","unstructured":"Hotelling H. Analysis of a complex of statistical variables into principal components. J Educ Psychol. 1993;24(6):417\u201341. https:\/\/doi.org\/10.1037\/h0071325.","journal-title":"J Educ Psychol"},{"issue":"1","key":"687_CR51","doi-asserted-by":"publisher","first-page":"28","DOI":"10.1186\/s40537-020-00305-w","volume":"7","author":"JT Hancock","year":"2020","unstructured":"Hancock JT, Khoshgoftaar TM. Survey on categorical data for neural networks. J Big Data. 2020;7(1):28. https:\/\/doi.org\/10.1186\/s40537-020-00305-w.","journal-title":"J Big Data"},{"key":"687_CR52","unstructured":"Alcoba\u00e7a E, Siqueira F, Rivolli A, Garcia LPF, Oliva JT, de Carvalho ACPLF. MFE: Towards reproducible meta-feature extraction. J Mach Learning Res 2020;21(111), 1\u20135."},{"key":"687_CR53","unstructured":"Cohen-Shapira N, Rokach L. Automatic Selection of Clustering Algorithms Using Supervised Graph Embedding. arxiv:2011.08225."}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-023-00687-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s40537-023-00687-7\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-023-00687-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,3]],"date-time":"2023-02-03T16:05:24Z","timestamp":1675440324000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-023-00687-7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,2,3]]},"references-count":53,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["687"],"URL":"https:\/\/doi.org\/10.1186\/s40537-023-00687-7","relation":{},"ISSN":["2196-1115"],"issn-type":[{"value":"2196-1115","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,2,3]]},"assertion":[{"value":"9 December 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 January 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 February 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"14"}}