{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,21]],"date-time":"2026-03-21T13:25:26Z","timestamp":1774099526858,"version":"3.50.1"},"reference-count":36,"publisher":"MDPI AG","issue":"6","license":[{"start":{"date-parts":[[2025,6,12]],"date-time":"2025-06-12T00:00:00Z","timestamp":1749686400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Interdisciplinary Centre for Mathematical and Computational Modelling (ICM), University of Warsaw","award":["g98-2104"],"award-info":[{"award-number":["g98-2104"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Entropy"],"abstract":"<jats:p>This paper discusses the results of a study investigating how input data characteristics affect the performance of time-series classification models. In this experiment, we used 82 synthetically generated time-series datasets, created based on predefined functions with added noise. These datasets varied in structure, including differences in the number of classes and noise levels, while maintaining a consistent length and total number of observations. This design allowed us to systematically assess the influence of dataset characteristics on classification outcomes. Seven classification models were evaluated and their performance was compared using accuracy metrics, training time and memory requirements. According to the evaluation, the CNN Classifier achieved the best results, demonstrating the highest robustness to an increasing number of classes and noise. In contrast, the least effective model was the Catch22 Classifier. Overall, the performed research leads to the conclusion that as the number of classes and the level of noise in the data increase, all classification models become less effective, achieving lower accuracy metrics.<\/jats:p>","DOI":"10.3390\/e27060624","type":"journal-article","created":{"date-parts":[[2025,6,12]],"date-time":"2025-06-12T06:42:48Z","timestamp":1749710568000},"page":"624","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Simulation Study on How Input Data Affects Time-Series Classification Model Results"],"prefix":"10.3390","volume":"27","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5143-398X","authenticated-orcid":false,"given":"Maria","family":"Sadowska","sequence":"first","affiliation":[{"name":"Institute of Information Technology, Warsaw University of Life Sciences-SGGW, 02-787 Warszawa, Poland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6953-8907","authenticated-orcid":false,"given":"Krzysztof","family":"Gajowniczek","sequence":"additional","affiliation":[{"name":"Institute of Information Technology, Warsaw University of Life Sciences-SGGW, 02-787 Warszawa, Poland"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2025,6,12]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"445","DOI":"10.2217\/pme-2022-0014","article-title":"Artificial intelligence in healthcare: A primer for medical education in radiomics","volume":"19","author":"Waldman","year":"2022","journal-title":"Pers. Med."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Buntine, W., Grobelnik, M., Mladeni\u0107, D., and Shawe-Taylor, J. (2009). Kernels for Periodic Time Series Arising in Astronomy. Machine Learning and Knowledge Discovery in Databases, Proceedings of the European Conference, ECML PKDD 2009, Bled, Slovenia, 7\u201311 September 2009, Springer.","DOI":"10.1007\/978-3-642-04180-8"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Chen, J.F., Chen, W.L., Huang, C.P., Huang, S.H., and Chen, A.P. (2016, January 16\u201318). Financial Time-Series Data Analysis Using Deep Convolutional Neural Networks. Proceedings of the 2016 7th International Conference on Cloud Computing and Big Data (CCBD), Macau, China.","DOI":"10.1109\/CCBD.2016.027"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Gajowniczek, K., Bator, M., Zabkowski, T., Orlowski, A., and Loo, C.K. (2020). Simulation Study on the Electricity Data Streams Time Series Clustering. Energies, 13.","DOI":"10.3390\/en13040924"},{"key":"ref_5","first-page":"1","article-title":"aeon: A Python Toolkit for Learning from Time Series","volume":"25","author":"Middlehurst","year":"2024","journal-title":"J. Mach. Learn. Res."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"162","DOI":"10.21629\/JSEE.2017.01.18","article-title":"Convolutional neural networks for time series classification","volume":"28","author":"Zhao","year":"2017","journal-title":"J. Syst. Eng. Electron."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"1454","DOI":"10.1007\/s10618-020-00701-z","article-title":"ROCKET: Exceptionally fast and accurate time series classification using random convolutional kernels","volume":"34","author":"Dempster","year":"2019","journal-title":"Data Min. Knowl. Discov."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1821","DOI":"10.1007\/s10618-019-00647-x","article-title":"catch22: CAnonical Time-series CHaracteristics","volume":"33","author":"Lubba","year":"2019","journal-title":"Data Min. Knowl. Discov."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Dhariyal, B., Le Nguyen, T., and Ifrim, G. (2023). Back to Basics: A Sanity Check on Modern Time Series Classification Algorithms. Advanced Analytics and Learning on Temporal Data, Proceedings of the 8th ECML PKDD Workshop, AALTD 2023, Turin, Italy, 18\u201322 September 2023, Springer.","DOI":"10.1007\/978-3-031-49896-1_14"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1958","DOI":"10.1007\/s10618-024-01022-1","article-title":"Bake off redux: A review and experimental evaluation of recent time series classification algorithms","volume":"38","author":"Middlehurst","year":"2024","journal-title":"Data Min. Knowl. Discov."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"721","DOI":"10.1007\/s42452-020-2506-9","article-title":"Time series classification: Nearest neighbor versus deep learning models","volume":"2","author":"Jiang","year":"2020","journal-title":"SN Appl. Sci."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Taga, E.O., Ildiz, M.E., and Oymak, S. (2025). TimePFN: Effective Multivariate Time Series Forecasting with Synthetic Data. arXiv.","DOI":"10.1609\/aaai.v39i19.34288"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Pei, H., Ren, K., Yang, Y., Liu, C., Qin, T., and Li, D. (2021, January 7\u201310). Towards Generating Real-World Time Series Data. Proceedings of the 2021 IEEE International Conference on Data Mining (ICDM), Auckland, New Zealand.","DOI":"10.1109\/ICDM51629.2021.00058"},{"key":"ref_14","unstructured":"Qian, J., Xie, B., Wan, B., Li, M., Sun, M., and Chiang, P.Y. (2024). TimeLDM: Latent Diffusion Model for Unconditional Time Series Generation. arXiv."},{"key":"ref_15","unstructured":"Coletta, A., Gopalakrishan, S., Borrajo, D., and Vyetrenko, S. (2023). On the Constrained Time-Series Generation Problem. arXiv."},{"key":"ref_16","unstructured":"Vergara, S. (2013). A Synthesizer Based on Frequency-Phase Analysis and Square Waves. arXiv."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Sikder, M.F., Ramachandranpillai, R., and Heintz, F. (2024). TransFusion: Generating Long, High Fidelity Time Series using Diffusion Models with Transformers. arXiv.","DOI":"10.1016\/j.mlwa.2025.100652"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"112917","DOI":"10.1016\/j.knosys.2024.112917","article-title":"Boundary-enhanced time series data imputation with long-term dependency diffusion models","volume":"310","author":"Xiao","year":"2025","journal-title":"Knowl.-Based Syst."},{"key":"ref_19","unstructured":"Das, A., Kong, W., Sen, R., and Zhou, Y. (2024). A decoder-only foundation model for time-series forecasting. arXiv."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Xiao, C., Gou, Z., Tai, W., Zhang, K., and Zhou, F. (2023, January 6\u201310). Imputation-based Time-Series Anomaly Detection with Conditional Weight-Incremental Diffusion Models. Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Long Beach, CA, USA.","DOI":"10.1145\/3580305.3599391"},{"key":"ref_21","unstructured":"Nie, Y., Nguyen, N.H., Sinthong, P., and Kalagnanam, J. (2023). A Time Series is Worth 64 Words: Long-term Forecasting with Transformers. arXiv."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Sch\u00e4fer, P., and Leser, U. (2017, January 6\u201310). Fast and Accurate Time Series Classification with WEASEL. Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. Association for Computing Machinery, Singapore.","DOI":"10.1145\/3132847.3132980"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"72","DOI":"10.1016\/j.neucom.2018.03.067","article-title":"Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests (tsfresh\u2014A Python package)","volume":"307","author":"Christ","year":"2018","journal-title":"Neurocomputing"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"142","DOI":"10.1016\/j.ins.2013.02.030","article-title":"A Time Series Forest for Classification and Feature Extraction","volume":"239","author":"Deng","year":"2013","journal-title":"Inf. Sci."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Dissanayake, O., McPherson, S., Allyndree, J., Kennedy, E., Cunningham, P., and Riaboff, L. (2024). Evaluating ROCKET and Catch22 features for calf behaviour classification from accelerometer data using Machine Learning models. arXiv.","DOI":"10.2139\/ssrn.4937129"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"1505","DOI":"10.1007\/s10618-014-0377-7","article-title":"The BOSS is concerned with time series classification in the presence of noise","volume":"29","year":"2015","journal-title":"Data Min. Knowl. Discov."},{"key":"ref_27","unstructured":"Sofia, V. (2024). Analysis and Innovation in Time Series Classification Algorithms and Methods: Towards a Superior Convolutional Algorithm with Feature Selection. [Master\u2019s Thesis, Aristotle University of Thessaloniki]."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Middlehurst, M., Large, J., and Bagnall, A. (2020, January 10\u201313). The Canonical Interval Forest (CIF) Classifier for Time Series Classification. Proceedings of the 2020 IEEE International Conference on Big Data, Big Data 2020, Atlanta, GA, USA.","DOI":"10.1109\/BigData50022.2020.9378424"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"1","DOI":"10.5121\/ijdkp.2015.5201","article-title":"A Review on Evaluation Metrics for Data Classification Evaluations","volume":"5","author":"Hossin","year":"2015","journal-title":"Int. J. Data Min. Knowl. Manag. Process"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"168","DOI":"10.1016\/j.aci.2018.08.003","article-title":"Classification assessment methods","volume":"17","author":"Tharwat","year":"2020","journal-title":"Appl. Comput. Inform."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"861","DOI":"10.1016\/j.patrec.2005.10.010","article-title":"An introduction to ROC analysis","volume":"27","author":"Fawcett","year":"2006","journal-title":"Pattern Recognit. Lett."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1016\/j.patcog.2019.02.023","article-title":"The impact of class imbalance in classification performance metrics based on the binary confusion matrix","volume":"91","author":"Luque","year":"2019","journal-title":"Pattern Recognit."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Caruana, R., and Niculescu-Mizil, A. (2004, January 22\u201325). Data mining in metric space: An empirical analysis of supervised learning performance criteria. Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD \u201904, Seattle, WA, USA.","DOI":"10.1145\/1014052.1014063"},{"key":"ref_34","first-page":"75259","article-title":"Jaccard Metric Losses: Optimizing the Jaccard Index with Soft Labels","volume":"Volume 36","author":"Oh","year":"2023","journal-title":"Advances in Neural Information Processing Systems"},{"key":"ref_35","first-page":"599","article-title":"Classification Model Evaluation Metrics","volume":"12","author":"Vujovic","year":"2021","journal-title":"Int. J. Adv. Comput. Sci. Appl."},{"key":"ref_36","unstructured":"Ferrer, L. (2022). Analysis and Comparison of Classification Metrics. arXiv."}],"container-title":["Entropy"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1099-4300\/27\/6\/624\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T17:50:37Z","timestamp":1760032237000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1099-4300\/27\/6\/624"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,6,12]]},"references-count":36,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2025,6]]}},"alternative-id":["e27060624"],"URL":"https:\/\/doi.org\/10.3390\/e27060624","relation":{},"ISSN":["1099-4300"],"issn-type":[{"value":"1099-4300","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,6,12]]}}}