{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,5]],"date-time":"2026-05-05T17:42:37Z","timestamp":1778002957269,"version":"3.51.4"},"reference-count":45,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,1,9]],"date-time":"2024-01-09T00:00:00Z","timestamp":1704758400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,1,9]],"date-time":"2024-01-09T00:00:00Z","timestamp":1704758400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100010418","name":"Institute for Information and Communications Technology Promotion","doi-asserted-by":"publisher","award":["IITP-2023-RS-2023-00259497"],"award-info":[{"award-number":["IITP-2023-RS-2023-00259497"]}],"id":[{"id":"10.13039\/501100010418","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003725","name":"National Research Foundation of Korea","doi-asserted-by":"publisher","award":["NRF-2023S1A5A8075518"],"award-info":[{"award-number":["NRF-2023S1A5A8075518"]}],"id":[{"id":"10.13039\/501100003725","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>In this study, we utilize data-driven approaches to predict flight departure delays. The growing demand for air travel is outpacing the capacity and infrastructure available to support it. In addition, abnormal weather patterns caused by climate change contribute to the frequent occurrence of flight delays. In light of the extensive network of international flights covering vast distances across continents and oceans, the importance of forecasting flight delays over extended time periods becomes increasingly evident. Existing research has predominantly concentrated on short-term predictions, prompting our study to specifically address this aspect. We collected datasets spanning over 10 years from three different airports such as ICN airport in South Korea, JFK and MDW airport in the United States, capturing flight information at six different time intervals (2, 4, 8, 16, 24, and 48\u00a0h) prior to flight departure. The datasets comprise 1,569,879 instances for ICN, 773,347 for JFK, and 404,507 for MDW, respectively. We employed a range of machine learning and deep learning approaches, including Decision Tree, Random Forest, Support Vector Machine, K-nearest neighbors, Logistic Regression, Extreme Gradient Boosting, and Long Short-Term Memory, to predict flight delays. Our models achieved accuracy rates of 0.749 for ICN airport, 0.852 for JFK airport, and 0.785 for MDW airport in 2-h predictions. Furthermore, for 48-h predictions, our models achieved accuracy rates of 0.748 for ICN airport, 0.846 for JFK airport, and 0.772 for MDW airport based on our experimental results. Consequently, we have successfully validated the accuracy of flight delay predictions for longer time frames. The implications and future research directions derived from these findings are also discussed.<\/jats:p>","DOI":"10.1186\/s40537-023-00867-5","type":"journal-article","created":{"date-parts":[[2024,1,9]],"date-time":"2024-01-09T13:02:33Z","timestamp":1704805353000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":32,"title":["Prediction of flight departure delays caused by weather conditions adopting data-driven approaches"],"prefix":"10.1186","volume":"11","author":[{"given":"Seongeun","family":"Kim","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Eunil","family":"Park","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2024,1,9]]},"reference":[{"key":"867_CR1","unstructured":"Economics-IATA: air passenger market analysis 2014. 2014."},{"key":"867_CR2","unstructured":"Economics-IATA: air passenger market analysis 2023. 2023. https:\/\/www.iata.org\/en\/iata-repository\/publications\/economic-reports\/air-passenger-market-analysis2\/."},{"key":"867_CR3","doi-asserted-by":"publisher","first-page":"e0219","DOI":"10.5028\/jatm.v11.977","volume":"11","author":"M Efthymiou","year":"2018","unstructured":"Efthymiou M, Njoya ET, Lo PL, Papatheodorou A, Randall D. The impact of delays on customers\u2019 satisfaction: an empirical analysis of the British airways on-time performance at Heathrow airport. J Aerosp Technol Manag. 2018;11:e0219.","journal-title":"J Aerosp Technol Manag"},{"key":"867_CR4","doi-asserted-by":"crossref","unstructured":"Tileag\u0103 C, Oprisan O. Flights delay compensation 261\/2004: a challenge for airline companies? In: Organizations and Performance in a complex world: 26th international economic conference of Sibiu (IECS) 26. Springer; 2021. p. 335\u201344.","DOI":"10.1007\/978-3-030-50676-6_27"},{"key":"867_CR5","unstructured":"U.S.D. of transportation, airline on-time performance data. 2023. https:\/\/www.transtats.bts.gov\/tables.asp?QO_VQ=EFD &QO_anzr=Nv4yv0r."},{"issue":"4","key":"867_CR6","first-page":"461","volume":"9","author":"G-M Sim","year":"2018","unstructured":"Sim G-M, Kim Y-S, Jung M-P, Kim J-W, Park M-S, Hong S-H, Kang K-K. Changes in the frequency of abnormal weather events in South Korea in recent years. J Korean Soc Clim Change. 2018;9(4):461\u201370.","journal-title":"J Korean Soc Clim Change"},{"key":"867_CR7","unstructured":"Lee J-W, Yoo H-I, Kim G-H. Analysis of South Korea\u2019s heavy rain characteristics from 2006 to 2015 using AWS data. In: Proceedings of the Korean meteorological society conference. 2016. p. 521\u20132."},{"key":"867_CR8","doi-asserted-by":"crossref","unstructured":"Kim YJ, Choi S, Briceno S, Mavris D. A deep learning approach to flight delay prediction. In: 2016 IEEE\/AIAA 35th digital avionics systems conference (DASC). IEEE; 2016. p. 1\u20136.","DOI":"10.1109\/DASC.2016.7778092"},{"issue":"5","key":"867_CR9","first-page":"2689","volume":"26","author":"H Khaksar","year":"2019","unstructured":"Khaksar H, Sheikholeslami A. Airline delay prediction by machine learning algorithms. Scientia Iranica. 2019;26(5):2689\u2013702.","journal-title":"Scientia Iranica"},{"issue":"3","key":"867_CR10","doi-asserted-by":"publisher","first-page":"494","DOI":"10.3390\/math11030494","volume":"11","author":"J Qu","year":"2023","unstructured":"Qu J, Wu S, Zhang J. Flight delay propagation prediction based on deep learning. Mathematics. 2023;11(3):494.","journal-title":"Mathematics"},{"issue":"part 1","key":"867_CR11","first-page":"390","volume":"35","author":"SM Al-Tabbakh","year":"2018","unstructured":"Al-Tabbakh SM, El-Zahed H. Machine learning techniques for analysis of Egyptian flight delay. J Sci Res Sci. 2018;35(part 1):390\u20139.","journal-title":"J Sci Res Sci"},{"issue":"7","key":"867_CR12","doi-asserted-by":"publisher","first-page":"2749","DOI":"10.3390\/su12072749","volume":"12","author":"B Ye","year":"2020","unstructured":"Ye B, Liu B, Tian Y, Wan L. A methodology for predicting aggregate flight departure delays in airports based on supervised learning. Sustainability. 2020;12(7):2749.","journal-title":"Sustainability"},{"issue":"6","key":"867_CR13","doi-asserted-by":"publisher","first-page":"1223","DOI":"10.16984\/saufenbilder.710107","volume":"24","author":"MC Atlio\u011flu","year":"2020","unstructured":"Atlio\u011flu MC, Bolat M, \u015eahin M, Tunali V, Kilin\u00e7 D. Supervised learning approaches to flight delay prediction. Sakarya Univ J Sci. 2020;24(6):1223\u201331.","journal-title":"Sakarya Univ J Sci"},{"key":"867_CR14","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40537-020-00380-z","volume":"7","author":"MF Yazdi","year":"2020","unstructured":"Yazdi MF, Kamel SR, Chabok SJM, Kheirabadi M. Flight delay prediction based on deep learning and Levenberg\u2013Marquart algorithm. J Big Data. 2020;7:1\u201328.","journal-title":"J Big Data"},{"key":"867_CR15","doi-asserted-by":"publisher","DOI":"10.1016\/j.tele.2023.102066","volume":"85","author":"J Lee","year":"2023","unstructured":"Lee J, Cha J, Park E. Data-driven approaches into political orientation and news outlet discrimination: the case of news articles in south korea. Telemat Inform. 2023;85: 102066.","journal-title":"Telemat Inform"},{"key":"867_CR16","doi-asserted-by":"crossref","unstructured":"Gao Z, Gatpandan MP, Gatpandan PH. Classification decision tree algorithm in predicting students\u2019 course preference. In: 2021 2nd international symposium on computer engineering and intelligent communications (ISCEIC). IEEE; 2021. p. 93\u20137.","DOI":"10.1109\/ISCEIC53685.2021.00026"},{"key":"867_CR17","doi-asserted-by":"crossref","unstructured":"Sharma A, Sharma M, Dwivedi R. Improved decision tree classification (IDT) algorithm for social media data. In: 2021 10th international conference on system modeling & advancement in research trends (SMART). IEEE; 2021. p. 155\u20137.","DOI":"10.1109\/SMART52563.2021.9676265"},{"issue":"6","key":"867_CR18","doi-asserted-by":"publisher","first-page":"2503","DOI":"10.1080\/13467581.2021.1972808","volume":"21","author":"E Kim","year":"2022","unstructured":"Kim E, Ji H, Kim J, Park E. Classifying apartment defect repair tasks in South Korea: a machine learning approach. J Asian Archit Build Eng. 2022;21(6):2503\u201310.","journal-title":"J Asian Archit Build Eng"},{"key":"867_CR19","doi-asserted-by":"crossref","unstructured":"Soumya A, Kumar GH. Classification of ancient epigraphs into different periods using random forests. In: 2014 fifth international conference on signal and image processing. IEEE; 2014. p. 171\u20138.","DOI":"10.1109\/ICSIP.2014.33"},{"key":"867_CR20","doi-asserted-by":"crossref","unstructured":"Ardiansyah D, Mantoro T, Syafei WA. Potential classification prediction of solar and wind energy in Indonesia using machine learning with random forest algorithm. In: 2022 5th international conference of computer and informatics engineering (IC2IE). IEEE; 2022. p. 297\u2013302.","DOI":"10.1109\/IC2IE56416.2022.9970129"},{"key":"867_CR21","doi-asserted-by":"publisher","DOI":"10.1016\/j.tele.2023.102001","volume":"82","author":"J Lee","year":"2023","unstructured":"Lee J, Park E. D-HRSP: dataset of helpful reviews for service providers. Telemat Inform. 2023;82:102001.","journal-title":"Telemat Inform"},{"key":"867_CR22","doi-asserted-by":"crossref","unstructured":"Fadhil IM, Sibaroni Y. Topic classification in Indonesian-language tweets using fast-text feature expansion with support vector machine (SVM). In: 2022 international conference on data science and its applications (ICoDSA). IEEE; 2022. p. 214\u20139.","DOI":"10.1109\/ICoDSA55874.2022.9862899"},{"key":"867_CR23","doi-asserted-by":"crossref","unstructured":"Charan PVS, Ramkumar G. Black fungus classification using Adaboost with SVM-based classifier and compare accuracy with support vector machine. In: 2022 5th international conference on contemporary computing and informatics (IC3I). IEEE; 2022. p. 1895\u2013901.","DOI":"10.1109\/IC3I56241.2022.10072559"},{"key":"867_CR24","doi-asserted-by":"publisher","first-page":"3205","DOI":"10.1007\/s13042-023-01828-3","volume":"14","author":"S Hwang","year":"2023","unstructured":"Hwang S, Ahn H, Park E. iMovieRec: a hybrid movie recommendation method based on a user-image-item model. Int J Mach Learn Cybern. 2023;14:3205\u201316.","journal-title":"Int J Mach Learn Cybern"},{"key":"867_CR25","doi-asserted-by":"crossref","unstructured":"Auleria M, Arrahmah AI, Saputra DE. A review on KN nearest neighbour based classification for object recognition. In: 2021 international conference on data science and its applications (ICoDSA). 2021; IEEE. p. 274\u201380.","DOI":"10.1109\/ICoDSA53588.2021.9617466"},{"key":"867_CR26","doi-asserted-by":"crossref","unstructured":"Kim S, An C, Cha J, Kim D, Park E. D-visa: a dataset for detecting visual sentiment from art images. In: Proceedings of the IEEE\/CVF international conference on computer vision. 2023. p. 3051\u20139.","DOI":"10.1109\/ICCVW60793.2023.00328"},{"issue":"15","key":"867_CR27","doi-asserted-by":"publisher","first-page":"5179","DOI":"10.3390\/app10155179","volume":"10","author":"M Akoulih","year":"2020","unstructured":"Akoulih M, Tigani S, Saadane R, Tazi A. Electrocoagulation based chromium removal efficiency classification using logistic regression. Appl Sci. 2020;10(15):5179.","journal-title":"Appl Sci"},{"issue":"2","key":"867_CR28","doi-asserted-by":"publisher","first-page":"298","DOI":"10.31577\/cai_2021_2_298","volume":"40","author":"X Guan","year":"2021","unstructured":"Guan X, Zhang J, Chen S. Logistic regression based on statistical learning model with linearized kernel for classification. Comput Inform. 2021;40(2):298\u2013317.","journal-title":"Comput Inform"},{"issue":"12","key":"867_CR29","doi-asserted-by":"publisher","first-page":"4187","DOI":"10.3390\/s21124187","volume":"21","author":"A Paleczek","year":"2021","unstructured":"Paleczek A, Grochala D, Rydosz A. Artificial breath classification using XGBoost algorithm for diabetes detection. Sensors. 2021;21(12):4187.","journal-title":"Sensors"},{"issue":"15","key":"867_CR30","doi-asserted-by":"publisher","first-page":"2435","DOI":"10.3390\/electronics11152435","volume":"11","author":"H Liang","year":"2022","unstructured":"Liang H, Li J, Wu H, Li L, Zhou X, Jiang X. Mammographic classification of breast cancer microcalcifications through extreme gradient boosting. Electronics. 2022;11(15):2435.","journal-title":"Electronics"},{"key":"867_CR31","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2022.108437","volume":"242","author":"S Lee","year":"2022","unstructured":"Lee S, Jeong D, Park E. MultiEmo: multi-task framework for emoji prediction. Knowl-Based Syst. 2022;242: 108437.","journal-title":"Knowl-Based Syst"},{"issue":"12","key":"867_CR32","first-page":"3855","volume":"16","author":"Y Hur","year":"2022","unstructured":"Hur Y. Malaysian name-based ethnicity classification using LSTM. KSII Trans Internet Inf Syst. 2022;16(12):3855\u201367.","journal-title":"KSII Trans Internet Inf Syst"},{"key":"867_CR33","doi-asserted-by":"crossref","unstructured":"Zerrouki N, Houacine A, Harrou F, Bouarroudj R, Cherifi MY, Sun Y. Exploiting deep learning-based LSTM classification for improving hand gesture recognition to enhance visitors\u2019 museum experiences. In: 2022 international conference on innovation and intelligence for informatics, computing, and technologies (3ICT). IEEE; 2022. p. 451\u20136.","DOI":"10.1109\/3ICT56508.2022.9990722"},{"key":"867_CR34","doi-asserted-by":"crossref","unstructured":"Madanan M, Venugopal A, Velayudhan NC. A hybrid anomaly based intrusion detection methodology using IWD for LSTM classification. In: 2020 IEEE international conference on advanced networks and telecommunications systems (ANTS). IEEE; 2020. p. 1\u20135.","DOI":"10.1109\/ANTS50601.2020.9342820"},{"key":"867_CR35","volume":"458","author":"S Lee","year":"2023","unstructured":"Lee S, Kim J, Kim D, Kim KJ, Park E. Computational approaches to developing the implicit media bias dataset: assessing political orientations of nonpolitical news articles. Appl Math Comput. 2023;458:128219.","journal-title":"Appl Math Comput"},{"key":"867_CR36","doi-asserted-by":"publisher","DOI":"10.1016\/j.tele.2023.101948","volume":"78","author":"S Lee","year":"2023","unstructured":"Lee S, Kim J, Park E. Can book covers help predict bestsellers using machine learning approaches? Telemat Inform. 2023;78: 101948.","journal-title":"Telemat Inform"},{"issue":"1","key":"867_CR37","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40537-022-00674-4","volume":"10","author":"E Park","year":"2023","unstructured":"Park E. CRNet: a multimodal deep convolutional neural network for customer revisit prediction. J Big Data. 2023;10(1):1\u201310.","journal-title":"J Big Data"},{"issue":"1","key":"867_CR38","doi-asserted-by":"publisher","first-page":"109","DOI":"10.1007\/s40558-022-00222-z","volume":"24","author":"S Oh","year":"2022","unstructured":"Oh S, Ji H, Kim J, Park E, del Pobil AP. Deep learning model based on expectation\u2013confirmation theory to predict customer satisfaction in hospitality service. Inform Technol Tour. 2022;24(1):109\u201326.","journal-title":"Inform Technol Tour"},{"key":"867_CR39","doi-asserted-by":"publisher","DOI":"10.1016\/j.tele.2022.101906","volume":"76","author":"H Yu","year":"2023","unstructured":"Yu H, Park E. A harmless webtoon for all: an automatic age-restriction prediction system for webtoon contents. Telemat Inform. 2023;76: 101906.","journal-title":"Telemat Inform"},{"key":"867_CR40","unstructured":"Incheon airport weather. https:\/\/data.kma.go.kr\/data\/air\/selectAmosRltmList.do?pgmNo=575 &tabNo=1."},{"key":"867_CR41","unstructured":"New York City weather. https:\/\/www.wunderground.com\/history\/daily\/us\/ny\/new-york-city\/KLGA."},{"key":"867_CR42","unstructured":"Chicago City weather. https:\/\/www.wunderground.com\/history\/daily\/us\/il\/chicago\/KMDW."},{"key":"867_CR43","unstructured":"Incheon air port flight. https:\/\/www.airport.kr\/co\/ko\/cpr\/statisticOfDelay.do."},{"key":"867_CR44","unstructured":"United States Department of Transport. https:\/\/www.transtats.bts.gov\/tables.asp?QO_VQ=EFD &QO_anzr=Nv4yv0r."},{"key":"867_CR45","doi-asserted-by":"crossref","unstructured":"Panda B, Adhikari RK. A method for classification of missing values using data mining techniques. In: 2020 international conference on computer science, engineering and applications (ICCSEA). IEEE; 2020. p. 1\u20135.","DOI":"10.1109\/ICCSEA49143.2020.9132935"}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-023-00867-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s40537-023-00867-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-023-00867-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,1,9]],"date-time":"2024-01-09T13:06:07Z","timestamp":1704805567000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-023-00867-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,1,9]]},"references-count":45,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["867"],"URL":"https:\/\/doi.org\/10.1186\/s40537-023-00867-5","relation":{},"ISSN":["2196-1115"],"issn-type":[{"value":"2196-1115","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,1,9]]},"assertion":[{"value":"9 June 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 December 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 January 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"11"}}