{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,1]],"date-time":"2026-04-01T17:58:55Z","timestamp":1775066335601,"version":"3.50.1"},"reference-count":36,"publisher":"Oxford University Press (OUP)","issue":"8","license":[{"start":{"date-parts":[[2021,5,8]],"date-time":"2021-05-08T00:00:00Z","timestamp":1620432000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100007257","name":"National Institute of Science and Technology for Software Engineering","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100007257","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100006162","name":"Pernambuco State Science and Technology Support Foundation","doi-asserted-by":"publisher","award":["APQ-0399-1.03\/17"],"award-info":[{"award-number":["APQ-0399-1.03\/17"]}],"id":[{"id":"10.13039\/501100006162","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002322","name":"Coordination for the Improvement of Higher Education Personnel","doi-asserted-by":"publisher","award":["88887.136410\/2017-00"],"award-info":[{"award-number":["88887.136410\/2017-00"]}],"id":[{"id":"10.13039\/501100002322","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003593","name":"National Council for Scientific and Technological Development","doi-asserted-by":"publisher","award":["465614\/2014-0"],"award-info":[{"award-number":["465614\/2014-0"]}],"id":[{"id":"10.13039\/501100003593","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2022,8,11]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Travel delays and bus overcrowding are some of the daily dissatisfactions of public transportation users. These problems may be caused by bus bunching, an event that occurs when two or more buses are running the same route together, i.e. out of schedule. Due to the stochastic nature of the traffic, a static schedule is not effective to avoid the occurrence of these events; thus, preventive actions are necessary to improve the reliability of the public transportation system. In this context, we propose a decision tree ensemble model to predict bus bunching. We use an ensemble of Random Forest, eXtreme Gradient Boosting and Categorical Boosting models applied to Global Positioning System, General Transit Feed Specification, weather and traffic situation data. The efficacy of the proposed model has been demonstrated using real data sets and has been compared with four baselines: Linear Regression, Logistic Regression, Support Vector Machine and Relevance Vector Machine. According to the results, the proposed model can achieve an efficacy between 74 and 80% and can be used to predict bus bunching in real time up to 10 stops before its occurrence.<\/jats:p>","DOI":"10.1093\/comjnl\/bxab045","type":"journal-article","created":{"date-parts":[[2021,4,2]],"date-time":"2021-04-02T11:08:45Z","timestamp":1617361725000},"page":"2044-2062","source":"Crossref","is-referenced-by-count":7,"title":["A Decision Tree Ensemble Model for Predicting Bus Bunching"],"prefix":"10.1093","volume":"65","author":[{"given":"Veruska","family":"Borges Santos","sequence":"first","affiliation":[{"name":"Department of Computer Science , Federal University of Campina Grande, Campina Grande, 58100-000, Para\u00edba, Brazil"}]},{"given":"Carlos Eduardo","family":"S Pires","sequence":"additional","affiliation":[{"name":"Department of Computer Science , Federal University of Campina Grande, Campina Grande, 58100-000, Para\u00edba, Brazil"}]},{"given":"Dimas","family":"Cassimiro Nascimento","sequence":"additional","affiliation":[{"name":"Department of Computer Science , Federal University of Campina Grande, Campina Grande, 58100-000, Para\u00edba, Brazil"},{"name":"Universidade Federal do Agreste de Pernambuco , Garanhuns, 55290-000, Pernambuco, Brazil"}]},{"given":"Andreza Raquel M","family":"de Queiroz","sequence":"additional","affiliation":[{"name":"Department of Computer Science , Federal University of Campina Grande, Campina Grande, 58100-000, Para\u00edba, Brazil"}]}],"member":"286","published-online":{"date-parts":[[2021,5,8]]},"reference":[{"key":"2022081612372204000_ref1","article-title":"\u00d4nibus urbano perde tr\u00eas milh\u00f5es de passageiros por dia","author":"NTU","year":"2017"},{"key":"2022081612372204000_ref2","article-title":"Financiamento do custeio do transporte p\u00fablico coletivo urbano","author":"dos Santos","year":"2018"},{"key":"2022081612372204000_ref3","first-page":"1","article-title":"Spatiotemporal clustering and analysis of road accident hotspots by exploiting GIS technology and Kernel density estimation","volume":"00","author":"Kazmi","year":"2020","journal-title":"Comput. J."},{"key":"2022081612372204000_ref4","doi-asserted-by":"crossref","first-page":"460","DOI":"10.1016\/j.asoc.2016.06.031","article-title":"An online learning approach to eliminate bus bunching in real-time","volume":"47","author":"Moreira-Matias","year":"2016","journal-title":"Appl. Soft Comput."},{"key":"2022081612372204000_ref5","doi-asserted-by":"crossref","first-page":"1772","DOI":"10.1109\/TITS.2016.2620483","article-title":"Probabilistic prediction of bus headway using relevance vector machine regression","volume":"18","author":"Yu","year":"2017","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"2022081612372204000_ref6","first-page":"1","article-title":"On the tradeoff between sensitivity and specificity in bus bunching prediction","volume":"24","author":"Sun","year":"2020","journal-title":"J. Intell. Transp. Syst."},{"key":"2022081612372204000_ref7","volume-title":"Data Mining: Practical Machine Learning Tools and Techniques","author":"Witten","year":"2005"},{"key":"2022081612372204000_ref8","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1016\/j.trb.2017.06.013","article-title":"A predictive-control framework to address bus bunching","volume":"104","author":"Andres","year":"2017","journal-title":"Transport. Res. Part B Meth."},{"key":"2022081612372204000_ref9","doi-asserted-by":"crossref","first-page":"300","DOI":"10.1016\/j.trb.2016.07.005","article-title":"Bus bunching along a corridor served by two lines","volume":"93","author":"Schm\u00f6cker","year":"2016","journal-title":"Transport. Res. Part B Meth"},{"key":"2022081612372204000_ref10","doi-asserted-by":"crossref","first-page":"532","DOI":"10.1080\/15472450.2016.1154764","article-title":"Developing travel time estimation methods using sparse GPS data","volume":"20","author":"Sanaullah","year":"2016","journal-title":"J. Intell. Transport. Syst."},{"key":"2022081612372204000_ref11","doi-asserted-by":"crossref","first-page":"3630","DOI":"10.1109\/TITS.2018.2846036","article-title":"Estimating inefficiency in bus trip choices from a user perspective with schedule, positioning, and ticketing data","volume":"19","author":"Braz","year":"2018","journal-title":"IEEE Trans. Intell. Transport. Syst."},{"key":"2022081612372204000_ref12","doi-asserted-by":"crossref","first-page":"1964","DOI":"10.1109\/TITS.2019.2909107","article-title":"An ensemble learning-based vehicle steering detector using smartphones","volume":"21","author":"Ouyang","year":"2019","journal-title":"IEEE Trans. Intell. Transport. Syst."},{"key":"2022081612372204000_ref13","first-page":"73","article-title":"Tree-based machine learning methods for survey research","volume":"13","author":"Kern","year":"2019","journal-title":"Surv. Res. Methods"},{"key":"2022081612372204000_ref14","first-page":"84","article-title":"The Impact of Adverse Weather Conditions on Urban Bus Performance Measures","volume-title":"Proc. 8th Int. IEEE Conf. Intelligent Transportation Systems","author":"Hofmann","year":"2005"},{"key":"2022081612372204000_ref15","doi-asserted-by":"crossref","first-page":"60","DOI":"10.1175\/2009WCAS1014.1","article-title":"Assessing the impact of weather on traffic intensity","volume":"2","author":"Cools","year":"2010","journal-title":"Weather Clim Soc"},{"key":"2022081612372204000_ref16","first-page":"4330","article-title":"Empirical Findings of Bus Bunching Distributions and Attributes Using Archived AVL\/APC Bus Data","volume-title":"11th Int. Conf. Chinese Transportation Professionals (ICCTP)","author":"Feng","year":"2011"},{"key":"2022081612372204000_ref17","doi-asserted-by":"crossref","first-page":"43","DOI":"10.5121\/ijdkp.2012.2504","article-title":"Incremental learning: areas and methods-a survey","volume":"2","author":"Joshi","year":"2012","journal-title":"Int. J. Data Min. Knowl. Manag. Process"},{"key":"2022081612372204000_ref18","first-page":"388","article-title":"Maintaining a Bus Schedule","volume-title":"2nd Conf. Australian Road Research Board (ARRB)","author":"Newell","year":"1964"},{"key":"2022081612372204000_ref19","doi-asserted-by":"crossref","first-page":"1444","DOI":"10.1016\/j.trpro.2017.05.170","article-title":"Using automatic vehicle location data to model and identify determinants of bus bunching","volume":"25","author":"Rashidi","year":"2017","journal-title":"Transport. Res. Proc."},{"key":"2022081612372204000_ref20","doi-asserted-by":"crossref","first-page":"332","DOI":"10.1080\/15472450.2018.1494596","article-title":"Modeling bus bunching using massive location and fare collection data","volume":"23","author":"Arriagada","year":"2019","journal-title":"J. Intell. Transport. Syst."},{"key":"2022081612372204000_ref21","first-page":"1","article-title":"Data as a Resource: Real-Time Predictive Analytics for Bus Bunching","volume-title":"Proc. Annual Meeting of the Transportation Research Board","author":"Nair","year":"2014"},{"key":"2022081612372204000_ref22","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1016\/j.trc.2016.09.007","article-title":"Headway-based bus bunching prediction using transit smart card data","volume":"72","author":"Yu","year":"2016","journal-title":"Transp. Res. Part C Emerg. Technol."},{"key":"2022081612372204000_ref23","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1111\/j.1467-8667.2007.00494.x","article-title":"Estimating a rail passenger trip origin-destination matrix using automatic data collection systems","volume":"22","author":"Zhao","year":"2007","journal-title":"Comput. Aided Civ. Inf. Eng."},{"key":"2022081612372204000_ref24","doi-asserted-by":"crossref","first-page":"341","DOI":"10.1007\/s12469-016-0137-8","article-title":"Identifying passenger flow characteristics and evaluating travel time reliability by visualizing AFC data: a case study of Shanghai metro","volume":"8","author":"Sun","year":"2016","journal-title":"Public Transp."},{"key":"2022081612372204000_ref25","first-page":"1542","article-title":"Applying the Support Vector Machine to Predicting Headway-based Bus Bunching","volume-title":"19th COTA Int. Conf. Transportation Professionals","author":"Yang","year":"2019"},{"key":"2022081612372204000_ref26","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1007\/BF00116251","article-title":"Induction of decision trees","volume":"1","author":"Quinlan","year":"1986","journal-title":"Mach. Learn."},{"key":"2022081612372204000_ref27","volume-title":"Classification and Regression Trees","author":"Breiman","year":"1984"},{"key":"2022081612372204000_ref28","first-page":"278","article-title":"Random Decision Forests","volume-title":"Proc. 3rd Int. Conf. Document Analysis and Recognition","author":"Ho","year":"1995"},{"key":"2022081612372204000_ref29","doi-asserted-by":"crossref","first-page":"367","DOI":"10.1016\/S0167-9473(01)00065-2","article-title":"Stochastic gradient boosting","volume":"38","author":"Friedman","year":"2002","journal-title":"Comput. Stat. Data Anal."},{"key":"2022081612372204000_ref30","first-page":"785","article-title":"Xgboost: A Scalable Tree Boosting System","volume-title":"Proc. 22nd ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining","author":"Chen","year":"2016"},{"key":"2022081612372204000_ref31","article-title":"CatBoost: gradient boosting with categorical features support","volume-title":"CoRR","author":"Dorogush","year":"2018"},{"key":"2022081612372204000_ref32","first-page":"227","article-title":"An Incremental Probabilistic Model to Predict Bus Bunching in Real-Time","volume-title":"Int. Symp. Intelligent Data Analysis","author":"Moreira-Matias","year":"2014"},{"key":"2022081612372204000_ref33","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1093\/biomet\/30.1-2.81","article-title":"A new measure of rank correlation","volume":"30","author":"Kendall","year":"1938","journal-title":"Biometrika"},{"key":"2022081612372204000_ref34","doi-asserted-by":"crossref","first-page":"1261","DOI":"10.1016\/j.neucom.2017.06.084","article-title":"Incremental on-line learning: a review and comparison of state of the art algorithms","volume":"275","author":"Losing","year":"2018","journal-title":"Neurocomputing"},{"key":"2022081612372204000_ref35","first-page":"1135","article-title":"Why Should I Trust You? Explaining the Predictions of Any Classifier","volume-title":"Proc. 22nd ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining","author":"Ribeiro","year":"2016"},{"key":"2022081612372204000_ref36","first-page":"4765","volume-title":"Advances in Neural Information Processing Systems","author":"Lundberg","year":"2017"}],"container-title":["The Computer Journal"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/comjnl\/article-pdf\/65\/8\/2044\/45329784\/bxab045.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/comjnl\/article-pdf\/65\/8\/2044\/45329784\/bxab045.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,8,16]],"date-time":"2022-08-16T12:39:31Z","timestamp":1660653571000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/comjnl\/article\/65\/8\/2044\/6269132"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,8]]},"references-count":36,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2021,5,8]]},"published-print":{"date-parts":[[2022,8,11]]}},"URL":"https:\/\/doi.org\/10.1093\/comjnl\/bxab045","relation":{},"ISSN":["0010-4620","1460-2067"],"issn-type":[{"value":"0010-4620","type":"print"},{"value":"1460-2067","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2022,8]]},"published":{"date-parts":[[2021,5,8]]}}}