{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,25]],"date-time":"2025-10-25T14:21:39Z","timestamp":1761402099010,"version":"3.37.3"},"reference-count":25,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,10,13]],"date-time":"2020-10-13T00:00:00Z","timestamp":1602547200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,10,13]],"date-time":"2020-10-13T00:00:00Z","timestamp":1602547200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100006378","name":"Universitas Indonesia","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100006378","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Real-time information mining of a big dataset consisting of time series data is a very challenging task. For this purpose, we propose using the mean distance and the standard deviation to enhance the accuracy of the existing fast incremental model tree with the drift detection (FIMT-DD) algorithm. The standard FIMT-DD algorithm uses the Hoeffding bound as its splitting criterion. We propose the further use of the mean distance and standard deviation, which are used to split a tree more accurately than the standard method. We verify our proposed method using the large Traffic Demand Dataset, which consists of 4,000,000 instances; Tennet\u2019s big wind power plant dataset, which consists of 435,268 instances; and a road weather dataset, which consists of 30,000,000 instances. The results show that our proposed FIMT-DD algorithm improves the accuracy compared to the standard method and Chernoff bound approach. The measured errors demonstrate that our approach results in a lower Mean Absolute Percentage Error (MAPE) in every stage of learning by approximately 2.49% compared with the Chernoff Bound method and 19.65% compared with the standard method.<\/jats:p>","DOI":"10.1186\/s40537-020-00359-w","type":"journal-article","created":{"date-parts":[[2020,10,13]],"date-time":"2020-10-13T14:02:55Z","timestamp":1602597775000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Distance variable improvement of time-series big data stream evaluation"],"prefix":"10.1186","volume":"7","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2652-3227","authenticated-orcid":false,"given":"Ari","family":"Wibisono","sequence":"first","affiliation":[]},{"given":"Petrus","family":"Mursanto","sequence":"additional","affiliation":[]},{"given":"Jihan","family":"Adibah","sequence":"additional","affiliation":[]},{"given":"Wendy D. W. T.","family":"Bayu","sequence":"additional","affiliation":[]},{"given":"May Iffah","family":"Rizki","sequence":"additional","affiliation":[]},{"given":"Lintang Matahari","family":"Hasani","sequence":"additional","affiliation":[]},{"given":"Valian Fil","family":"Ahli","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2020,10,13]]},"reference":[{"key":"359_CR1","unstructured":"Adak, M. Fatih, and Mustafa Akpinar. 2018. A hybrid artificial bee colony algorithm using multiple linear regression on time-series datasets."},{"issue":"5","key":"359_CR2","doi-asserted-by":"publisher","first-page":"793","DOI":"10.3233\/ida-140669","volume":"18","author":"S Aghabozorgi","year":"2014","unstructured":"Aghabozorgi S, Wah TY. Clustering of Large Time Series Datasets. Intelligent Data Analysis. 2014;18(5):793\u2013817. https:\/\/doi.org\/10.3233\/ida-140669.","journal-title":"Intelligent Data Analysis"},{"issue":"1","key":"359_CR3","doi-asserted-by":"publisher","first-page":"128","DOI":"10.1007\/s10618-010-0201-y","volume":"23","author":"E Ikonomovska","year":"2011","unstructured":"Ikonomovska E, Gama J, D\u017eeroski S. Learning model trees from evolving data streams. Data Min Knowl Disc. 2011;23(1):128\u201368.","journal-title":"Data Min Knowl Disc"},{"issue":"301","key":"359_CR4","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1080\/01621459.1963.10500830","volume":"58","author":"W Hoeffding","year":"1963","unstructured":"Hoeffding W. Probability Inequalities for Sums of Bounded Random Variables. Journal of the American Statistical Association. 1963;58(301):13\u201330.","journal-title":"Journal of the American Statistical Association"},{"key":"359_CR5","doi-asserted-by":"crossref","unstructured":"Wibisono, A., Wisesa, H.A., Jatmiko, W., Mursanto, P., Sarwinda, D. 2016. Perceptron rule improvement on FIMT-DD for large traffic data stream. In: Proceedings of the International Joint Conference on Neural Networks. 2016; 5161\u20137.","DOI":"10.1109\/IJCNN.2016.7727881"},{"key":"359_CR6","unstructured":"Zhang, C., Bennett-type generalization bounds: Large-deviation case and faster rate of convergence. 2013. In: Uncertainty in Artificial Intelligence - Proceedings of the 29th Conference UAI 2013. 2013; 714\u201322."},{"key":"359_CR7","unstructured":"Beygelzimer, A., Langford, J., Lifshits, Y., Sorkin, G., Strehl, A. 2009. Conditional probability tree estimation analysis and algorithms. In: Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence UAI. 2009; 51\u20138."},{"key":"359_CR8","unstructured":"Balsubramani, A., Ramdas, A. 2016. Sequential nonparametric testing with the law of the iterated logarithm. 32nd Conference on Uncertainty in Artificial Intelligence 2016 UAI 2016. 42-51."},{"issue":"12","key":"359_CR9","doi-asserted-by":"publisher","first-page":"9508","DOI":"10.1109\/TVT.2016.2585575","volume":"65","author":"A Koesdwiady","year":"2016","unstructured":"Koesdwiady A, Soua R, Karray F. Improving traffic flow prediction with weather information in connected cars: a deep learning approach. IEEE Trans Veh Technol. 2016;65(12):9508\u201317.","journal-title":"IEEE Trans Veh Technol"},{"key":"359_CR10","doi-asserted-by":"crossref","unstructured":"Soua, R., Koesdwiady, A., Karray, F. 2016. Big-data-generated traffic flow prediction using deep learning and dempster-shafer theory. In: Proceedings of the International Joint Conference on Neural Networks. 2016; 3195\u2013202.","DOI":"10.1109\/IJCNN.2016.7727607"},{"key":"359_CR11","doi-asserted-by":"publisher","first-page":"33","DOI":"10.1016\/j.knosys.2015.10.028","volume":"93","author":"A Wibisono","year":"2016","unstructured":"Wibisono A, Jatmiko W, Wisesa HA, Hardjono B, Mursanto P. Traffic big data prediction and visualization using Fast Incremental Model Trees-Drift Detection (FIMT-DD). Knowl-Based Syst. 2016;93:33\u201346.","journal-title":"Knowl-Based Syst"},{"key":"359_CR12","unstructured":"Wibisono, A., Sina, I., Ihsannuddin, M.A., Hafizh, A., Hardjono, B., Nurhadiyatna, A., Jatmiko, W., Mursanto,.P. 2012. Traffic intelligent system architecture based on social media information, International Conference on Advanced Computer Science and Information Systems, ICACSIS. 2012; 25\u201330."},{"key":"359_CR13","unstructured":"Y. Lv, Y. Duan, W. Kang, Z. Li and F. Y. Wang. 2015. Traffic Flow Prediction with Big Data: A Deep Learning Approach. In: IEEE Transactions on Intelligent Transportation Systems. vol. 16, p. 865\u201373."},{"key":"359_CR14","doi-asserted-by":"publisher","first-page":"2920","DOI":"10.1109\/ACCESS.2016.2570021","volume":"2016","author":"D Xia","year":"2016","unstructured":"Xia D, Li H, Wang B, Li Y, Zhang Z. A map reduce-based nearest neighbor approach for big-data-driven traffic flow prediction. IEEE Access. 2016;2016:2920\u201334.","journal-title":"IEEE Access"},{"key":"359_CR15","doi-asserted-by":"crossref","unstructured":"Hou Z, Li X. Repeatability and Similarity of Freeway Traffic Flow and Long-Term Prediction Under Big Data. In: IEEE Transactions on Intelligent Transportation Systems. 2016; 1786\u201396.","DOI":"10.1109\/TITS.2015.2511156"},{"key":"359_CR16","doi-asserted-by":"crossref","unstructured":"Bifet A, et al. Efficient online evaluation of big data stream classifiers. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining. 2015.","DOI":"10.1145\/2783258.2783372"},{"key":"359_CR17","doi-asserted-by":"crossref","unstructured":"Vu AT, et al. Distributed adaptive model rules for mining big data streams. In: 2014 IEEE International Conference on Big Data (Big Data). IEEE, 2014.","DOI":"10.1109\/BigData.2014.7004251"},{"key":"359_CR18","unstructured":"Ta V-D, Chuan-Ming L, Goodwill WN. Big data stream computing in healthcare real-time analytics. In: 2016 IEEE International Conference on Cloud Computing and Big Data Analysis (ICCCBDA). IEEE, 2016."},{"key":"359_CR19","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40537-019-0220-5","volume":"6","author":"A Wibisono","year":"2019","unstructured":"Wibisono A, Sarwinda D, Mursanto P. Tree stream mining algorithm with Chernoff-bound and standard deviation approach for big data stream. Journal of Big Data. 2019;6:1.","journal-title":"Journal of Big Data"},{"key":"359_CR20","unstructured":"Phillips, J.M. 2012. Chernoff-hoeffding inequality and applications. arXiv preprint arXiv:1209.6396. 2012 Sep 27."},{"key":"359_CR21","unstructured":"Y. Lv, Y. Duan, W. Kang, Z. Li and F. Y. Wang. 2015. Traffic Flow Prediction with Big Data: A Deep Learning Approach. In: IEEE Transactions on Intelligent Transportation Systems. 16, 2 (April 2015), 865\u201373."},{"key":"359_CR22","unstructured":"Grab, Traffic Management| Grab AI, https:\/\/www.aiforsea.com\/traffic-management."},{"key":"359_CR23","unstructured":"Open Power System Data (OPSD). (2018). Data Platform: Renewable Power Plants. https:\/\/data.open-power-system-data.org\/renewable_power_plants\/."},{"key":"359_CR24","unstructured":"Smart Green Infrastructure Monitoring Sensors - Historical, https:\/\/data.cityofchicago.org\/Environment-Sustainable-Development\/Smart-Green-Infrastructure-Monitoring-Sensors-Hist\/ggws-77ih, US-Department of Transporta-tion-Seattle, Accessed 5 Apr 2019."},{"key":"359_CR25","first-page":"1601","volume":"11","author":"A Bifet","year":"2010","unstructured":"Bifet A, Holmes G, Kirkby R, Pfahringer B. MOA: massive Online Analysis. J Mach Learn Res. 2010;11:1601\u20134.","journal-title":"J Mach Learn Res"}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-020-00359-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s40537-020-00359-w\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-020-00359-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,10,13]],"date-time":"2021-10-13T02:27:49Z","timestamp":1634092069000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-020-00359-w"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,10,13]]},"references-count":25,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["359"],"URL":"https:\/\/doi.org\/10.1186\/s40537-020-00359-w","relation":{},"ISSN":["2196-1115"],"issn-type":[{"type":"electronic","value":"2196-1115"}],"subject":[],"published":{"date-parts":[[2020,10,13]]},"assertion":[{"value":"28 April 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 September 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 October 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors declare that they have no competing interests.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"85"}}