{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,23]],"date-time":"2026-01-23T07:48:59Z","timestamp":1769154539807,"version":"3.49.0"},"reference-count":51,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2021,11,15]],"date-time":"2021-11-15T00:00:00Z","timestamp":1636934400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100016999","name":"Western Norway University of Applied Sciences","doi-asserted-by":"crossref","id":[{"id":"10.13039\/100016999","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100005632","name":"National Centre for Research and Development","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100005632","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Automated Guided Vehicles integrated with Collaborative Robots for Smart Industry Perspective","award":["NOR\/POLNOR\/CoBotAGV\/0027\/2019 -00"],"award-info":[{"award-number":["NOR\/POLNOR\/CoBotAGV\/0027\/2019 -00"]}]},{"name":"NSF","award":["III-1763325, III-1909323, III-2106758, and SaTC-1930941"],"award-info":[{"award-number":["III-1763325, III-1909323, III-2106758, and SaTC-1930941"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Knowl. Discov. Data"],"published-print":{"date-parts":[[2022,6,30]]},"abstract":"<jats:p>High-utility sequential pattern mining (HUSPM) is a hot research topic in recent decades since it combines both sequential and utility properties to reveal more information and knowledge rather than the traditional frequent itemset mining or sequential pattern mining. Several works of HUSPM have been presented but most of them are based on main memory to speed up mining performance. However, this assumption is not realistic and not suitable in large-scale environments since in real industry, the size of the collected data is very huge and it is impossible to fit the data into the main memory of a single machine. In this article, we first develop a parallel and distributed three-stage MapReduce model for mining high-utility sequential patterns based on large-scale databases. Two properties are then developed to hold the correctness and completeness of the discovered patterns in the developed framework. In addition, two data structures called sidset and utility-linked list are utilized in the developed framework to accelerate the computation for mining the required patterns. From the results, we can observe that the designed model has good performance in large-scale datasets in terms of runtime, memory, efficiency of the number of distributed nodes, and scalability compared to the serial HUSP-Span approach.<\/jats:p>","DOI":"10.1145\/3487046","type":"journal-article","created":{"date-parts":[[2021,11,15]],"date-time":"2021-11-15T17:31:28Z","timestamp":1636997488000},"page":"1-26","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":24,"title":["Scalable Mining of High-Utility Sequential Patterns With Three-Tier MapReduce Model"],"prefix":"10.1145","volume":"16","author":[{"given":"Jerry Chun-Wei","family":"Lin","sequence":"first","affiliation":[{"name":"Western Norway University of Applied Sciences, Bergen, Norway"}]},{"given":"Youcef","family":"Djenouri","sequence":"additional","affiliation":[{"name":"SINTEF Digital, Oslo, Norway"}]},{"given":"Gautam","family":"Srivastava","sequence":"additional","affiliation":[{"name":"Brandon University, Canada and China Medical University, Taichung, Taiwan"}]},{"given":"Yuanfa","family":"Li","sequence":"additional","affiliation":[{"name":"Harbin Institute of Technology (Shenzhen), Shenzhen, China"}]},{"given":"Philip S.","family":"Yu","sequence":"additional","affiliation":[{"name":"University of Illinois at Chicago, Illinois, USA"}]}],"member":"320","published-online":{"date-parts":[[2021,11,15]]},"reference":[{"key":"e_1_3_1_2_2","doi-asserted-by":"publisher","DOI":"10.1109\/69.250074"},{"key":"e_1_3_1_3_2","doi-asserted-by":"publisher","DOI":"10.5555\/645920.672836"},{"key":"e_1_3_1_4_2","doi-asserted-by":"publisher","DOI":"10.5555\/645480.655281"},{"key":"e_1_3_1_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/SNPD.2010.21"},{"key":"e_1_3_1_6_2","doi-asserted-by":"crossref","unstructured":"C. F. Ahmed S. K. Tanbeer and B. S. Jeong. 2010. A novel approach for mining high-utility sequential patterns in sequence databases . Electronics and Telecommunications Research Institute 676\u2013686.","DOI":"10.4218\/etrij.10.1510.0066"},{"key":"e_1_3_1_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2015.2420557"},{"key":"e_1_3_1_8_2","doi-asserted-by":"crossref","unstructured":"U. Ahmed J. C. W. Lin G. Srivastava R. Yasin and Y. Djenouri. 2020. An evolutionary model to mine high expected utility patterns from uncertain databases. IEEE Transactions on Emerging Topics in Computational Intelligence 5 1 (2020) 19\u201328.","DOI":"10.1109\/TETCI.2020.3000224"},{"key":"e_1_3_1_9_2","doi-asserted-by":"publisher","DOI":"10.1109\/69.553155"},{"key":"e_1_3_1_10_2","doi-asserted-by":"publisher","DOI":"10.5555\/951949.952150"},{"key":"e_1_3_1_11_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.bdr.2016.07.001"},{"key":"e_1_3_1_12_2","doi-asserted-by":"publisher","DOI":"10.1145\/1327452.1327492"},{"key":"e_1_3_1_13_2","doi-asserted-by":"crossref","unstructured":"K. C. Duong M. Bamha A. Giacometti D. Li A. Soulet and C. Vrain. 2018. Mapfim+: Memory aware parallelized frequent itemset mining in very large datasets. In Proceedings of the Transactions on Large-Scale Data-and Knowledge-Centered Systems XXXIX Vol. 39. Springer Berlin 200\u2013225. https:\/\/doi.org\/10.1007\/978-3-662-58415-6_7","DOI":"10.1007\/978-3-662-58415-6_7"},{"key":"e_1_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-18032-8_19"},{"key":"e_1_3_1_15_2","doi-asserted-by":"publisher","DOI":"10.1145\/3314107"},{"key":"e_1_3_1_16_2","doi-asserted-by":"publisher","DOI":"10.1145\/3314107"},{"key":"e_1_3_1_17_2","doi-asserted-by":"publisher","DOI":"10.1145\/3311350.3347167"},{"key":"e_1_3_1_18_2","doi-asserted-by":"publisher","DOI":"10.1023\/B:DAMI.0000005258.31418.83"},{"key":"e_1_3_1_19_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2020.106653"},{"key":"e_1_3_1_20_2","doi-asserted-by":"publisher","DOI":"10.1145\/1454008.1454027"},{"key":"e_1_3_1_21_2","doi-asserted-by":"publisher","DOI":"10.1007\/11430919_79"},{"key":"e_1_3_1_22_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2010.12.082"},{"key":"e_1_3_1_23_2","doi-asserted-by":"publisher","DOI":"10.1145\/2396761.2396773"},{"key":"e_1_3_1_24_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2012.20"},{"key":"e_1_3_1_25_2","doi-asserted-by":"publisher","DOI":"10.1145\/2184751.2184842"},{"key":"e_1_3_1_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/iFuzzy.2013.6825476"},{"key":"e_1_3_1_27_2","doi-asserted-by":"crossref","first-page":"649","DOI":"10.1007\/978-3-319-18032-8_51","volume-title":"Advances in Knowledge Discovery and Data Mining","author":"Lin Y. C.","year":"2015","unstructured":"Y. C. Lin, C. W. Wu, and V. S. Tseng. 2015. Mining high utility itemsets in big data. Advances in Knowledge Discovery and Data Mining. Springer International Publishing, 649\u2013661."},{"key":"e_1_3_1_28_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2015.2510012"},{"key":"e_1_3_1_29_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2015.12.019"},{"key":"e_1_3_1_30_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.engappai.2018.10.010"},{"key":"e_1_3_1_31_2","doi-asserted-by":"publisher","DOI":"10.1109\/BigData.2013.6691742"},{"key":"e_1_3_1_32_2","doi-asserted-by":"publisher","DOI":"10.3390\/s20041078"},{"key":"e_1_3_1_33_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2020.03.030"},{"key":"e_1_3_1_34_2","doi-asserted-by":"publisher","DOI":"10.5555\/645484.656379"},{"key":"e_1_3_1_35_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46131-1_8"},{"key":"e_1_3_1_36_2","doi-asserted-by":"publisher","DOI":"10.5555\/645337.650382"},{"key":"e_1_3_1_37_2","doi-asserted-by":"publisher","DOI":"10.5555\/1997305.1997329"},{"key":"e_1_3_1_38_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-31087-4_63"},{"key":"e_1_3_1_39_2","doi-asserted-by":"crossref","unstructured":"G. Srivastava J. C. W. Lin M. Pirouz Y. Li and U. Yun. 2020. A pre-large weighted-fusion system of sensed high-utility patterns. IEEE Sensors Journal 21 14 (2020) 15626\u201315634.","DOI":"10.1109\/JSEN.2020.2991045"},{"key":"e_1_3_1_40_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2019.112967"},{"key":"e_1_3_1_41_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2012.59"},{"key":"e_1_3_1_42_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.2992729"},{"key":"e_1_3_1_43_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-015-0914-8"},{"key":"e_1_3_1_44_2","doi-asserted-by":"publisher","DOI":"10.1145\/3363571"},{"key":"e_1_3_1_45_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2020.12.004"},{"key":"e_1_3_1_46_2","doi-asserted-by":"publisher","DOI":"10.1137\/1.9781611972740.51"},{"key":"e_1_3_1_47_2","doi-asserted-by":"publisher","DOI":"10.1145\/2339530.2339636"},{"key":"e_1_3_1_48_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2019.09.024"},{"key":"e_1_3_1_49_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2013.148"},{"key":"e_1_3_1_50_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDMW.2007.115"},{"key":"e_1_3_1_51_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-016-0986-0"},{"key":"e_1_3_1_52_2","doi-asserted-by":"publisher","DOI":"10.1186\/s12918-017-0475-4"}],"container-title":["ACM Transactions on Knowledge Discovery from Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3487046","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3487046","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3487046","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:18:47Z","timestamp":1750191527000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3487046"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,11,15]]},"references-count":51,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2022,6,30]]}},"alternative-id":["10.1145\/3487046"],"URL":"https:\/\/doi.org\/10.1145\/3487046","relation":{},"ISSN":["1556-4681","1556-472X"],"issn-type":[{"value":"1556-4681","type":"print"},{"value":"1556-472X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,11,15]]},"assertion":[{"value":"2021-01-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-09-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-11-15","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}