{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,4]],"date-time":"2026-05-04T00:28:55Z","timestamp":1777854535583,"version":"3.51.4"},"reference-count":60,"publisher":"SAGE Publications","issue":"2","license":[{"start":{"date-parts":[[2016,7,10]],"date-time":"2016-07-10T00:00:00Z","timestamp":1468108800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Information Science"],"published-print":{"date-parts":[[2017,4]]},"abstract":"<jats:p>OLAP (On-line Analytical Processing) can provide users with aggregate results from different perspectives and granularities. With the advent of heterogeneous information networks that consist of multi-type, interconnected nodes, such as bibliographic networks and knowledge graphs, it is important to study flexible aggregation in such networks. The aggregation results by existing work are limited to one type of node, which cannot be applied to aggregation on multi-type nodes, and relations in large-scale heterogeneous information networks. In this paper, we investigate the flexible aggregation problem on large-scale heterogeneous information networks, which is defined on multi-type nodes and relations. Moreover, by considering both attributes and structures, we propose a novel function based on graph entropy to measure the similarities of nodes. Further, we prove that the aggregation problem based on the function is NP-hard. Therefore, we develop an efficient heuristic algorithm for aggregation in two phases: informational aggregation and structural aggregation. The algorithm has linear time and space complexity. Extensive experiments on real-world datasets demonstrate the effectiveness and efficiency of the proposed algorithm.<\/jats:p>","DOI":"10.1177\/0165551516630237","type":"journal-article","created":{"date-parts":[[2016,2,1]],"date-time":"2016-02-01T21:44:16Z","timestamp":1454363056000},"page":"186-203","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":6,"title":["A flexible aggregation framework on large-scale heterogeneous information networks"],"prefix":"10.1177","volume":"43","author":[{"given":"Dan","family":"Yin","sequence":"first","affiliation":[{"name":"School of Computer Science and Technology, Harbin Institute of Technology, China"}]},{"given":"Hong","family":"Gao","sequence":"additional","affiliation":[{"name":"School of Computer Science and Technology, Harbin Institute of Technology, China"}]}],"member":"179","published-online":{"date-parts":[[2016,7,10]]},"reference":[{"key":"bibr1-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2008.30"},{"key":"bibr2-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1145\/1989323.1989413"},{"key":"bibr3-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1145\/1376616.1376675"},{"key":"bibr4-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-07782-6_66"},{"key":"bibr5-0165551516630237","unstructured":"Yin D, Gao H, Zou ZN, Li JZ. Minimized-cost cube query on heterogeneous information networks. Journal of Combinatorial Optimization 2015, in press, http:\/\/link.springer.com\/article\/10.1007%2Fs10878\u2013015\u20139967\u20136."},{"key":"bibr6-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1145\/2723372.2723737"},{"key":"bibr7-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1145\/956750.956769"},{"key":"bibr8-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1177\/0165551515602808"},{"key":"bibr9-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1007\/s11135-013-9837-1"},{"key":"bibr10-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1007\/s11135-012-9799-8"},{"key":"bibr11-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1177\/0165551515603324"},{"key":"bibr12-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2011.09.096"},{"key":"bibr13-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1177\/0165551515588669"},{"key":"bibr14-0165551516630237","unstructured":"Mao J, Lu K, Li G, Yi M. Profiling users with tag networks in diffusion-based personalized recommendation. Journal of Information Science 2015, in press, http:\/\/jis.sagepub.com\/content\/early\/2015\/10\/12\/0165551515603321.full."},{"key":"bibr15-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1177\/0165551515602846"},{"key":"bibr16-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1177\/0165551515598926"},{"key":"bibr17-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1177\/0165551515602847"},{"key":"bibr18-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1177\/0165551515603323"},{"key":"bibr19-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4614-0320-3_6"},{"issue":"1","key":"bibr20-0165551516630237","first-page":"1","volume":"1","author":"Hu H","year":"2005","journal-title":"Bioinformatics"},{"key":"bibr21-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2015.7113344"},{"key":"bibr22-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1145\/2339530.2339726"},{"key":"bibr23-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1109\/TSP.2012.2212886"},{"key":"bibr24-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2015.7113346"},{"key":"bibr25-0165551516630237","unstructured":"Cui J, Wang F, Zhai J. Citation networks as a multi-layer graph: Link prediction and importance ranking, http:\/\/snap.stanford.edu\/class\/cs224w-2010\/proj2010\/."},{"key":"bibr26-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1080\/01969722.2015.1007737"},{"key":"bibr27-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2010.08.042"},{"key":"bibr28-0165551516630237","first-page":"506","volume-title":"Proceedings of very large database conference","author":"Agarwal S","year":"1996"},{"key":"bibr29-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1023\/A:1009726021843"},{"key":"bibr30-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-009-0228-9"},{"key":"bibr31-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4419-6515-8_16"},{"key":"bibr32-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2014.6816676"},{"key":"bibr33-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2010.5447830"},{"key":"bibr34-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1145\/2339530.2339738"},{"key":"bibr35-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1145\/2783258.2783328"},{"key":"bibr36-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1145\/1516360.1516426"},{"issue":"5","key":"bibr37-0165551516630237","first-page":"394","volume":"5","author":"Sun Y","year":"2012","journal-title":"Endowment of Very Large DataBases"},{"key":"bibr38-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1145\/2513092.2500492"},{"key":"bibr39-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1145\/1557019.1557107"},{"issue":"11","key":"bibr40-0165551516630237","first-page":"992","volume":"4","author":"Sun Y","year":"2011","journal-title":"Endowment of Very Large Databases"},{"key":"bibr41-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1145\/2247596.2247618"},{"key":"bibr42-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1109\/ASONAM.2011.112"},{"key":"bibr43-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1145\/2124295.2124373"},{"key":"bibr44-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1145\/2339530.2339765"},{"key":"bibr45-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2014.6816703"},{"key":"bibr46-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2009.43"},{"key":"bibr47-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1145\/2020408.2020603"},{"key":"bibr48-0165551516630237","first-page":"1199","volume-title":"Proceedings of ACM international conference on management of data","author":"Shen W","year":"2014"},{"key":"bibr49-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1145\/2621934.2621937"},{"key":"bibr50-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2015.2426696"},{"issue":"8","key":"bibr51-0165551516630237","first-page":"565","volume":"7","author":"Yang S","year":"2014","journal-title":"Endowment of Very Large Databases"},{"key":"bibr52-0165551516630237","unstructured":"Cover T, Thomas J. Elements of information theory, 2nd edn.Hoboken, NJ: Wiley, 2006, pp. 12\u201314."},{"key":"bibr53-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1002\/j.1538-7305.1948.tb01338.x"},{"key":"bibr54-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1016\/j.amc.2007.12.010"},{"key":"bibr55-0165551516630237","first-page":"411","author":"Korner J","year":"1973","journal-title":"6th Prague conference on information theory"},{"key":"bibr56-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2010.08.041"},{"key":"bibr57-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1145\/1134271.1134282"},{"key":"bibr58-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1007\/BF02477860"},{"key":"bibr59-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1016\/j.tcs.2005.09.015"},{"key":"bibr60-0165551516630237","doi-asserted-by":"publisher","DOI":"10.1145\/1232722.1232727"}],"container-title":["Journal of Information Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/0165551516630237","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/0165551516630237","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/0165551516630237","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T23:09:24Z","timestamp":1777504164000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/0165551516630237"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,7,10]]},"references-count":60,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2017,4]]}},"alternative-id":["10.1177\/0165551516630237"],"URL":"https:\/\/doi.org\/10.1177\/0165551516630237","relation":{},"ISSN":["0165-5515","1741-6485"],"issn-type":[{"value":"0165-5515","type":"print"},{"value":"1741-6485","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,7,10]]}}}