{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,21]],"date-time":"2025-10-21T03:28:19Z","timestamp":1761017299570,"version":"3.37.3"},"reference-count":30,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,3,16]],"date-time":"2020-03-16T00:00:00Z","timestamp":1584316800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,3,16]],"date-time":"2020-03-16T00:00:00Z","timestamp":1584316800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"JST ACT-i"},{"DOI":"10.13039\/501100001691","name":"Japan Society for the Promotion of Science","doi-asserted-by":"publisher","award":["16H05904"],"award-info":[{"award-number":["16H05904"]}],"id":[{"id":"10.13039\/501100001691","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Negative screening is one method to avoid interactions with inappropriate entities. For example, financial institutions keep investment exclusion lists of inappropriate firms that have environmental, social, and governance (ESG) problems. They create their investment exclusion lists by gathering information from various news sources to keep their portfolios profitable as well as green. International organizations also maintain smart sanctions lists that are used to prohibit trade with entities that are involved in illegal activities. In the present paper, we focus on the prediction of investment exclusion lists in the finance domain. We construct a vast heterogeneous information network that covers the necessary information surrounding each firm, which is assembled using seven professionally curated datasets and two open datasets, which results in approximately 50 million nodes and 400 million edges in total. Exploiting these vast datasets and motivated by how professional investigators and journalists undertake their daily investigations, we propose a model that can learn to predict firms that are more likely to be added to an investment exclusion list in the near future. Our approach is tested using the negative news investment exclusion list data of more than 35,000 firms worldwide from January 2012 to May 2018. Comparing with the state-of-the-art methods with and without using the network, we show that the predictive accuracy is substantially improved when using the vast information stored in the heterogeneous information network. This work suggests new ways to consolidate the diffuse information contained in big data to monitor dominant firms on a global scale for better risk management and more socially responsible investment.<\/jats:p>","DOI":"10.1186\/s40537-020-00295-9","type":"journal-article","created":{"date-parts":[[2020,3,16]],"date-time":"2020-03-16T11:06:08Z","timestamp":1584356768000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Prediction of ESG compliance using a heterogeneous information network"],"prefix":"10.1186","volume":"7","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4452-0302","authenticated-orcid":false,"given":"Ryohei","family":"Hisano","sequence":"first","affiliation":[]},{"given":"Didier","family":"Sornette","sequence":"additional","affiliation":[]},{"given":"Takayuki","family":"Mizuno","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2020,3,16]]},"reference":[{"key":"295_CR1","unstructured":"OECD. Responsible business conduct for institutional investors: key considerations for due diligence under the oecd guidelines for multinational enterprises. OECD guidlines; 2017."},{"key":"295_CR2","doi-asserted-by":"publisher","unstructured":"Sherwood W.M, Pollard J. Responsible investing: an introduction to environmental, social, and governance investments. Routledge; 2018. https:\/\/doi.org\/10.4324\/9780203712078.","DOI":"10.4324\/9780203712078"},{"key":"295_CR3","unstructured":"Markham JW. A financial history of modern U.S. corporate Scandals: from enron to reform. M.E. Sharpe; 2006. https:\/\/books.google.co.jp\/books?id=Z7qTGiF8FCgC."},{"key":"295_CR4","unstructured":"Hill C. A survey of heterogeneous information network analysis. U Pitt L Rev. 2010;585."},{"key":"295_CR5","volume-title":"Kitacyosen Kaku No Shikingen Kokuren Sousa No Hiroku [funding source of North Korea: a note on United Nation\u2019s Investigation] Funding Source","author":"K Furukawa","year":"2017","unstructured":"Furukawa K. Kitacyosen Kaku No Shikingen Kokuren Sousa No Hiroku [funding source of North Korea: a note on United Nation\u2019s Investigation] Funding Source. Tokyo: Tokyo Shincyosya; 2017."},{"key":"295_CR6","unstructured":"of Exchanges, W.F. Wfe annual statistics guide 2017; 2017."},{"key":"295_CR7","unstructured":"Hofmann A, Perchani S, Portisch J, Hertling S, Paulheim H. Dbkwik: towards knowledge graph creation from thousands of wikis. In: ISWC-P&D-Industry 2017 : proceedings of the ISWC 2017 posters & demonstrations and industry tracks co-located with 16th international semantic web conference (ISWC 2017) Vienna, Austria, October 23rd to 25th, 2017, vol. 1963. RWTH, Aachen; 2017, p. 540. http:\/\/ub-madoc.bib.uni-mannheim.de\/43119\/."},{"key":"295_CR8","doi-asserted-by":"publisher","unstructured":"Dong X, Gabrilovich E, Heitz G, Horn W, Lao N, Murphy K, Strohmann T, Sun S, Zhang W. Knowledge vault: A web-scale approach to probabilistic knowledge fusion. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. KDD \u201914. ACM, New York, NY, USA; 2014, p. 601\u201310. https:\/\/doi.org\/10.1145\/2623330.2623623. http:\/\/doi.acm.org\/10.1145\/2623330.2623623.","DOI":"10.1145\/2623330.2623623"},{"key":"295_CR9","doi-asserted-by":"crossref","unstructured":"Auer S, Bizer C, Kobilarov G, Lehmann J, Cyganiak R, Ives Z. Dbpedia: A nucleus for a web of open data. In: Proceedings of the 6th International The Semantic Web and 2Nd Asian Conference on Asian Semantic Web Conference. ISWC\u201907\/ASWC\u201907. Springer, Berlin, Heidelberg; 2007, p. 722\u201335. http:\/\/dl.acm.org\/citation.cfm?id=1785162.1785216.","DOI":"10.1007\/978-3-540-76298-0_52"},{"issue":"1\u20132","key":"295_CR10","doi-asserted-by":"publisher","first-page":"39","DOI":"10.3233\/DS-170007","volume":"1","author":"X Wilcke","year":"2017","unstructured":"Wilcke X, Bloem P, De Boer V. The knowledge graph as the default data model for learning on heterogeneous knowledge. Data Sci. 2017;1(1\u20132):39\u201357. https:\/\/doi.org\/10.3233\/DS-170007.","journal-title":"Data Sci"},{"issue":"2","key":"295_CR11","doi-asserted-by":"publisher","first-page":"20","DOI":"10.1145\/2481244.2481248","volume":"14","author":"Y Sun","year":"2013","unstructured":"Sun Y, Han J. Mining heterogeneous information networks: a structural analysis approach. SIGKDD Explor Newslett. 2013;14(2):20\u20138. https:\/\/doi.org\/10.1145\/2481244.2481248.","journal-title":"SIGKDD Explor Newslett"},{"key":"295_CR12","doi-asserted-by":"publisher","unstructured":"Wang H, Zhang F, Wang J, Zhao M, Li W, Xie X, Guo M. Ripplenet: propagating user preferences on the knowledge graph for recommender systems. In: Proceedings of the 27th ACM international conference on information and knowledge management. CIKM \u201918. ACM, New York, NY, USA; 2018, p. 417\u201326. https:\/\/doi.org\/10.1145\/3269206.3271739. http:\/\/doi.acm.org\/10.1145\/3269206.3271739.","DOI":"10.1145\/3269206.3271739"},{"key":"295_CR13","doi-asserted-by":"publisher","unstructured":"Chen Y, Liu R, Xu W. Movie recommendation in heterogeneous information networks. In: 2016 IEEE information technology, networking, electronic and automation control conference; 2016, p. 637\u201340. https:\/\/doi.org\/10.1109\/ITNEC.2016.7560438.","DOI":"10.1109\/ITNEC.2016.7560438"},{"key":"295_CR14","doi-asserted-by":"publisher","unstructured":"Cao B, Mao M, Viidu S, Yu P.S. Hitfraud: A broad learning approach for collective fraud detection in heterogeneous information networks. In: 2017 IEEE international conference on data mining (ICDM); 2018, p. 769\u201374. https:\/\/doi.org\/10.1109\/ICDM.2017.90. http:\/\/doi.ieeecomputersociety.org\/10.1109\/ICDM.2017.90.","DOI":"10.1109\/ICDM.2017.90"},{"issue":"1","key":"295_CR15","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1109\/JPROC.2015.2483592","volume":"104","author":"M Nickel","year":"2016","unstructured":"Nickel M, Murphy K, Tresp V, Gabrilovich E. A review of relational machine learning for knowledge graphs. Proc IEEE. 2016;104(1):11\u201333.","journal-title":"Proc IEEE"},{"issue":"12","key":"295_CR16","doi-asserted-by":"publisher","first-page":"2724","DOI":"10.1109\/TKDE.2017.2754499","volume":"29","author":"Q Wang","year":"2017","unstructured":"Wang Q, Mao Z, Wang B, Guo L. Knowledge graph embedding: a survey of approaches and applications. IEEE Trans Knowl Data Eng. 2017;29(12):2724\u201343. https:\/\/doi.org\/10.1109\/TKDE.2017.2754499.","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"295_CR17","doi-asserted-by":"publisher","unstructured":"Hu Y, Da Q, Zeng A, Yu Y, Xu Y. Reinforcement learning to rank in e-commerce search engine: formalization, analysis, and application. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining. KDD \u201918. ACM, New York, NY, USA; 2018, p. 368\u201377. https:\/\/doi.org\/10.1145\/3219819.3219846. http:\/\/doi.acm.org\/10.1145\/3219819.3219846.","DOI":"10.1145\/3219819.3219846"},{"key":"295_CR18","doi-asserted-by":"crossref","unstructured":"Auer S, Bizer C, Kobilarov G, Lehmann J, Cyganiak R, Ives Z. Dbpedia: a nucleus for a web of open data. In: Proceedings of the 6th international the semantic web and 2nd Asian conference on Asian semantic web conference. ISWC\u201907\/ASW\u201907. Springer, Berlin, Heidelberg; 2007, p. 722\u201335.","DOI":"10.1007\/978-3-540-76298-0_52"},{"key":"295_CR19","doi-asserted-by":"publisher","unstructured":"Wan C, Li X, Kao B, Yu X, Gu Q, Cheung D, Han J. Classification with active learning and meta-paths in heterogeneous information networks. In: Proceedings of the 24th ACM international on conference on information and knowledge management. CIKM \u201915. ACM, New York, NY, USA; 2015, p. 443\u201352. https:\/\/doi.org\/10.1145\/2806416.2806507. http:\/\/doi.acm.org\/10.1145\/2806416.2806507.","DOI":"10.1145\/2806416.2806507"},{"key":"295_CR20","volume-title":"Semi-supervised learning","author":"O Chapelle","year":"2010","unstructured":"Chapelle O, Schlkopf B, Zien A. Semi-supervised learning. 1st ed. Cambridge: The MIT Press; 2010.","edition":"1"},{"key":"295_CR21","doi-asserted-by":"publisher","unstructured":"Backstrom L, Leskovec J. Supervised random walks: predicting and recommending links in social networks. In: Proceedings of the fourth ACM international conference on web search and data mining. WSDM \u201911. ACM, New York, NY, USA; 2011, p. 635\u201344. https:\/\/doi.org\/10.1145\/1935826.1935914. http:\/\/doi.acm.org\/10.1145\/1935826.1935914.","DOI":"10.1145\/1935826.1935914"},{"key":"295_CR22","unstructured":"Lao N, Mitchell T, Cohen WW. Random walk inference and learning in a large scale knowledge base. In: Proceedings of the conference on empirical methods in natural language processing. EMNLP \u201911. Association for Computational Linguistics, Stroudsburg, PA, USA; 2011, p. 529\u201339. http:\/\/dl.acm.org\/citation.cfm?id=2145432.2145494."},{"key":"295_CR23","doi-asserted-by":"publisher","unstructured":"Wang D, Cui P, Zhu W. Structural deep network embedding. In: Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. KDD \u201916. ACM, New York, NY, USA; 2016, p. 1225\u201334. https:\/\/doi.org\/10.1145\/2939672.2939753. http:\/\/doi.acm.org\/10.1145\/2939672.2939753.","DOI":"10.1145\/2939672.2939753"},{"key":"295_CR24","doi-asserted-by":"publisher","unstructured":"Davis J, Goadrich M. The relationship between precision-recall and roc curves. In: Proceedings of the 23rd international conference on machine learning. ICML \u201906. ACM, New York, NY, USA; 2006, p. 233\u201340. https:\/\/doi.org\/10.1145\/1143844.1143874. http:\/\/doi.acm.org\/10.1145\/1143844.1143874.","DOI":"10.1145\/1143844.1143874"},{"key":"295_CR25","series-title":"Springer series in statistics","doi-asserted-by":"publisher","DOI":"10.1007\/978-0-387-21606-5","volume-title":"The elements of statistical learning","author":"T Hastie","year":"2001","unstructured":"Hastie T, Tibshirani R, Friedman J. The elements of statistical learning., Springer series in statisticsNew York: Springer; 2001."},{"issue":"1","key":"295_CR26","first-page":"849","volume":"13","author":"M \u017ditnik","year":"2012","unstructured":"\u017ditnik M, Zupan B. Nimfa: a python library for nonnegative matrix factorization. J Mach Learn Res. 2012;13(1):849\u201353.","journal-title":"J Mach Learn Res"},{"issue":"1","key":"295_CR27","doi-asserted-by":"publisher","first-page":"421","DOI":"10.32614\/RJ-2017-016","volume":"9","author":"BM Greenwell","year":"2017","unstructured":"Greenwell BM. pdp: an r package for constructing partial dependence plots. R J. 2017;9(1):421\u201336.","journal-title":"R J"},{"issue":"2","key":"295_CR28","doi-asserted-by":"publisher","first-page":"599","DOI":"10.1111\/j.1540-6261.2012.01726.x","volume":"67","author":"DH Solomon","year":"2012","unstructured":"Solomon DH. Selective publicity and stock prices. J Financ. 2012;67(2):599\u2013638. https:\/\/doi.org\/10.1111\/j.1540-6261.2012.01726.x.","journal-title":"J Financ"},{"key":"295_CR29","doi-asserted-by":"publisher","DOI":"10.2139\/ssrn.2738806","author":"U Birchler","year":"2016","unstructured":"Birchler U, Hegglin R, Reichenecker MR, Wagner AF. Which swiss gnomes attract money? efficiency and reputation as performance drivers of wealth management banks. Swiss Financ Instit Res Paper. 2016;. https:\/\/doi.org\/10.2139\/ssrn.2738806.","journal-title":"Swiss Financ Instit Res Paper"},{"key":"295_CR30","unstructured":"Wang T. An in-depth analysis of the impact of esg investing on returns using large-scale news data. Master Thesis, ETH Zurich; 2019."}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-020-00295-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s40537-020-00295-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-020-00295-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,3,16]],"date-time":"2021-03-16T00:51:49Z","timestamp":1615855909000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-020-00295-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,3,16]]},"references-count":30,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["295"],"URL":"https:\/\/doi.org\/10.1186\/s40537-020-00295-9","relation":{},"ISSN":["2196-1115"],"issn-type":[{"type":"electronic","value":"2196-1115"}],"subject":[],"published":{"date-parts":[[2020,3,16]]},"assertion":[{"value":"16 November 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 February 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 March 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors declare that they have no competing interests.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"22"}}