{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T17:59:28Z","timestamp":1775325568347,"version":"3.50.1"},"reference-count":45,"publisher":"MIT Press","issue":"1","content-domain":{"domain":["www.mitpressjournals.org"],"crossmark-restriction":true},"short-container-title":["Quantitative Science Studies"],"published-print":{"date-parts":[[2020,2]]},"abstract":"<jats:p>An ongoing project explores the extent to which artificial intelligence (AI), specifically in the areas of natural language processing and semantic reasoning, can be exploited to facilitate the studies of science by deploying software agents equipped with natural language understanding capabilities to read scholarly publications on the web. The knowledge extracted by these AI agents is organized into a heterogeneous graph, called Microsoft Academic Graph (MAG), where the nodes and the edges represent the entities engaging in scholarly communications and the relationships among them, respectively. The frequently updated data set and a few software tools central to the underlying AI components are distributed under an open data license for research and commercial applications. This paper describes the design, schema, and technical and business motivations behind MAG and elaborates how MAG can be used in analytics, search, and recommendation scenarios. How AI plays an important role in avoiding various biases and human induced errors in other data sets and how the technologies can be further improved in the future are also discussed.<\/jats:p>","DOI":"10.1162\/qss_a_00021","type":"journal-article","created":{"date-parts":[[2020,1,23]],"date-time":"2020-01-23T14:38:13Z","timestamp":1579790293000},"page":"396-413","update-policy":"https:\/\/doi.org\/10.1162\/mitpressjournals.corrections.policy","source":"Crossref","is-referenced-by-count":408,"title":["Microsoft Academic Graph: When experts are not enough"],"prefix":"10.1162","volume":"1","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7089-7966","authenticated-orcid":true,"given":"Kuansan","family":"Wang","sequence":"first","affiliation":[{"name":"Microsoft Research, Redmond, WA, 98052, USA"}]},{"given":"Zhihong","family":"Shen","sequence":"additional","affiliation":[{"name":"Microsoft Research, Redmond, WA, 98052, USA"}]},{"given":"Chiyuan","family":"Huang","sequence":"additional","affiliation":[{"name":"Microsoft Research, Redmond, WA, 98052, USA"}]},{"given":"Chieh-Han","family":"Wu","sequence":"additional","affiliation":[{"name":"Microsoft Research, Redmond, WA, 98052, USA"}]},{"given":"Yuxiao","family":"Dong","sequence":"additional","affiliation":[{"name":"Microsoft Research, Redmond, WA, 98052, USA"}]},{"given":"Anshul","family":"Kanakia","sequence":"additional","affiliation":[{"name":"Microsoft Research, Redmond, WA, 98052, USA"}]}],"member":"281","reference":[{"issue":"7","key":"bib1","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1145\/3332803","volume":"62","author":"Berger E.","year":"2019","journal-title":"Communications of the ACM"},{"key":"bib2","author":"Chawla D.","year":"2019","journal-title":"Nature"},{"key":"bib3","first-page":"1","volume-title":"2013 IEEE International Conference on Big Data","author":"Cock M.","year":"2013"},{"issue":"6","key":"bib4","doi-asserted-by":"crossref","first-page":"92","DOI":"10.1145\/1953122.1953146","volume":"54","author":"Franceschet M.","year":"2011","journal-title":"Communications of the ACM"},{"issue":"3","key":"bib5","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1080\/09296179508590051","volume":"2","author":"Gale W.","year":"1995","journal-title":"Journal of Quantitative Linguistics"},{"issue":"3159","key":"bib6","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1126\/science.122.3159.108","volume":"122","author":"Garfield E.","year":"1955","journal-title":"Science"},{"issue":"3619","key":"bib7","doi-asserted-by":"crossref","first-page":"649","DOI":"10.1126\/science.144.3619.649","volume":"144","author":"Garfield E.","year":"1964","journal-title":"Science"},{"issue":"4060","key":"bib8","doi-asserted-by":"crossref","first-page":"471","DOI":"10.1126\/science.178.4060.471","volume":"178","author":"Garfield E.","year":"1972","journal-title":"Science"},{"key":"bib9","first-page":"39","author":"Gy\u00f6ngyi Z.","year":"2005","journal-title":"AIRWeb"},{"issue":"2","key":"bib10","doi-asserted-by":"crossref","first-page":"146","DOI":"10.1080\/00437956.1954.11659520","volume":"10","author":"Harris Z.","year":"1954","journal-title":"WORD"},{"issue":"1","key":"bib11","doi-asserted-by":"crossref","first-page":"341","DOI":"10.1007\/s11192-019-03114-y","volume":"120","author":"Harzing A.-W.","year":"2019","journal-title":"Scientometrics"},{"issue":"1","key":"bib12","doi-asserted-by":"crossref","first-page":"371","DOI":"10.1007\/s11192-016-2185-x","volume":"110","author":"Harzing A.-W.","year":"2017","journal-title":"Scientometrics"},{"key":"bib13","first-page":"1","volume-title":"The Handbook of Evolutionary Psychology","author":"Haselton M.","year":"2015"},{"issue":"7","key":"bib14","first-page":"6","volume":"22","author":"Herrmannova D.","year":"2016","journal-title":"D-Lib Magazine"},{"key":"bib15","doi-asserted-by":"crossref","DOI":"10.3389\/frma.2018.00023","volume":"3","author":"Hook D.","year":"2018","journal-title":"Frontiers in Research Metrics and Analytics"},{"issue":"3","key":"bib16","doi-asserted-by":"crossref","first-page":"1551","DOI":"10.1007\/s11192-017-2535-3","volume":"113","author":"Hug S.","year":"2017","journal-title":"Scientometrics"},{"issue":"1","key":"bib17","doi-asserted-by":"crossref","first-page":"371","DOI":"10.1007\/s11192-017-2247-8","volume":"111","author":"Hug S.","year":"2017","journal-title":"Scientometrics"},{"key":"bib18","doi-asserted-by":"crossref","first-page":"781","DOI":"10.1145\/3018661.3018699","volume-title":"Proceedings of the Tenth ACM International Conference on Web Search and Data Mining","author":"Joachims T.","year":"2017"},{"key":"bib19","first-page":"2893","volume-title":"Proceedings of WWW-2019","author":"Kanakia A.","year":"2019"},{"key":"bib20","first-page":"429","volume-title":"IJCAI\u201907 Proceedings of the 20th International Joint Conference on Artifical Intelligence","author":"Kanani P.","year":"2007"},{"issue":"1","key":"bib21","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1016\/j.joi.2018.01.009","volume":"12","author":"Kousha K.","year":"2018","journal-title":"Journal of Informetrics"},{"issue":"1","key":"bib22","first-page":"2921","volume":"16","author":"Li C.-L.","year":"2015","journal-title":"Journal of Machine Learning Research"},{"key":"bib23","first-page":"8","volume-title":"Proceedings of ACM SIGKDD Annual Conference on Knowledge Discovery and Data Mining","author":"Liu J.","year":"2013"},{"issue":"3","key":"bib24","doi-asserted-by":"crossref","first-page":"446","DOI":"10.1002\/asi.23056","volume":"65","author":"L\u00f3pez-C\u00f3zar E.","year":"2014","journal-title":"Journal of the Association for Information Science and Technology"},{"key":"bib25","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511809071","volume-title":"Introduction to information retrieval","author":"Manning C.","year":"2008"},{"issue":"44","key":"bib26","doi-asserted-by":"crossref","first-page":"11103","DOI":"10.1523\/JNEUROSCI.0002-08.2008","volume":"28","author":"Maslov S.","year":"2008","journal-title":"The Journal of Neuroscience"},{"key":"bib28","first-page":"3111","author":"Mikolov T.","year":"2013","journal-title":"Proceedings of Advances in Neural Information Processing Systems"},{"issue":"12","key":"bib29","first-page":"1","volume":"3","author":"Rougier N.","year":"2017","journal-title":"PeerJ"},{"key":"bib30","first-page":"1","volume-title":"Proceedings of ACM SIGKDD Annual Conference on Knowledge Discovery and Data Mining","author":"Roy S.","year":"2013"},{"key":"bib31","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1145\/2740908.2742839","volume-title":"Proceedings of the 24th International Conference on World Wide Web","author":"Sinha A.","year":"2015"},{"key":"bib33","doi-asserted-by":"crossref","first-page":"373","DOI":"10.1145\/2872518.2890513","volume-title":"WWW \u201916 Companion Proceedings of the 25th International Conference Companion on World Wide Web","author":"Tang J.","year":"2016"},{"issue":"2","key":"bib34","doi-asserted-by":"crossref","first-page":"34","DOI":"10.3390\/publications7020034","volume":"7","author":"Tennant J.","year":"2019","journal-title":"Publications"},{"issue":"4","key":"bib35","doi-asserted-by":"crossref","first-page":"1201","DOI":"10.1016\/j.joi.2017.10.006","volume":"11","author":"Thelwall M.","year":"2017","journal-title":"Journal of Informetrics"},{"issue":"2","key":"bib36","doi-asserted-by":"crossref","first-page":"913","DOI":"10.1007\/s11192-018-2704-z","volume":"115","author":"Thelwall M.","year":"2018","journal-title":"Scientometrics"},{"issue":"1","key":"bib37","doi-asserted-by":"crossref","first-page":"325","DOI":"10.1007\/s11192-017-2558-9","volume":"114","author":"Thelwall M.","year":"2018","journal-title":"Scientometrics"},{"issue":"1","key":"bib38","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.joi.2017.11.001","volume":"12","author":"Thelwall M.","year":"2018","journal-title":"Journal of Informetrics"},{"key":"bib39","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1007\/978-3-319-10377-8_4","volume-title":"Measuring Scholarly Impact","author":"Waltman L.","year":"2014"},{"key":"bib40","doi-asserted-by":"crossref","first-page":"45","DOI":"10.3389\/fdata.2019.00045","volume":"2","author":"Wang K.","year":"2019","journal-title":"Frontiers in Big Data"},{"issue":"1","key":"bib41","doi-asserted-by":"crossref","first-page":"819","DOI":"10.1002\/pra2.2017.14505401170","volume":"54","author":"Wang P.","year":"2017","journal-title":"Proceedings of the Association for Information Science and Technology"},{"key":"bib42","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1145\/2911451.2911537","volume-title":"Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval","author":"Wang X.","year":"2016"},{"key":"bib43","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1145\/2484028.2484053","volume-title":"Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval","author":"White R.","year":"2013"},{"issue":"6068","key":"bib45","doi-asserted-by":"crossref","first-page":"542","DOI":"10.1126\/science.1212540","volume":"335","author":"Wilhite A.","year":"2012","journal-title":"Science"},{"key":"bib46","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1007\/978-3-030-18590-9_12","volume-title":"International Conference on Database Systems for Advanced Applications","author":"Zhang S.","year":"2019"},{"key":"bib47","doi-asserted-by":"crossref","first-page":"1002","DOI":"10.1145\/3219819.3219859","volume-title":"Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","author":"Zhang Y.","year":"2018"},{"key":"bib48","first-page":"6","volume-title":"Proceedings of ACM SIGKDD Annual Conference on Knowledge Discovery and Data Mining","author":"Zhong E.","year":"2013"}],"container-title":["Quantitative Science Studies"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mitpressjournals.org\/doi\/pdf\/10.1162\/qss_a_00021","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,9,25]],"date-time":"2023-09-25T15:10:40Z","timestamp":1695654640000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/qss\/article\/1\/1\/396-413\/15572"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,2]]},"references-count":45,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,2]]}},"alternative-id":["10.1162\/qss_a_00021"],"URL":"https:\/\/doi.org\/10.1162\/qss_a_00021","relation":{},"ISSN":["2641-3337"],"issn-type":[{"value":"2641-3337","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,2]]},"assertion":[{"value":"2019-07-09","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2019-12-10","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-01-23","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}