{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,5,14]],"date-time":"2025-05-14T03:42:45Z","timestamp":1747194165332,"version":"3.40.5"},"reference-count":52,"publisher":"MIT Press","issue":"3","content-domain":{"domain":["www.mitpressjournals.org"],"crossmark-restriction":true},"short-container-title":["Quantitative Science Studies"],"published-print":{"date-parts":[[2020,9]]},"abstract":"<jats:p>Scholarly content has become more difficult to find as information retrieval has devolved from bespoke systems that exploit disciplinary ontologies to keyword search on generic search engines. In parallel, more scholarly content is available through open access mechanisms. These trends have failed to converge in ways that would facilitate text data mining, both for information retrieval and as a research method for the quantitative social sciences. Scholarly content has become open to read without becoming open to mine, due both to constraints by publishers and to lack of attention in scholarly communication. The quantity of available text has grown faster than has the quality. Academic dossier systems are among the means to acquire more quality data for mining. Universities, publishers, and private enterprise may be able to mine these data for strategic purposes, however. On the positive front, changes in copyright may allow more data mining. Privacy, intellectual freedom, and access to knowledge are at stake. The next frontier of activism in open access scholarship is control over content for mining as a means to democratize knowledge.<\/jats:p>","DOI":"10.1162\/qss_a_00053","type":"journal-article","created":{"date-parts":[[2020,6,29]],"date-time":"2020-06-29T11:36:45Z","timestamp":1593430605000},"page":"993-1000","update-policy":"https:\/\/doi.org\/10.1162\/mitpressjournals.corrections.policy","source":"Crossref","is-referenced-by-count":1,"title":["Whose text, whose mining, and to whose benefit?"],"prefix":"10.1162","volume":"1","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9344-1029","authenticated-orcid":true,"given":"Christine L.","family":"Borgman","sequence":"first","affiliation":[{"name":"Director, Center for Knowledge Infrastructures, University of California, Los Angeles"}]}],"member":"281","reference":[{"volume-title":"The copyright wars: Three centuries of trans-Atlantic battle","year":"2014","author":"Baldwin P.","key":"bib1"},{"issue":"5687","key":"bib2","doi-asserted-by":"crossref","first-page":"1110","DOI":"10.1126\/science.1100526","volume":"305","author":"Benkler Y.","year":"2004","journal-title":"Science"},{"key":"bib3","doi-asserted-by":"crossref","DOI":"10.7551\/mitpress\/3131.001.0001","volume-title":"From Gutenberg to the global information infrastructure: Access to information in the networked world","author":"Borgman C. L.","year":"2000"},{"key":"bib4","doi-asserted-by":"crossref","DOI":"10.7551\/mitpress\/7434.001.0001","volume-title":"Scholarship in the digital age: Information, infrastructure, and the internet","author":"Borgman C. L.","year":"2007"},{"key":"bib5","doi-asserted-by":"crossref","DOI":"10.7551\/mitpress\/9963.001.0001","volume-title":"Big data, little data, no data: Scholarship in the networked world","author":"Borgman C. L.","year":"2015"},{"key":"bib6","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1515\/9783110308464-008","volume-title":"Theories of informetrics and scholarly communication","author":"Borgman C. L.","year":"2016"},{"issue":"2","key":"bib7","first-page":"365","volume":"33","author":"Borgman C. L.","year":"2018","journal-title":"Berkeley Technology Law Journal"},{"volume-title":"Presented at the National Forum: Data Mining Research Using In-copyright and Limited-access Text Datasets","year":"2018","author":"Borgman C. L.","key":"bib8"},{"volume-title":"Effective online searching: A basic text","year":"1984","author":"Borgman C. L.","key":"bib9"},{"volume-title":"Open data in a big data world: An international accord","year":"2015","author":"Boulton G.","key":"bib10"},{"key":"bib11","first-page":"981","volume":"28","author":"Cohen J. E.","year":"1996","journal-title":"Connecticut Law Review"},{"journal-title":"Financial Times","year":"2016","author":"Cookson R.","key":"bib12"},{"volume-title":"IMLS National Forum on data mining research using in-copyright and limited-access text datasets: Discussion paper, forum statements, and SWOT analyses","year":"2018","author":"Dickson E.","key":"bib13"},{"key":"bib14","volume":"12","author":"Duguid P.","year":"2007","journal-title":"First Monday"},{"key":"bib16","first-page":"252","volume-title":"Academy & the Internet","author":"Elkin-Koren N.","year":"2004"},{"issue":"1","key":"bib17","volume":"59","author":"Elkin-Koren N.","year":"2017","journal-title":"Arizona Law Review"},{"journal-title":"The Chronicle of Higher Education","year":"2018","author":"Ellis L.","key":"bib18"},{"journal-title":"The Chronicle of Higher Education","year":"2019","author":"Ellis L.","key":"bib19"},{"journal-title":"Science","year":"2016","author":"Enserink M.","key":"bib21"},{"issue":"7359","key":"bib23","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1038\/476145a","volume":"476","author":"Ginsparg P.","year":"2011","journal-title":"Nature"},{"key":"bib24","first-page":"39","volume":"2","author":"Harnad S.","year":"1991","journal-title":"Public-Access Computer Systems Review"},{"issue":"12","key":"bib25","volume":"5","author":"Harnad S.","year":"1999","journal-title":"D-Lib Magazine"},{"issue":"3","key":"bib26","doi-asserted-by":"crossref","DOI":"10.1045\/march2005-harnad","volume":"11","author":"Harnad S.","year":"2005","journal-title":"D-Lib Magazine"},{"volume-title":"Understanding knowledge as a commons: From theory to practice","year":"2007","author":"Hess C.","key":"bib28"},{"volume-title":"The information commons: A public policy report","year":"2004","author":"Kranich N.","key":"bib29"},{"journal-title":"The Scientist","year":"2017","author":"Kwon D.","key":"bib30"},{"issue":"4","key":"bib31","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1087\/20140402","volume":"27","author":"Lammey R.","year":"2014","journal-title":"Learned Publishing"},{"key":"bib32","volume":"13","author":"Leetaru K.","year":"2008","journal-title":"First Monday"},{"volume-title":"The future of ideas: The fate of the commons in a connected world","year":"2001","author":"Lessig L.","key":"bib33"},{"volume-title":"Research data management: Practical strategies for information professionals","year":"2014","author":"Levine M.","key":"bib34"},{"issue":"4","key":"bib35","doi-asserted-by":"crossref","DOI":"10.5210\/fm.v22i4.7414","volume":"22","author":"Lynch C.","year":"2017","journal-title":"First Monday"},{"journal-title":"Inside Higher Ed","year":"2017","author":"McKenzie L.","key":"bib37"},{"volume-title":"Panton principles","year":"2010","author":"Murray-Rust P.","key":"bib38"},{"journal-title":"The Chronicle Review","year":"2010","author":"Nunberg G.","key":"bib39"},{"key":"bib40","volume":"13","author":"O\u2019Sullivan M.","year":"2008","journal-title":"First Monday"},{"volume-title":"Elsevier buys SSRN.com: What it means for scholarly publication","year":"2016","author":"Pike G. H.","key":"bib41"},{"key":"bib42","doi-asserted-by":"crossref","first-page":"e4375","DOI":"10.7717\/peerj.4375","volume":"6","author":"Piwowar H.","year":"2018","journal-title":"PeerJ"},{"volume-title":"ELPUB 2018","year":"2018","author":"Posada A.","key":"bib44"},{"issue":"6422","key":"bib45","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1126\/science.363.6422.11","volume":"363","author":"Rabesandratana T.","year":"2019","journal-title":"Science"},{"volume-title":"Designing the microbial research commons: Strategies for accessing, managing, and using essential public knowledge assets","year":"2009","author":"Reichman J. H.","key":"bib46"},{"key":"bib47","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9781139128957","volume-title":"Governing digitally integrated genetic resources, data, and literature: Global intellectual property strategies for a redesigned microbial research commons","author":"Reichman J. H.","year":"2016"},{"issue":"11","key":"bib48","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1145\/3363179","volume":"62","author":"Samuelson P.","year":"2019","journal-title":"Communications of the ACM"},{"key":"bib49","doi-asserted-by":"crossref","first-page":"183","DOI":"10.2218\/ijdc.v13i1.620","volume":"13","author":"Senseney M.","year":"2018","journal-title":"International Journal of Digital Curation"},{"key":"bib50","doi-asserted-by":"crossref","DOI":"10.7551\/mitpress\/9286.001.0001","volume-title":"Open access","author":"Suber P.","year":"2012"},{"volume-title":"Opening Plenary Session presented at the Coalition for Networked Information (CNI) Spring 2013 Membership Meeting","year":"2013","author":"Van de Sompel H.","key":"bib52"},{"journal-title":"ArXiv:1605.06154 [Cs]","year":"2016","author":"Van de Sompel H.","key":"bib53"},{"issue":"2","key":"bib54","doi-asserted-by":"crossref","first-page":"201","DOI":"10.5860\/crl.78.2.201","volume":"78","author":"Wilkin J. P.","year":"2017","journal-title":"College & Research Libraries"},{"key":"bib55","doi-asserted-by":"crossref","first-page":"160018","DOI":"10.1038\/sdata.2016.18","volume":"3","author":"Wilkinson M. D.","year":"2016","journal-title":"Scientific Data"},{"issue":"3","key":"bib56","doi-asserted-by":"crossref","first-page":"5","DOI":"10.6017\/ital.v33i3.5485","volume":"33","author":"Williams L. A.","year":"2014","journal-title":"Information Technology and Libraries"},{"volume-title":"The access principle: The case for open access to research and scholarship","year":"2006","author":"Willinsky J.","key":"bib57"},{"issue":"2","key":"bib58","doi-asserted-by":"crossref","first-page":"196","DOI":"10.1353\/lib.2018.0033","volume":"67","author":"Willinsky J.","year":"2018","journal-title":"Library Trends"},{"journal-title":"The Scientist","year":"2018","author":"Yeager A.","key":"bib59"}],"container-title":["Quantitative Science Studies"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mitpressjournals.org\/doi\/pdf\/10.1162\/qss_a_00053","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,8]],"date-time":"2024-08-08T18:10:07Z","timestamp":1723140607000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/qss\/article\/1\/3\/993-1000\/96095"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,9]]},"references-count":52,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2020,9]]}},"alternative-id":["10.1162\/qss_a_00053"],"URL":"https:\/\/doi.org\/10.1162\/qss_a_00053","relation":{},"ISSN":["2641-3337"],"issn-type":[{"type":"electronic","value":"2641-3337"}],"subject":[],"published":{"date-parts":[[2020,9]]},"assertion":[{"value":"2020-08-17","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}