{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,22]],"date-time":"2026-01-22T06:02:54Z","timestamp":1769061774833,"version":"3.49.0"},"reference-count":63,"publisher":"Emerald","issue":"2","license":[{"start":{"date-parts":[[2022,7,1]],"date-time":"2022-07-01T00:00:00Z","timestamp":1656633600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["JD"],"published-print":{"date-parts":[[2023,3,6]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title><jats:p>Large cultural heritage datasets from museum collections tend to be biased and demonstrate omissions that result from a series of decisions at various stages of the collection construction. The purpose of this study is to apply a set of ethical criteria to compare the level of bias of six online databases produced by two major art museums, identifying the most biased and the least biased databases.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title><jats:p>At the first stage, the relevant data have been automatically extracted from all six databases and mapped to a unified ontological scheme based on Wikidata. Then, the authors applied ethical criteria to the results of the geographical distribution of records provided by two major art museums as online databases accessed via museums' websites, API datasets and datasets submitted to Wikidata.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Findings<\/jats:title><jats:p>The authors show that the museums use different artworks in each of its online databases and each database has different types of bias reflected by the study variables, such as artworks' country of origin or the creator's nationality. 
For most variables, the database behind the online search system on the museum's website is more balanced and ethical than the API dataset and Wikidata databases of the two museums.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title><jats:p>By applying ethical criteria to the analysis of cultural bias in various museum databases aimed at different audiences including end users, researchers and commercial institutions, this paper shows the importance of explicating bias and maintaining integrity in cultural heritage representation through different channels that potentially have high impact on how culture is perceived, disseminated, contextualized and transformed.<\/jats:p><\/jats:sec>","DOI":"10.1108\/jd-02-2022-0047","type":"journal-article","created":{"date-parts":[[2022,6,30]],"date-time":"2022-06-30T03:55:09Z","timestamp":1656561309000},"page":"320-340","source":"Crossref","is-referenced-by-count":13,"title":["What do they make us see: a\u00a0comparative study of cultural bias\u00a0in online databases of two large museums"],"prefix":"10.1108","volume":"79","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9161-2541","authenticated-orcid":false,"given":"Maayan","family":"Zhitomirsky-Geffet","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0775-9656","authenticated-orcid":false,"given":"Inna","family":"Kizhner","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2167-2469","authenticated-orcid":false,"given":"Sara","family":"Minster","sequence":"additional","affiliation":[]}],"member":"140","published-online":{"date-parts":[[2022,7,1]]},"reference":[{"key":"key2023030414461613300_ref001","first-page":"74","article-title":"Measuring the effects of bias in training data for literary classification","year":"2020"},{"key":"key2023030414461613300_ref002","volume-title":"The Birth of the Museum: History, Theory, 
Politics","year":"1995"},{"key":"key2023030414461613300_ref003","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1111\/j.1548-1379.2010.01107.x","article-title":"Neocolonial collaboration: museum as contact zone revisited","volume":"34","year":"2011","journal-title":"Museum Anthropology"},{"key":"key2023030414461613300_ref004","volume-title":"A World of Fiction: Digital Collections and the Future of Literary History","year":"2018"},{"issue":"1","key":"key2023030414461613300_ref005","first-page":"95","article-title":"Why you can't model away bias","volume":"8","year":"2020","journal-title":"Modern Language Quarterly"},{"issue":"8","key":"key2023030414461613300_ref060","doi-asserted-by":"crossref","first-page":"888","DOI":"10.1002\/asi.24172","article-title":"Digital data archives as knowledge infrastructures: mediating data sharing and reuse","volume":"70","year":"2019","journal-title":"Journal of the Association for Information Science and Technology"},{"key":"key2023030414461613300_ref006","volume-title":"Sorting Things Out: Classification and its Consequences","year":"2000"},{"issue":"4","key":"key2023030414461613300_ref007","doi-asserted-by":"crossref","first-page":"522","DOI":"10.1080\/03080188.2021.1872874","article-title":"Can I believe what I see? Data visualization and trust in the humanities","volume":"46","year":"2021","journal-title":"Interdisciplinary Science Reviews"},{"key":"key2023030414461613300_ref008","unstructured":"Brett, D. (1994), \u201cThe representation of culture\u201d, in Kockel, U. (Ed.), Culture, Tourism and Development: The Case of Ireland, Liverpool University Press, Liverpool, p. 117."},{"key":"key2023030414461613300_ref009","unstructured":"Ceitnic, E. and She, J. 
(2021), \u201cUnderstanding and creating art with AI: review and outlook\u201d, arXiv:2102.09109, available at: https:\/\/arxiv.org\/abs\/2102.09109 (accessed 19 February 2022)."},{"key":"key2023030414461613300_ref010","article-title":"Engaging the data science community with met open access API","year":"2020","journal-title":"Blogs\/Now at the Met. The Metropolitan Museum of Art"},{"key":"key2023030414461613300_ref011","volume-title":"Syntactic Structures","year":"1957"},{"key":"key2023030414461613300_ref012","doi-asserted-by":"crossref","unstructured":"Clear, J. (1992), \u201cCorpus sampling\u201d, in Leitner, G. (Ed.), New Directions in English Language Corpora, Mouton-de-Gruyter, Berlin, pp.\u00a021-31.","DOI":"10.1515\/9783110878202.21"},{"key":"key2023030414461613300_ref013","doi-asserted-by":"crossref","first-page":"939","DOI":"10.1111\/nph.14855","article-title":"Widespread sampling biases in herbaria revealed from large-scale digitization","volume":"217","year":"2018","journal-title":"New Phytologist"},{"key":"key2023030414461613300_ref062","article-title":"(Version 0.51) Data model for the Swiss performing art platform","year":"2017"},{"key":"key2023030414461613300_ref063","article-title":"Why categorizing native art as \u2018traditional\u2019 and \u2018contemporary\u2019 is toxic","volume-title":"First American Art Magazine","year":"2020"},{"key":"key2023030414461613300_ref014","first-page":"259","article-title":"Hierarchical indexing and documents matching in BoW","year":"2001"},{"issue":"6","key":"key2023030414461613300_ref015","doi-asserted-by":"crossref","first-page":"1190","DOI":"10.1108\/JD-12-2017-0169","article-title":"Leveraging collective intelligence: from univocal to multivocal representation of cultural heritage","volume":"74","year":"2018","journal-title":"Journal of 
Documentation"},{"issue":"1","key":"key2023030414461613300_ref016","doi-asserted-by":"crossref","first-page":"9","DOI":"10.2752\/147800413X13515292098070","article-title":"Confronting the digital","volume":"10","year":"2013","journal-title":"Cultural and Social History"},{"issue":"1","key":"key2023030414461613300_ref017","doi-asserted-by":"crossref","first-page":"187","DOI":"10.3233\/SW-190386","article-title":"Using the semantic web in digital humanities: shift from data publishing to data-analysis and serendipidous knowledge discovery","volume":"11","year":"2020","journal-title":"Semantic Web"},{"key":"key2023030414461613300_ref018","doi-asserted-by":"crossref","first-page":"187","DOI":"10.5771\/0943-7444-2017-3-187","article-title":"Implications of big data for knowledge organization","volume":"44","year":"2017","journal-title":"Knowledge Organization"},{"key":"key2023030414461613300_ref019","article-title":"The impact of open access on galleries, libraries, museums and archives","year":"2016","journal-title":"Smithsonian Emerging Leaders Development Program"},{"key":"key2023030414461613300_ref020","volume-title":"Images of Works of Art in Museum Collections: The Experience of Open Access","year":"2013"},{"issue":"3","key":"key2023030414461613300_ref021","doi-asserted-by":"crossref","first-page":"607","DOI":"10.1093\/llc\/fqaa055","article-title":"Digital cultural colonialism: measuring bias in aggregated digitized content held in Google arts and culture","volume":"36","year":"2021","journal-title":"Digital Scholarship in the Humanities"},{"issue":"2","key":"key2023030414461613300_ref022","doi-asserted-by":"crossref","first-page":"350","DOI":"10.1093\/llc\/fqy035","article-title":"Accessing Russian culture online: the scope of digitization in museums across Russia","volume":"34","year":"2019","journal-title":"Digital Scholarship in the Humanities"},{"key":"key2023030414461613300_ref023","article-title":"The culture of very rich and very poor: do digital museum 
collections tell us anything about Jewish culture","volume-title":"Jewish Studies in the Digital Age","year":""},{"key":"key2023030414461613300_ref024","article-title":"Show your work: parsons students design stunning data visualizations with Met open access API","year":"2020","journal-title":"Blogs\/Now at the Met. The Metropolitan Museum of Art"},{"issue":"3","key":"key2023030414461613300_ref025","first-page":"191","article-title":"Museum website features, aesthetics and visitors' impressions: a case study of four museums","volume":"3","year":"2015","journal-title":"Museum Management and Curatorship"},{"issue":"22","key":"key2023030414461613300_ref026","first-page":"13","article-title":"Legacies of prejudice: racism, coproduction and radical trust in the museum","volume":"25","year":"2010","journal-title":"Museum Management and Curatorship"},{"key":"key2023030414461613300_ref027","article-title":"Data here and there: studying web archives research infrastructure in Danish and Canadian settings","year":"2021"},{"issue":"8","key":"key2023030414461613300_ref028","doi-asserted-by":"crossref","first-page":"1515","DOI":"10.1002\/asi.23061","article-title":"Archaeology of a digitization","volume":"65","year":"2014","journal-title":"Journal of the Association for Information Science and Technology"},{"key":"key2023030414461613300_ref029","volume-title":"Cultural Analytics","year":"2020"},{"key":"key2023030414461613300_ref030","volume-title":"Survey of GLAM Open Access Policy and Practice","year":"2022"},{"key":"key2023030414461613300_ref031","article-title":"About the met","author":"Metropolitan Museum of Art","year":"2022","journal-title":"The Met"},{"key":"key2023030414461613300_ref032","doi-asserted-by":"crossref","first-page":"141","DOI":"10.1016\/S0306-4573(01)00020-6","article-title":"Assessing bias in search engines","volume":"38","year":"2002","journal-title":"Information Processing and 
Management"},{"key":"key2023030414461613300_ref033","first-page":"717","article-title":"Psychology as a historical science","volume":"72","year":"2020","journal-title":"Annual Review of Psychology"},{"key":"key2023030414461613300_ref034","unstructured":"National Conference of State Legislatures (NCSL) (2020), \u201cFederal and state recognized tribes\u201d, available at: https:\/\/www.ncsl.org\/legislators-staff\/legislators\/quad-caucus\/list-of-federal-and-state-recognized-tribes.aspx (accessed 15 December 2021)."},{"key":"key2023030414461613300_ref035","unstructured":"Nguen, V. and Kim, S. (2021), \u201cCleaning and structuring the label space of the iMet collection 2020\u201d, arXiv:2106.00815, ArXiv, available at: https:\/\/arxiv.org\/abs\/2106.00815 (accessed 19 February 2019)."},{"key":"key2023030414461613300_ref036","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1016\/j.jecp.2017.04.017","article-title":"The persistent sampling bias in developmental psychology: a call to action","volume":"162","year":"2017","journal-title":"Journal of Experimental Child Psychology"},{"issue":"Suppl. 
22","key":"key2023030414461613300_ref037","first-page":"289","article-title":"The crying child: on colonial archives, digitization, and ethics of care in the cultural commons","volume":"61","year":"2020","journal-title":"Current Anthropology"},{"key":"key2023030414461613300_ref038","article-title":"Encoding the haunting of an object catalogue: on the potential of digital technologies to perpetuate or subvert the silence and bias of the early-modern archive","year":"2021","journal-title":"Digital Scholarship in the Humanities"},{"issue":"6","key":"key2023030414461613300_ref039","doi-asserted-by":"crossref","first-page":"296","DOI":"10.5860\/crln.79.6.296","article-title":"Collections as data: implications for enclosure","volume":"79","year":"2018","journal-title":"College and Research Libraries News"},{"key":"key2023030414461613300_ref040","volume-title":"Museums and Source Communities","year":"2003"},{"key":"key2023030414461613300_ref041","first-page":"145","article-title":"Colonial history and global economics distort our understanding of deep-time biodiversity","volume":"6","year":"2022","journal-title":"Nature Ecology and Evolution"},{"key":"key2023030414461613300_ref042","volume-title":"Vision and Mission","author":"Rijksmuseum","year":"2022"},{"key":"key2023030414461613300_ref043","first-page":"665","article-title":"Analyzing race and citizenship bias in Wikidata","year":"2021"},{"key":"key2023030414461613300_ref044","first-page":"41","article-title":"Biases in generative art: a causal look from the lens of art history","year":"2021"},{"key":"key2023030414461613300_ref045","article-title":"Scaling the mission. The met collection API","year":"2018","journal-title":"Blogs\/Now at the Met. 
The Metropolitan Museum of Art"},{"issue":"1","key":"key2023030414461613300_ref046","doi-asserted-by":"publisher","DOI":"10.1177\/20539517211006165","article-title":"The value of mass-digitised cultural heritage content in creative contexts","volume":"8","year":"2021","journal-title":"Big Data and Society"},{"key":"key2023030414461613300_ref047","doi-asserted-by":"crossref","unstructured":"Tzouganatou, A. (2021), \u201cOn complexity of GLAM's digital ecosystem: APIs as change makers for opening up knowledge\u201d, in Rauterberg, M. (Ed.), Culture and Computing. Design Thinking and Cultural Computing, Springer International Publishing, Cham, pp.\u00a0348-359.","DOI":"10.1007\/978-3-030-77431-8_22"},{"key":"key2023030414461613300_ref048","volume-title":"Creating a Global Museum: UVA School of Data Science Capstone Project","author":"UVA Data Science","year":"2020"},{"issue":"1","key":"key2023030414461613300_ref049","first-page":"1","article-title":"How open is OpenGLAM? Identifying barriers to commercial and non-commercial reuse of digitised art images","volume":"76","year":"2020","journal-title":"Journal of Documentation"},{"issue":"8","key":"key2023030414461613300_ref050","article-title":"Materials in paintings (MIP): an interdisciplinary dataset for perception, art history and computer vision","volume":"16","year":"2021","journal-title":"PLoS ONE"},{"issue":"2","key":"key2023030414461613300_ref051","doi-asserted-by":"crossref","first-page":"233","DOI":"10.1080\/10645578.2019.1668679","article-title":"Museum collections and online users: development of a segmentation model for the Metropolitan Museum of Art","volume":"22","year":"2019","journal-title":"Visitor Studies"},{"issue":"3","key":"key2023030414461613300_ref052","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3446621","article-title":"Computer vision tagging the Metropolitan Museum of Art's collection: a comparison of three systems","volume":"14","year":"2021","journal-title":"Journal on 
Computing and Cultural Heritage"},{"key":"key2023030414461613300_ref053","volume-title":"Networks of Empire: Forced Migration in the Dutch East India Company","year":"2009"},{"key":"key2023030414461613300_ref054","article-title":"Cultivating APIs in the cultural heritage sector: lessons from an internship at Europeana","year":"2018"},{"key":"key2023030414461613300_ref055","unstructured":"Ypsilantis, N.-A., Garcia, N., Han, G., Ibrahimi, S., Van Noord, N. and Tolias, G. (2022), \u201cThe met dataset: instance-level recognition for artworks\u201d, arXiv:2202.01747, ArXiv, available at: https:\/\/arxiv.org\/abs\/2202.01747 (accessed 14 February 2022)."},{"key":"key2023030414461613300_ref056","unstructured":"Zhang, C., Kaeser-Chen, C., Vesom, G., Choi, J., Kessler, M. and Belongie, S. (2019), \u201cThe iMet collection 2019 challenge dataset\u201d, arXiv:1906.00901, available at: https:\/\/vision.cornell.edu\/se3\/wp-content\/uploads\/2019\/06\/iMet2019.pdf (accessed 19 February 2022)."},{"issue":"5","key":"key2023030414461613300_ref057","doi-asserted-by":"crossref","first-page":"1124","DOI":"10.1108\/JD-10-2018-0163","article-title":"Towards a diversified knowledge organization system \u2013 an open network of inter-linked subsystems with multiple validity scopes","volume":"75","year":"2019","journal-title":"Journal of Documentation"},{"issue":"6","key":"key2023030414461613300_ref058","doi-asserted-by":"crossref","first-page":"1459","DOI":"10.1108\/JD-04-2020-0053","article-title":"A new framework for ethical creation and evaluation of multi-perspective knowledge organization systems","volume":"76","year":"2020","journal-title":"Journal of Documentation"},{"key":"key2023030414461613300_ref061","doi-asserted-by":"publisher","volume-title":"Frontiers in Digital Humanities","year":"2016","DOI":"10.3389\/fdigh.2016.00003"},{"key":"key2023030414461613300_ref059","volume-title":"Interview with Managing Editor of Big Data and 
Society","year":"2022"}],"container-title":["Journal of Documentation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/JD-02-2022-0047\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/JD-02-2022-0047\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T22:33:26Z","timestamp":1753396406000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/jd\/article\/79\/2\/320-340\/221764"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7,1]]},"references-count":63,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2022,7,1]]},"published-print":{"date-parts":[[2023,3,6]]}},"alternative-id":["10.1108\/JD-02-2022-0047"],"URL":"https:\/\/doi.org\/10.1108\/jd-02-2022-0047","relation":{},"ISSN":["0022-0418"],"issn-type":[{"value":"0022-0418","type":"print"}],"subject":[],"published":{"date-parts":[[2022,7,1]]}}}