{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,6]],"date-time":"2026-04-06T23:59:31Z","timestamp":1775519971160,"version":"3.50.1"},"reference-count":41,"publisher":"Emerald","issue":"5","license":[{"start":{"date-parts":[[2024,6,4]],"date-time":"2024-06-04T00:00:00Z","timestamp":1717459200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["JD"],"published-print":{"date-parts":[[2024,9,3]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title><jats:p>Community-generated digital content (CGDC) is one of the UK\u2019s prime cultural assets. However, CGDC is currently \u201ccritically endangered\u201d (Digital Preservation Coalition, 2021) due to technological and organisational barriers and has proven resistant to traditional methods of linking and integration. The challenge of integrating CGDC into larger archives has effectively silenced diverse community voices within our national collection. Our Heritage, Our Stories (OHOS), funded by the UK\u2019s AHRC programme Towards a National Collection, responds to these urgent challenges by bringing together cutting-edge approaches from cultural heritage, humanities and computer science.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title><jats:p>Existing solutions to CGDC integration, involving bespoke interventionist activities, are expensive, time-consuming and unsustainable at scale, while unsophisticated computational integration erases the meaning and purpose of both CGDC and its creators. Using innovative multidisciplinary methods, AI tools and a co-design process, previously unfindable and unlinkable CGDC will be made discoverable in our virtual national collection.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Findings<\/jats:title><jats:p>There currently exists a range of disconnected, fragile and under-represented community-generated heritage which is at increasing risk of loss. Therefore, OHOS will work to ensure the survival and preservation of these nationally important resources, for the future and for our shared national collection.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title><jats:p>As we dissolve barriers to create meaningful new links across CGDC collections and develop new methods of engagement, OHOS will also make this content accessible to new and diverse audiences. This will facilitate a wealth of fresh research while also embedding new strategies for future management of CGDC into heritage practice and training and fostering newly enriching, robust connections between communities and archival institutions.<\/jats:p><\/jats:sec>","DOI":"10.1108\/jd-03-2024-0057","type":"journal-article","created":{"date-parts":[[2024,6,1]],"date-time":"2024-06-01T05:50:55Z","timestamp":1717221055000},"page":"1133-1147","source":"Crossref","is-referenced-by-count":10,"title":["<i>Our Heritage, Our Stories<\/i>: developing AI tools to link and support community-generated digital cultural heritage"],"prefix":"10.1108","volume":"80","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6657-4205","authenticated-orcid":false,"given":"Ewan D.","family":"Hannaford","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6391-2950","authenticated-orcid":false,"given":"Viktor","family":"Schlegel","sequence":"additional","affiliation":[]},{"given":"Rhiannon","family":"Lewis","sequence":"additional","affiliation":[]},{"given":"Stefan","family":"Ramsden","sequence":"additional","affiliation":[]},{"given":"Jenny","family":"Bunn","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2112-0963","authenticated-orcid":false,"given":"John","family":"Moore","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6337-2632","authenticated-orcid":false,"given":"Marc","family":"Alexander","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8907-7914","authenticated-orcid":false,"given":"Hannah","family":"Barker","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6693-7531","authenticated-orcid":false,"given":"Riza","family":"Batista-Navarro","sequence":"additional","affiliation":[]},{"given":"Lorna","family":"Hughes","sequence":"additional","affiliation":[]},{"given":"Goran","family":"Nenadic","sequence":"additional","affiliation":[]}],"member":"140","published-online":{"date-parts":[[2024,6,4]]},"reference":[{"key":"key2024083106454246900_ref001","volume-title":"Text Mining for Biology and Biomedicine","year":"2006"},{"issue":"4","key":"key2024083106454246900_ref002","doi-asserted-by":"publisher","first-page":"669","DOI":"10.1515\/jisys-2017-0225","article-title":"Extracting conceptual relationships and inducing concept lattices","volume":"28","year":"2019","journal-title":"Unstructured Text. Journal of Intelligent Systems"},{"key":"key2024083106454246900_ref042","first-page":"722","volume-title":"The Semantic Web, Lecture Notes in Computer Science","year":"2007"},{"key":"key2024083106454246900_ref003","first-page":"2670","article-title":"Open information extraction from the web","year":"2007"},{"issue":"2","key":"key2024083106454246900_ref004","doi-asserted-by":"publisher","first-page":"1","DOI":"10.5153\/sro.2590","article-title":"Revisiting the Archives: a case study from the history of geriatric medicine","volume":"17","year":"2012","journal-title":"Sociological Research Online"},{"issue":"1","key":"key2024083106454246900_ref005","doi-asserted-by":"publisher","first-page":"12","DOI":"10.1002\/(sici)1097-4571(199401)45:1<12::aid-asi2>3.0.co;2-l","article-title":"The relationship between recall and precision","volume":"45","year":"1994","journal-title":"Journal of the American Society for Information Science"},{"key":"key2024083106454246900_ref006","volume-title":"Nothing about Us without Us: Disability Oppression and Empowerment","year":"1998"},{"key":"key2024083106454246900_ref007","doi-asserted-by":"publisher","first-page":"8440","DOI":"10.18653\/v1\/2020.acl-main.747","article-title":"Unsupervised cross-lingual representation learning at scale","year":"2020"},{"issue":"2","key":"key2024083106454246900_ref008","first-page":"3","article-title":"\u2018Editorial\u2019, special issue on Qualitative archiving and data sharing scheme (QUADS) projects","volume":"1","year":"2006","journal-title":"Methodological Innovations Online"},{"key":"key2024083106454246900_ref009","doi-asserted-by":"publisher","first-page":"1","DOI":"10.3389\/frai.2021.609970","article-title":"Registerial adaptation vs. innovation across situational contexts: 18th Century women in transition","volume":"4","year":"2021","journal-title":"Frontiers in Artificial Intelligence"},{"key":"key2024083106454246900_ref010","unstructured":"Digital Preservation Coalition (2021), \u201cThe BitList 2021\u201d, doi: 10.7207\/dpcbitlist21-01, available at: https:\/\/www.dpconline.org\/docs\/miscellaneous\/advocacy\/wdpd\/2521-bitlist2021\/file (accessed 10 February 2022)."},{"issue":"2","key":"key2024083106454246900_ref044","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3604931","article-title":"Named entity recognition and classification in historical documents: a survey","volume":"56","year":"2023","journal-title":"ACM Computing Surveys"},{"issue":"2","key":"key2024083106454246900_ref012","doi-asserted-by":"publisher","first-page":"15","DOI":"10.3828\/archives.2020.10","article-title":"The historical Manuscripts commission: an archival evolution","volume":"55","year":"2020","journal-title":"Archives"},{"key":"key2024083106454246900_ref013","first-page":"37","article-title":"The rewards of using archived oral histories in research: the case of the millennium memory bank","volume":"37","year":"2013","journal-title":"Oral History"},{"key":"key2024083106454246900_ref014","doi-asserted-by":"crossref","unstructured":"Greenhalgh, C. (2020), \u201cSocial surveys\u201d, in Dobson, M. and Ziemann, B. (Eds), Reading Primary Sources: the Interpretation of Texts from Nineteenth and Twentieth Century History, Routledge, London, pp.\u00a0117-137.","DOI":"10.4324\/9780429401916-6"},{"key":"key2024083106454246900_ref015","volume-title":"Reflections on the Centenary of the First World War: Learning and Legacies for the Future","year":"2021"},{"key":"key2024083106454246900_ref016","doi-asserted-by":"publisher","first-page":"2545","DOI":"10.18653\/v1\/2021.naacl-main.201","article-title":"A survey on recent approaches for Natural Language Processing in low-resource scenarios","year":"2021"},{"issue":"6","key":"key2024083106454246900_ref017","doi-asserted-by":"publisher","first-page":"1223","DOI":"10.1108\/jd-02-2021-0032","article-title":"Named-entity recognition for early modern textual documents: a review of capabilities and challenges with strategies for the future","volume":"77","year":"2021","journal-title":"Journal of Documentation"},{"key":"key2024083106454246900_ref018","volume-title":"Digital Sustainability Review of HLF Funded Projects","year":"2019"},{"issue":"2","key":"key2024083106454246900_ref019","doi-asserted-by":"publisher","first-page":"567","DOI":"10.1017\/s0018246x15000515","article-title":"Inventing the \u2018traditional working class\u2019: a re-analysis of interview notes from Young and Willmott's Family and kinship in East London","volume":"59","year":"2016","journal-title":"The Historical Journal"},{"issue":"3","key":"key2024083106454246900_ref020","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1145\/3236386.3241340","article-title":"The Mythos of Model Interpretability: in machine learning, the concept of interpretability is both important and slippery","volume":"16","year":"2018","journal-title":"Queue"},{"issue":"1","key":"key2024083106454246900_ref021","doi-asserted-by":"publisher","first-page":"234","DOI":"10.1093\/hwj\/dbv017","article-title":"Sedimented histories: connections, collaborations and coproduction in regional history","volume":"80","year":"2015","journal-title":"History Workshop Journal"},{"issue":"2","key":"key2024083106454246900_ref022","doi-asserted-by":"publisher","first-page":"255","DOI":"10.3233\/sw-180333","article-title":"Information extraction meets the semantic web: a survey","volume":"11","year":"2020","journal-title":"Semantic Web"},{"key":"key2024083106454246900_ref023","article-title":"Efficient estimation of word representations in vector space","year":"2013"},{"issue":"5","key":"key2024083106454246900_ref024","doi-asserted-by":"publisher","first-page":"544","DOI":"10.1136\/amiajnl-2011-000464","article-title":"Natural language processing: an introduction","volume":"18","year":"2011","journal-title":"Journal of the American Medical Informatics Association"},{"key":"key2024083106454246900_ref025","doi-asserted-by":"publisher","first-page":"190","DOI":"10.1145\/3371158.3371183","article-title":"Co-clustering triples from open information extraction","year":"2020"},{"issue":"2","key":"key2024083106454246900_ref026","doi-asserted-by":"publisher","first-page":"315","DOI":"10.1017\/jbr.2019.291","article-title":"\u2018The people who write to us are the people who don't like us\u2019: class, gender, and citizenship in the survey of sickness, 1943-1952","volume":"59","year":"2020","journal-title":"Journal of British Studies"},{"issue":"1","key":"key2024083106454246900_ref027","doi-asserted-by":"publisher","first-page":"32","DOI":"10.5920\/idp.2015.1132","article-title":"The co-production of historical knowledge: implications for the history of identities","volume":"1","year":"2015","journal-title":"Identity Papers: A Journal of British and Irish Studies"},{"key":"key2024083106454246900_ref028","article-title":"Why do we digitize? The case for slow digitization","year":"2018"},{"key":"key2024083106454246900_ref029","doi-asserted-by":"publisher","first-page":"182","DOI":"10.1145\/3328243.3328268","article-title":"Designing for intelligence: user-centred design in the age of algorithms","year":"2019"},{"key":"key2024083106454246900_ref030","first-page":"1167","article-title":"Neural relation classification with text descriptions","year":"2018"},{"key":"key2024083106454246900_ref031","volume-title":"Theatres of Memory: Past and Present in Contemporary Culture","year":"1994"},{"issue":"3","key":"key2024083106454246900_ref032","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s10664-020-09933-5","article-title":"An automated framework for the extraction of semantic legal metadata from legal texts","volume":"26","year":"2021","journal-title":"Empirical Software Engineering"},{"key":"key2024083106454246900_ref033","volume-title":"Dust","year":"2000"},{"key":"key2024083106454246900_ref043","first-page":"102","volume-title":"Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics","year":"2012"},{"key":"key2024083106454246900_ref034","doi-asserted-by":"publisher","first-page":"2920","DOI":"10.18653\/v1\/2020.acl-main.263","article-title":"It's morphin\u2019 time! Combating linguistic discrimination with inflectional perturbations","year":"2020"},{"key":"key2024083106454246900_ref035","unstructured":"Thibeaud, C. (2001), \u201cAccess to archives: England's contribution to the national archive network\u201d, available at: http:\/\/www.ariadne.ac.uk\/issue\/30\/archives\/ (accessed February 10 2022)"},{"issue":"2","key":"key2024083106454246900_ref036","doi-asserted-by":"publisher","first-page":"262","DOI":"10.1093\/llc\/fqt067","article-title":"Exploring entity recognition and disambiguation for cultural heritage collections","volume":"30","year":"2015","journal-title":"Digital Scholarship in the Humanities"},{"issue":"10","key":"key2024083106454246900_ref041","doi-asserted-by":"publisher","first-page":"78","DOI":"10.1145\/2629489","article-title":"Wikidata: a free collaborative knowledgebase","volume":"57","year":"2014","journal-title":"Communications of the ACM"},{"issue":"3","key":"key2024083106454246900_ref037","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3386252","article-title":"Generalizing from a few examples: a survey on few-shot learning","volume":"53","year":"2020","journal-title":"ACM Computing Surveys (CSUR)"},{"key":"key2024083106454246900_ref038","doi-asserted-by":"publisher","first-page":"146","DOI":"10.1080\/00437956.1954.11659520","article-title":"Distributional structure","volume":"10 Nos 2-3","year":"1954","journal-title":"WORD"}],"container-title":["Journal of Documentation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/JD-03-2024-0057\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/JD-03-2024-0057\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T22:33:42Z","timestamp":1753396422000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/jd\/article\/80\/5\/1133-1147\/1236187"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,6,4]]},"references-count":41,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2024,6,4]]},"published-print":{"date-parts":[[2024,9,3]]}},"alternative-id":["10.1108\/JD-03-2024-0057"],"URL":"https:\/\/doi.org\/10.1108\/jd-03-2024-0057","relation":{},"ISSN":["0022-0418"],"issn-type":[{"value":"0022-0418","type":"print"}],"subject":[],"published":{"date-parts":[[2024,6,4]]}}}