{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,19]],"date-time":"2026-03-19T08:41:41Z","timestamp":1773909701619,"version":"3.50.1"},"reference-count":29,"publisher":"Emerald","issue":"1","license":[{"start":{"date-parts":[[2017,12,20]],"date-time":"2017-12-20T00:00:00Z","timestamp":1513728000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["LHT"],"published-print":{"date-parts":[[2018,2,7]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title><jats:p>Linking libraries and Wikipedia can significantly improve the quality of services provided by these two major silos of knowledge. Such linkage would enrich the quality of Wikipedia articles and at the same time increase the visibility of library resources. To this end, the purpose of this paper is to describe the design and development of a software system for automatic mapping of FAST subject headings, used to index library materials, to their corresponding articles in Wikipedia.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title><jats:p>The proposed system works by first detecting all the candidate Wikipedia concepts (articles) occurring in the titles of the books and other library materials which are indexed with a given FAST subject heading. This is then followed by training and deploying a machine learning (ML) algorithm designed to automatically identify those concepts that correspond to the FAST heading. In specific, the ML algorithm used is a binary classifier which classifies the candidate concepts into either \u201ccorresponding\u201d or \u201cnon-corresponding\u201d categories. The classifier is trained to learn the characteristics of those candidates which have the highest probability of belonging to the \u201ccorresponding\u201d category based on a set of 14 positional, statistical, and semantic features.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Findings<\/jats:title><jats:p>The authors have assessed the performance of the developed system using standard information retrieval measures of precision, recall, and<jats:italic>F<\/jats:italic>-score on a data set containing 170 FAST subject headings manually mapped to their corresponding Wikipedia articles. The evaluation results show that the developed system is capable of achieving<jats:italic>F<\/jats:italic>-scores as high as 0.65 and 0.99 in the corresponding and non-corresponding categories, respectively.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Research limitations\/implications<\/jats:title><jats:p>The size of the data set used to evaluate the performance of the system is rather small. However, the authors believe that the developed data set is large enough to demonstrate the feasibility and scalability of the proposed approach.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Practical implications<\/jats:title><jats:p>The sheer size of English Wikipedia makes the manual mapping of Wikipedia articles to library subject headings a very labor-intensive and time-consuming task. Therefore, the aim is to reduce the cost of such mapping and integration.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Social implications<\/jats:title><jats:p>The proposed mapping paves the way for connecting libraries and Wikipedia as two major silos of knowledge, and enables the bi-directional movement of users between the two.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title><jats:p>To the best of the authors\u2019 knowledge, the current work is the first attempt at automatic mapping of Wikipedia to a library-controlled vocabulary.<\/jats:p><\/jats:sec>","DOI":"10.1108\/lht-04-2017-0066","type":"journal-article","created":{"date-parts":[[2017,12,20]],"date-time":"2017-12-20T04:31:38Z","timestamp":1513744298000},"page":"57-74","source":"Crossref","is-referenced-by-count":6,"title":["Improving the visibility of library resources via mapping library subject headings to Wikipedia articles"],"prefix":"10.1108","volume":"36","author":[{"given":"Arash","family":"Joorabchi","sequence":"first","affiliation":[]},{"given":"Abdulhussain E.","family":"Mahdi","sequence":"additional","affiliation":[]}],"member":"140","published-online":{"date-parts":[[2017,12,20]]},"reference":[{"issue":"4","key":"key2021041509195786800_ref001","doi-asserted-by":"crossref","first-page":"658","DOI":"10.1016\/j.ipm.2015.12.011","article-title":"Influence of human behavior and the principle of least effort on library and information science research","volume":"52","year":"2016","journal-title":"Information Processing & Management"},{"key":"key2021041509195786800_ref002","unstructured":"De Rosa, C. (2005), \u201cPerceptions of libraries and information resources: a report to the OCLC membership\u201d, Online Computer Library Center (OCLC), Dublin, OH."},{"issue":"1-2","key":"key2021041509195786800_ref003","doi-asserted-by":"crossref","first-page":"331","DOI":"10.1300\/J104v39n01_03","article-title":"FAST: development of simplified headings for metadata","volume":"39","year":"2004","journal-title":"Cataloging & Classification Quarterly"},{"key":"key2021041509195786800_ref004","doi-asserted-by":"crossref","unstructured":"Deveaud, R., Sanjuan, E. and Bellot, P. (2012), \u201cSocial recommendation and external resources for book search\u201d, in Geva, S., Kamps, J. and Schenkel, R. (Eds), Focused Retrieval of Content and Structure: 10th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2011, Saarbr\u00fccken, December 12-14, 2011, Revised Selected Papers, Springer Berlin and Heidelberg, Berlin and Heidelberg, pp. 68-79.","DOI":"10.1007\/978-3-642-35734-3_5"},{"issue":"1","key":"key2021041509195786800_ref005","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1080\/13614560600774313","article-title":"Automated subject classification of textual web pages, based on a controlled vocabulary: challenges and recommendations","volume":"12","year":"2006","journal-title":"New Review of Hypermedia and Multimedia"},{"issue":"1","key":"key2021041509195786800_ref006","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1145\/1656274.1656278","article-title":"The Weka data mining software: an update","volume":"11","year":"2009","journal-title":"SIGKDD Explorations"},{"issue":"2","key":"key2021041509195786800_ref007","doi-asserted-by":"crossref","first-page":"203","DOI":"10.2190\/NV6E-FN3N-7NGN-TWQT","article-title":"To attract or to inform: what are titles for?","volume":"35","year":"2005","journal-title":"Journal of Technical Writing and Communication"},{"key":"key2021041509195786800_ref008","article-title":"Improving access to large-scale digital libraries through semantic-enhanced search and disambiguation","year":"2015"},{"key":"key2021041509195786800_ref009","unstructured":"Hulth, A. (2004), \u201cCombining machine learning and natural language processing for automatic keyword extraction\u201d, PhD thesis, Stockholm University, Stockholm."},{"key":"key2021041509195786800_ref010","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1177\/0165551513514932","article-title":"Towards linking libraries and Wikipedia: automatic subject indexing of library records with Wikipedia concepts","volume":"40","year":"2014","journal-title":"Journal of Information Science"},{"key":"key2021041509195786800_ref011","doi-asserted-by":"crossref","first-page":"570","DOI":"10.1177\/0165551515586669","article-title":"Automatic mapping of user tags to Wikipedia concepts: the case of a Q&A website \u2013 Stackoverflow","volume":"41","year":"2015","journal-title":"Journal of Information Science"},{"issue":"5","key":"key2021041509195786800_ref012","doi-asserted-by":"crossref","first-page":"976","DOI":"10.1108\/JD-07-2014-0103","article-title":"Augmenting Dublin core digital library metadata with Dewey decimal classification","volume":"71","year":"2015","journal-title":"Journal of Documentation"},{"key":"key2021041509195786800_ref013","doi-asserted-by":"crossref","unstructured":"Leacock, C. and Chodorow, M. (1998), \u201cCombining local context and Wordnet similarity for word sense identification\u201d, in Fellbaum, C. (Ed.), WordNet: An Electronic Lexical Database, MIT Press, pp. 265-283.","DOI":"10.7551\/mitpress\/7287.003.0018"},{"key":"key2021041509195786800_ref014","first-page":"142","article-title":"The substantial interdependence of Wikipedia and Google: a case study on the relationship between peer production communities and information technologies","volume-title":"Proceedings of the 11th International AAAI Conference on Web and Social Media (ICWSM 2017), Montreal, May 15-18","year":"2017"},{"key":"key2021041509195786800_ref015","unstructured":"Medelyan, O. (2009), \u201cHuman-competitive automatic topic indexing\u201d, PhD thesis, University of Waikato, Hamilton."},{"key":"key2021041509195786800_ref016","article-title":"An effective, low-cost measure of semantic relatedness obtained from Wikipedia links","year":"2008"},{"key":"key2021041509195786800_ref017","article-title":"Learning to link with Wikipedia","year":"2008"},{"key":"key2021041509195786800_ref018","doi-asserted-by":"crossref","first-page":"222","DOI":"10.1016\/j.artint.2012.06.007","article-title":"An open-source toolkit for mining Wikipedia","volume":"194","year":"2013","journal-title":"Artificial Intelligence"},{"key":"key2021041509195786800_ref019","unstructured":"O\u2019madadhain, J., Fisher, D., Nelson, T., White, S. and Boey, Y.-B. (2009), \u201cJUNG 2.0\u201d, Released Under the Open Source GPL Licence, available at: http:\/\/jung.sourceforge.net\/index.html (accessed March 11, 2012)."},{"key":"key2021041509195786800_ref020","unstructured":"Porter, M.F. (2002), \u201cThe English (Porter2) stemming algorithm\u201d, Snowball, available at: http:\/\/snowball.tartarus.org\/algorithms\/english\/stemmer.html (accessed March 11, 2012)."},{"issue":"1","key":"key2021041509195786800_ref021","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1109\/21.24528","article-title":"Development and application of a metric on semantic nets","volume":"19","year":"1989","journal-title":"IEEE Transactions on Systems, Man and Cybernetics"},{"key":"key2021041509195786800_ref022","unstructured":"Rainie, L. and Tancer, B. (2007), \u201cWikipedia users\u201d, Pew Internet and American Life Project, available at: www.pewinternet.org\/Reports\/2007\/Wikipedia-users.aspx (accessed July 2014)."},{"key":"key2021041509195786800_ref023","unstructured":"Safran, N. (2012), \u201cWikipedia in the SERPs\u201d, available at: www.conductor.com\/blog\/2012\/03\/wikipedia-in-the-serps-appears-on-page-1-for-60-of-informational-34-transactional-queries\/ (accessed July 2013)."},{"key":"key2021041509195786800_ref024","article-title":"Exploiting Wikipedia for information retrieval tasks","year":"2015"},{"key":"key2021041509195786800_ref025","article-title":"WikiRelate! computing semantic relatedness using Wikipedia","year":"2006"},{"issue":"11","key":"key2021041509195786800_ref026","doi-asserted-by":"crossref","first-page":"2269","DOI":"10.1002\/asi.21147","article-title":"An extensive study on automated Dewey decimal classification","volume":"60","year":"2009","journal-title":"Journal of the American Society for Information Science and Technology"},{"issue":"4","key":"key2021041509195786800_ref027","first-page":"78","article-title":"Automated text classification using library classification schemes: trends, issues, and challenges","volume":"36","year":"2007","journal-title":"International Cataloguing and Bibliographic Control (ICBC)"},{"key":"key2021041509195786800_ref028","unstructured":"Zickuhr, K. and Rainie, L. (2011), \u201cWikipedia, past and present\u201d, Pew Research Center, available at: www.pewinternet.org\/2011\/01\/13\/wikipedia-past-and-present\/ (accessed May 2017)."},{"key":"key2021041509195786800_ref029","volume-title":"Human Behaviour and the Principle of Least-Effort","year":"1949"}],"container-title":["Library Hi Tech"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/LHT-04-2017-0066\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/LHT-04-2017-0066\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T22:14:21Z","timestamp":1753395261000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/lht\/article\/36\/1\/57-74\/260774"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,12,20]]},"references-count":29,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2017,12,20]]},"published-print":{"date-parts":[[2018,2,7]]}},"alternative-id":["10.1108\/LHT-04-2017-0066"],"URL":"https:\/\/doi.org\/10.1108\/lht-04-2017-0066","relation":{},"ISSN":["0737-8831"],"issn-type":[{"value":"0737-8831","type":"print"}],"subject":[],"published":{"date-parts":[[2017,12,20]]}}}