{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,4]],"date-time":"2026-05-04T00:28:23Z","timestamp":1777854503414,"version":"3.51.4"},"reference-count":53,"publisher":"SAGE Publications","issue":"5","license":[{"start":{"date-parts":[[2015,5,22]],"date-time":"2015-05-22T00:00:00Z","timestamp":1432252800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Information Science"],"published-print":{"date-parts":[[2015,10]]},"abstract":"<jats:p>\n                    The uncontrolled nature of user-assigned tags makes them prone to various inconsistencies caused by spelling variations, synonyms, acronyms and hyponyms. These inconsistencies in turn lead to some of the common problems associated with the use of folksonomies such as the tag explosion phenomenon. Mapping user tags to their corresponding Wikipedia articles, as well-formed concepts, offers multifaceted benefits to the process of subject metadata generation and management in a wide range of online environments. These include normalization of inconsistencies, elimination of personal tags and improvement of the interchangeability of existing subject metadata. In this article, we propose a machine learning-based method capable of automatic mapping of user tags to their equivalent Wikipedia concepts. We have demonstrated the application of the proposed method and evaluated its performance using the currently most popular computer programming Q&amp;A website, StackOverflow.com , as our test platform. Currently, around 20 million posts in StackOverflow are annotated with about 37,000 unique user tags, from which we have chosen a subset of 1256 tags to evaluate the accuracy performance of our proposed mapping method. 
We have evaluated the performance of our method using the standard information retrieval measures of precision, recall and F<jats:sub>1<\/jats:sub>. Depending on the machine learning-based classification algorithm used as part of the mapping process, F<jats:sub>1<\/jats:sub> scores as high as 99.6% were achieved.\n                  <\/jats:p>","DOI":"10.1177\/0165551515586669","type":"journal-article","created":{"date-parts":[[2015,5,22]],"date-time":"2015-05-22T22:11:35Z","timestamp":1432332695000},"page":"570-583","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":19,"title":["Automatic mapping of user tags to Wikipedia concepts: The case of a Q&amp;A website \u2013 StackOverflow"],"prefix":"10.1177","volume":"41","author":[{"given":"Arash","family":"Joorabchi","sequence":"first","affiliation":[{"name":"Department of Electronic and Computer Engineering, University of Limerick, Ireland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Michael","family":"English","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Information Systems, University of Limerick, Ireland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Abdulhussain E.","family":"Mahdi","sequence":"additional","affiliation":[{"name":"Department of Electronic and Computer Engineering, University of Limerick, Ireland"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"179","published-online":{"date-parts":[[2015,5,22]]},"reference":[{"key":"bibr1-0165551515586669","unstructured":"Mathes A. 
Folksonomies \u2013 Cooperative classification and communication through shared metadata, http:\/\/www.adammathes.com\/academic\/computer-mediated-communication\/folksonomies.html (2004, accessed March 2015)."},{"key":"bibr2-0165551515586669","volume":"10","author":"Trant J","year":"2009","journal-title":"Journal of Digital Information"},{"key":"bibr3-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1045\/january2006-guy"},{"key":"bibr4-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1177\/0165551510386173"},{"key":"bibr5-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1002\/asi.22653"},{"key":"bibr6-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1177\/0165551512451808"},{"key":"bibr7-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1016\/j.websem.2010.10.001"},{"key":"bibr8-0165551515586669","volume-title":"1st International workshop on collective semantics: Collective intelligence & the Semantic Web (CISWeb 2008)","author":"Cantador I","year":"2008"},{"key":"bibr9-0165551515586669","first-page":"4","author":"Noruzi A","year":"2007","journal-title":"Webology"},{"key":"bibr10-0165551515586669","unstructured":"Wikipedia. Wikipedia:Size in volumes, http:\/\/en.wikipedia.org\/wiki\/Wikipedia:Size_in_volumes (2014, accessed Oct 2014)."},{"key":"bibr11-0165551515586669","unstructured":"Rainie L, Tancer B. 
Wikipedia users, http:\/\/www.pewinternet.org\/Reports\/2007\/Wikipedia-users.aspx (2007, accessed July 2014)."},{"key":"bibr12-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1108\/00220410910998906"},{"key":"bibr13-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1108\/JD-05-2013-0056"},{"key":"bibr14-0165551515586669","volume-title":"1st International workshop on collective semantics: Collective intelligence & the Semantic Web (CISWeb 2008) at the 5th annual European Semantic Web conference (ESWC 2008)","author":"Angeletou S","year":"2008"},{"key":"bibr15-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1017\/S026988891100018X"},{"key":"bibr16-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijhcs.2009.05.004"},{"key":"bibr17-0165551515586669","first-page":"9","volume-title":"11th Conference of the European chapter of the Association for Computational Linguistics (EACL-06)","author":"Bunescu RC","year":"2006"},{"key":"bibr18-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1145\/1401890.1401976"},{"key":"bibr19-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1145\/1557019.1557066"},{"key":"bibr20-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1145\/1871437.1871769"},{"key":"bibr21-0165551515586669","volume-title":"First AAAI workshop on Wikipedia and artificial intelligence (WIKIAI\u201908)","author":"Medelyan O","year":"2008"},{"key":"bibr22-0165551515586669","volume-title":"First AAAI workshop on Wikipedia and artificial intelligence (WIKIAI\u201908)","author":"Milne D","year":"2008"},{"key":"bibr23-0165551515586669","doi-asserted-by":"publisher","DOI":"10.3115\/1614108.1614160"},{"key":"bibr24-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1109\/WI.2006.119"},{"key":"bibr25-0165551515586669","volume-title":"Seventh international conference on language resources and evaluation (LREC\u201910)","author":"Vivaldi 
J","year":"2010"},{"key":"bibr26-0165551515586669","volume-title":"SIGIR workshop on time-aware information access (TAIA\u201912)","author":"Osborne M","year":"2012"},{"key":"bibr27-0165551515586669","unstructured":"Wikipedia. List of Wikipedias, http:\/\/en.wikipedia.org\/wiki\/List_of_Wikipedias (2014, accessed September 2014)."},{"key":"bibr28-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1145\/1871437.1871577"},{"key":"bibr29-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1162\/LEON_a_00344"},{"key":"bibr30-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1002\/asi.21577"},{"key":"bibr31-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1038\/438900a"},{"key":"bibr32-0165551515586669","unstructured":"Britannica. Fatally flawed \u2013 Refuting the recent study on encyclopedic accuracy by the journal Nature, http:\/\/corporate.britannica.com\/britannica_nature_response.pdf (2006, accessed July 2014)."},{"key":"bibr33-0165551515586669","first-page":"1","volume":"2014","author":"Xu G","journal-title":"Knowledge and Information Systems"},{"key":"bibr34-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-25832-9_65"},{"key":"bibr35-0165551515586669","first-page":"1440","volume-title":"Proceedings of the 22nd national conference on artificial intelligence \u2013 Volume 2","author":"Ponzetto SP","year":"2007"},{"key":"bibr36-0165551515586669","first-page":"1","volume-title":"Intelligent networking, collaborative systems and applications","author":"Fogarolli A","year":"2011"},{"key":"bibr37-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1145\/1321440.1321504"},{"key":"bibr38-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1145\/2441776.2441791"},{"key":"bibr39-0165551515586669","unstructured":"Stack Exchange statistics, http:\/\/stackexchange.com\/sites?view=list#traffic (2014, accessed September 
2014)."},{"key":"bibr40-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1145\/1984701.1984706"},{"key":"bibr41-0165551515586669","first-page":"1","volume":"2012","author":"Barua A","journal-title":"Empirical Software Engineering"},{"key":"bibr42-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1145\/1978942.1979366"},{"key":"bibr43-0165551515586669","doi-asserted-by":"crossref","unstructured":"Nasehi SM, Sillito J, Maurer F, Burns C. What makes a good code example? A study of programming Q&A in StackOverflow. In: Software maintenance (ICSM), 2012 28th IEEE International conference, 2012, pp. 25\u201334.","DOI":"10.1109\/ICSM.2012.6405249"},{"key":"bibr44-0165551515586669","unstructured":"Treude C, Figueira Filho F, Cleary B, Storey M-A. Programming in a socially networked world: The evolution of the social programmer. In: FutureCSD \u201812: Proceedings of the CSCW workshop on the future of collaborative software development, 2012."},{"key":"bibr45-0165551515586669","unstructured":"Meier W. eXist-DB, http:\/\/exist.sourceforge.net\/ (2014, accessed February 2014)."},{"key":"bibr46-0165551515586669","unstructured":"Milne D. An open-source toolkit for mining Wikipedia. 
In: New Zealand Computer Science Research Student Conference, 2009."},{"key":"bibr47-0165551515586669","volume-title":"Department of Computer Science","author":"Medelyan O","year":"2009"},{"key":"bibr48-0165551515586669","author":"Hulth A","year":"2004","journal-title":"Stockholm University"},{"key":"bibr49-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1023\/A:1009976227802"},{"key":"bibr50-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1145\/313238.313437"},{"key":"bibr51-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1177\/0165551510388080"},{"key":"bibr52-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1145\/1458082.1458150"},{"key":"bibr53-0165551515586669","doi-asserted-by":"publisher","DOI":"10.1145\/1656274.1656278"}],"container-title":["Journal of Information Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/0165551515586669","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/0165551515586669","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/0165551515586669","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T23:09:06Z","timestamp":1777504146000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/0165551515586669"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,5,22]]},"references-count":53,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2015,10]]}},"alternative-id":["10.1177\/0165551515586669"],"URL":"https:\/\/doi.org\/10.1177\/0165551515586669","relation":{},"ISSN":["0165-5515","1741-6485"],"issn-type":[{"value":"0165-5515","type":"print"},{"value":"1741-6485","typ
e":"electronic"}],"subject":[],"published":{"date-parts":[[2015,5,22]]}}}