{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,7]],"date-time":"2025-11-07T13:39:26Z","timestamp":1762522766479,"version":"3.41.2"},"reference-count":58,"publisher":"Emerald","issue":"1","license":[{"start":{"date-parts":[[2022,12,19]],"date-time":"2022-12-19T00:00:00Z","timestamp":1671408000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["AJIM"],"published-print":{"date-parts":[[2024,1,2]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title><jats:p>The purpose of this study is to explore to which extent data mining research would be associated with the library and information science (LIS) discipline. This study aims to identify data mining related subject terms and topics in representative LIS scholarly publications.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title><jats:p>A large set of bibliographic records over 38,000 was collected from a scholarly database representing the fields of LIS and the data mining, respectively. A multitude of text mining techniques were applied to investigate prevailing subject terms and research topics, such as influential term analysis and Dirichlet multinomial regression topic modeling.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Findings<\/jats:title><jats:p>The findings of this study revealed the relationship between the LIS and data mining research domains. Various data mining method terms were observed in recent LIS publications, such as machine learning, artificial intelligence and neural networks. The topic modeling result identified prevailing data mining related research topics in LIS, such as machine learning, deep learning, big data and among others. In addition, this study investigated the trends of popular topics in LIS over time in the recent decade.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title><jats:p>This investigation is one of a few studies that empirically investigated the relationships between the LIS and data mining research domains. Multiple text mining techniques were employed to delineate to which extent the two research domains would be associated with each other based on both at the term-level and topic-level analysis. Methodologically, the study identified influential terms in each domain using multiple feature selection indices. In addition, Dirichlet multinomial regression was applied to explore LIS topics in relation to data mining.<\/jats:p><\/jats:sec>","DOI":"10.1108\/ajim-05-2022-0260","type":"journal-article","created":{"date-parts":[[2022,12,15]],"date-time":"2022-12-15T13:20:40Z","timestamp":1671110440000},"page":"65-85","source":"Crossref","is-referenced-by-count":5,"title":["Data mining topics in the discipline of library and information science: analysis of influential terms and\u00a0Dirichlet multinomial regression topic model"],"prefix":"10.1108","volume":"76","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6145-2068","authenticated-orcid":false,"given":"Sukjin","family":"You","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6770-5118","authenticated-orcid":false,"given":"Soohyung","family":"Joo","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4899-2427","authenticated-orcid":false,"given":"Marie","family":"Katsurai","sequence":"additional","affiliation":[]}],"member":"140","published-online":{"date-parts":[[2022,12,19]]},"reference":[{"issue":"3","key":"key2023122213502755500_ref001","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1108\/00012531011046907","article-title":"Information literacy in the professional literature: an exploratory analysis","volume":"62","year":"2010","journal-title":"Aslib Proceedings"},{"issue":"9","key":"key2023122213502755500_ref002","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1108\/LHTN-05-2017-0035","article-title":"Defining big data and measuring its associated trends in the field of information and library management","volume":"34","year":"2017","journal-title":"Library Hi Tech News"},{"issue":"1","key":"key2023122213502755500_ref003","first-page":"49","article-title":"Big data research outputs in the library and information science: South African's contribution using bibliometric study of knowledge production","volume":"34","year":"2020","journal-title":"African Journal of Library, Archives and Information Science"},{"issue":"4","key":"key2023122213502755500_ref004","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1108\/LHTN-11-2019-0079","article-title":"Big data adoption in academic libraries: a literature review","volume":"37","year":"2020","journal-title":"Library Hi Tech News"},{"key":"key2023122213502755500_ref005","first-page":"1","article-title":"Analysis of bibliometrics research in library philosophy and practice from 1998-2021","volume":"2021","year":"2021","journal-title":"Library Philosophy and Practice"},{"year":"2018","journal-title":"Library Philosophy and Practice","article-title":"Artificial intelligence (AI) application in library systems in Iran: a\u00a0taxonomy study","key":"key2023122213502755500_ref006"},{"key":"key2023122213502755500_ref007","first-page":"993","article-title":"Latent Dirichlet\u00a0allocation","volume":"3","year":"2003","journal-title":"Journal of Machine Learning Research"},{"issue":"3","key":"key2023122213502755500_ref008","doi-asserted-by":"crossref","first-page":"1589","DOI":"10.1007\/s11192-018-2822-7","article-title":"Examining interdisciplinarity of library and information science (LIS) based on LIS articles contributed by non-LIS authors","volume":"116","year":"2018","journal-title":"Scientometrics"},{"issue":"1","key":"key2023122213502755500_ref009","first-page":"5","article-title":"Bibliometrics, scientometrics, webometrics\/cybermetrics, Informetrics and altmetrics--an emerging field in library and information science research","volume":"121","year":"2018","journal-title":"Shanlax International Journal of Education"},{"issue":"1","key":"key2023122213502755500_ref010","first-page":"197","article-title":"Public libraries and the social web: a review and analysis of the existing literature","volume":"76","year":"2020","journal-title":"Journal of Documentation"},{"issue":"1","key":"key2023122213502755500_ref011","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1016\/j.lisr.2014.09.003","article-title":"Research methods in library and information science: a content analysis","volume":"37","year":"2015","journal-title":"Library and Information Science Research"},{"issue":"4","key":"key2023122213502755500_ref012","doi-asserted-by":"crossref","first-page":"284","DOI":"10.1016\/j.lisr.2017.11.001","article-title":"Research methods: what's in the name?","volume":"39","year":"2017","journal-title":"Library and Information Science Research"},{"issue":"3","key":"key2023122213502755500_ref013","doi-asserted-by":"crossref","first-page":"2561","DOI":"10.1007\/s11192-020-03721-0","article-title":"Evolution of research topics in LIS between 1996 and 2019: an analysis based on latent Dirichlet\u00a0allocation topic model","volume":"125","year":"2020","journal-title":"Scientometrics"},{"issue":"23","key":"key2023122213502755500_ref014","doi-asserted-by":"crossref","first-page":"7875","DOI":"10.1007\/s00500-018-3511-4","article-title":"A bibliometric analysis of text mining in medical research","volume":"22","year":"2018","journal-title":"Soft Computing"},{"issue":"2","key":"key2023122213502755500_ref015","doi-asserted-by":"crossref","first-page":"369","DOI":"10.1007\/s11192-013-1076-7","article-title":"A co-word analysis of library and information science in China","volume":"97","year":"2013","journal-title":"Scientometrics"},{"issue":"4","key":"key2023122213502755500_ref059","first-page":"858","article-title":"Exploring the digital humanities research agenda: a text mining approach","volume":"78","year":"2022","journal-title":"Journal of Documentation"},{"key":"key2023122213502755500_ref016","article-title":"Research trends in text mining: semantic network and main path analysis of selected journals","volume":"162","year":"2020","journal-title":"Expert Systems with Applications"},{"issue":"1","key":"key2023122213502755500_ref058","first-page":"1","article-title":"Adoption of data mining methods in the discipline of library and information science","volume":"19","year":"2021","journal-title":"Journal of Library and Information Studies"},{"volume-title":"Data Science","year":"2018","key":"key2023122213502755500_ref017"},{"year":"2020","first-page":"317","article-title":"Exploring data science learning objectives in LIS education","key":"key2023122213502755500_ref018"},{"issue":"2","key":"key2023122213502755500_ref019","doi-asserted-by":"crossref","first-page":"283","DOI":"10.1108\/EL-08-2015-0160","article-title":"Global research on information literacy: a bibliometric analysis from 2005 to 2014","volume":"35","year":"2017","journal-title":"The Electronic Library"},{"issue":"3","key":"key2023122213502755500_ref020","doi-asserted-by":"crossref","first-page":"457","DOI":"10.4275\/KSLIS.2015.49.3.457","article-title":"Weighted subject-method network analysis of library and information science studies","volume":"49","year":"2015","journal-title":"Journal of the Korean Society for Library and Information Science"},{"issue":"3","key":"key2023122213502755500_ref021","doi-asserted-by":"crossref","first-page":"1753","DOI":"10.1007\/s11192-019-03239-0","article-title":"Visual topical analysis of library and information science","volume":"121","year":"2019","journal-title":"Scientometrics"},{"issue":"3","key":"key2023122213502755500_ref022","doi-asserted-by":"crossref","first-page":"278","DOI":"10.1016\/j.acalib.2019.04.001","article-title":"Popular research topics in the recent journal publications of library and information science","volume":"45","year":"2019","journal-title":"The Journal of Academic Librarianship"},{"issue":"5","key":"key2023122213502755500_ref023","article-title":"A temporally dynamic examination of research method usage in the Chinese library and information science community","volume":"58","year":"2021","journal-title":"Information Processing and Management"},{"issue":"3","key":"key2023122213502755500_ref024","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1108\/PMM-05-2021-0026","article-title":"A review of cluster analysis techniques and their uses in library and information science research: k-means and k-medoids clustering","volume":"22","year":"2021","journal-title":"Performance Measurement and Metrics"},{"issue":"1","key":"key2023122213502755500_ref025","doi-asserted-by":"crossref","first-page":"e413","DOI":"10.1002\/pra2.413","article-title":"A cluster analysis of data mining studies in library and information science from 2006 to 2018","volume":"57","year":"2020","journal-title":"Proceedings of the Association for Information Science and Technology"},{"issue":"8","key":"key2023122213502755500_ref026","doi-asserted-by":"crossref","first-page":"1059","DOI":"10.1002\/asi.24474","article-title":"The evolution and shift of research topics and methods in library and information science","volume":"72","year":"2021","journal-title":"Journal of the Association for Information Science and Technology"},{"issue":"1","key":"key2023122213502755500_ref027","first-page":"33","article-title":"A cluster and content analysis of data mining studies in Library and\u00a0Information Science","volume":"10","year":"2021","journal-title":"Qualitative and Quantitative Methods in Libraries"},{"issue":"2","key":"key2023122213502755500_ref042","first-page":"51","article-title":"Analyzing publishing trends in information literacy literature: a bibliometric study","volume":"20","year":"2015","journal-title":"Malaysian Journal of Library and Information Science"},{"issue":"2","key":"key2023122213502755500_ref028","doi-asserted-by":"crossref","first-page":"1","DOI":"10.20309\/jdis.201609","article-title":"Information science roles in the emerging field of data science","volume":"1","year":"2016","journal-title":"Journal of Data and Information Science"},{"year":"2008","article-title":"Topic models conditioned on arbitrary features with Dirichlet-multinomial regression","key":"key2023122213502755500_ref029"},{"issue":"2","key":"key2023122213502755500_ref030","doi-asserted-by":"crossref","first-page":"53","DOI":"10.3743\/KOSIM.2011.28.2.053","article-title":"A bibliometric analysis of the literature on information literacy","volume":"28","year":"2011","journal-title":"Journal of the Korean Society for Information Management"},{"issue":"1","key":"key2023122213502755500_ref031","doi-asserted-by":"crossref","first-page":"7","DOI":"10.3743\/KOSIM.2013.30.1.007","article-title":"A study on the research trends in library and information science in Korea using topic modeling","volume":"30","year":"2013","journal-title":"Journal of the Korean Society for Information Management"},{"issue":"302","key":"key2023122213502755500_ref032","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1080\/14786440009463897","article-title":"On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling","volume":"50","year":"1900","journal-title":"Philosophical Magazine and Journal of Science"},{"issue":"3","key":"key2023122213502755500_ref033","doi-asserted-by":"crossref","first-page":"130","DOI":"10.1108\/eb046814","article-title":"An algorithm for suffix stripping","volume":"14","year":"1980","journal-title":"Program: Electronic Library and Information Systems"},{"issue":"1","key":"key2023122213502755500_ref034","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1089\/big.2013.1508","article-title":"Data science and its relationship to Big Data and data-driven decision making","volume":"1","year":"2013","journal-title":"Big Data"},{"issue":"3","key":"key2023122213502755500_ref035","doi-asserted-by":"crossref","first-page":"1563","DOI":"10.1007\/s11192-020-03371-2","article-title":"The evolution of data science and big data research: a\u00a0bibliometric analysis","volume":"122","year":"2020","journal-title":"Scientometrics"},{"year":"2003","first-page":"616","article-title":"Tackling the poor assumptions of naive bayes text classifiers","key":"key2023122213502755500_ref036"},{"year":"2015","first-page":"399","article-title":"Exploring the space of topic coherence measures","key":"key2023122213502755500_ref037"},{"year":"2004","first-page":"487","article-title":"The author-topic model for authors and documents","key":"key2023122213502755500_ref038"},{"issue":"3","key":"key2023122213502755500_ref039","first-page":"14","article-title":"Data mining is a perpetual concept for library and information science: an estimated overview","volume":"5","year":"2015","journal-title":"International Journal of Digital Library Services"},{"key":"key2023122213502755500_ref040","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1162\/tacl_a_00099","article-title":"Comparing apples to apple: the effects of stemmers on topic models","volume":"4","year":"2016","journal-title":"Transactions of the Association for Computational Linguistics"},{"volume-title":"Introduction to Information Retrieval","year":"2008","key":"key2023122213502755500_ref041"},{"issue":"3","key":"key2023122213502755500_ref043","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1002\/j.1538-7305.1948.tb01338.x","article-title":"A mathematical theory of communication","volume":"27","year":"1948","journal-title":"The Bell System Technical Journal"},{"issue":"10","key":"key2023122213502755500_ref044","first-page":"51","article-title":"Topic analysis of LIS big data research with overlay mapping","volume":"5","year":"2021","journal-title":"Data Analysis and Knowledge Discovery"},{"issue":"1","key":"key2023122213502755500_ref045","first-page":"21","article-title":"Teaching tweeting: recommendations for teaching social media work in LIS and MSIS Programs","volume":"57","year":"2016","journal-title":"Journal of Education for Library and Information Science"},{"issue":"4","key":"key2023122213502755500_ref046","doi-asserted-by":"crossref","first-page":"676","DOI":"10.1108\/OIR-07-2018-0217","article-title":"Natural language processing applications in library and information science","volume":"43","year":"2019","journal-title":"Online Information Review"},{"issue":"2","key":"key2023122213502755500_ref047","doi-asserted-by":"crossref","first-page":"345","DOI":"10.1177\/0961000618793977","article-title":"A data-driven analysis of the knowledge structure of library science with full-text journal articles","volume":"52","year":"2020","journal-title":"Journal of Librarianship and Information Science"},{"doi-asserted-by":"crossref","unstructured":"Togia, A. and Malliari, A. (2017), \u201cResearch method in library and information science, qualitative versus quantitative research\u201d, in Oflazoglu, S. (Ed.), Qualitative versus Quantitative Research, InTech, Rijeka, pp.\u00a043-64.","key":"key2023122213502755500_ref048","DOI":"10.5772\/intechopen.68749"},{"issue":"1","key":"key2023122213502755500_ref049","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1016\/j.lisr.2018.03.002","article-title":"Account of methodologies and methods applied in LIS research: a\u00a0systematic review","volume":"40","year":"2018","journal-title":"Library and Information Science Research"},{"year":"2022","journal-title":"Journal of the Association for Information Science and Technology","article-title":"Evolution of data science and its education in iSchools: an\u00a0impressionistic study using curriculum analysis","key":"key2023122213502755500_ref050"},{"issue":"4","key":"key2023122213502755500_ref051","doi-asserted-by":"crossref","first-page":"422","DOI":"10.1108\/DTA-05-2019-0076","article-title":"Data science from a library and information science perspective","volume":"53","year":"2019","journal-title":"Data Technologies and Applications"},{"issue":"5","key":"key2023122213502755500_ref052","doi-asserted-by":"crossref","first-page":"643","DOI":"10.1108\/DTA-07-2020-0167","article-title":"Data science and its relationship to library and information science: a content analysis","volume":"54","year":"2020","journal-title":"Data Technologies and Applications"},{"issue":"6","key":"key2023122213502755500_ref053","doi-asserted-by":"crossref","first-page":"1243","DOI":"10.1108\/JD-02-2018-0036","article-title":"Twinning data science with information science in schools of library and information science","volume":"74","year":"2018","journal-title":"Journal of Documentation"},{"issue":"6","key":"key2023122213502755500_ref056","doi-asserted-by":"crossref","first-page":"1070","DOI":"10.1108\/EL-02-2016-0042","article-title":"Investigation on the statistical methods in research studies of library and information science","volume":"35","year":"2017","journal-title":"The Electronic Library"},{"doi-asserted-by":"crossref","unstructured":"Zhang, J. and Zhao, Y. (2014), \u201cVisual data mining in a Q&A based social media website\u201d, in Chen, C. and Larsen, R. (Eds), Library and Information Sciences, Springer, Berlin, Heidelberg, pp.\u00a041-55.","key":"key2023122213502755500_ref054","DOI":"10.1007\/978-3-642-54812-3_5"},{"issue":"3","key":"key2023122213502755500_ref055","doi-asserted-by":"crossref","first-page":"416","DOI":"10.1108\/OIR-07-2015-0247","article-title":"A study on statistical methods used in six journals of library and information science","volume":"40","year":"2016","journal-title":"Online Information Review"}],"container-title":["Aslib Journal of Information Management"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/AJIM-05-2022-0260\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/AJIM-05-2022-0260\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T23:00:48Z","timestamp":1753398048000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/ajim\/article\/76\/1\/65-85\/1217072"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,19]]},"references-count":58,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2022,12,19]]},"published-print":{"date-parts":[[2024,1,2]]}},"alternative-id":["10.1108\/AJIM-05-2022-0260"],"URL":"https:\/\/doi.org\/10.1108\/ajim-05-2022-0260","relation":{},"ISSN":["2050-3806"],"issn-type":[{"type":"print","value":"2050-3806"}],"subject":[],"published":{"date-parts":[[2022,12,19]]}}}