{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T00:42:00Z","timestamp":1775608920314,"version":"3.50.1"},"reference-count":56,"publisher":"Emerald","issue":"3","license":[{"start":{"date-parts":[[2020,12,8]],"date-time":"2020-12-08T00:00:00Z","timestamp":1607385600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.emerald.com\/insight\/site-policies"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["JD"],"published-print":{"date-parts":[[2021,4,8]]},"abstract":"<jats:sec><jats:title content-type=\"abstract-subheading\">Purpose<\/jats:title><jats:p>The purpose of this study is to develop a model for automated classification of old digitised texts to the Universal Decimal Classification (UDC), using machine-learning methods.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Design\/methodology\/approach<\/jats:title><jats:p>The general research approach is inherent to design science research, in which the problem of UDC assignment of the old, digitised texts is addressed by developing a machine-learning classification model. A corpus of 70,000 scholarly texts, fully bibliographically processed by librarians, was used to train and test the model, which was used for classification of old texts on a corpus of 200,000 items. Human experts evaluated the performance of the model.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Findings<\/jats:title><jats:p>Results suggest that machine-learning models can correctly assign the UDC at some level for almost any scholarly text. Furthermore, the model can be recommended for the UDC assignment of older texts. Ten librarians corroborated this on 150 randomly selected texts.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Research limitations\/implications<\/jats:title><jats:p>The main limitations of this study were unavailability of labelled older texts and the limited availability of librarians.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Practical implications<\/jats:title><jats:p>The classification model can provide a recommendation to the librarians during their classification work; furthermore, it can be implemented as an add-on to full-text search in the library databases.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Social implications<\/jats:title><jats:p>The proposed methodology supports librarians by recommending UDC classifiers, thus saving time in their daily work. By automatically classifying older texts, digital libraries can provide a better user experience by enabling structured searches. These contribute to making knowledge more widely available and useable.<\/jats:p><\/jats:sec><jats:sec><jats:title content-type=\"abstract-subheading\">Originality\/value<\/jats:title><jats:p>These findings contribute to the field of automated classification of bibliographical information with the usage of full texts, especially in cases in which the texts are old, unstructured and in which archaic language and vocabulary are used.<\/jats:p><\/jats:sec>","DOI":"10.1108\/jd-06-2020-0092","type":"journal-article","created":{"date-parts":[[2020,12,8]],"date-time":"2020-12-08T03:46:47Z","timestamp":1607399207000},"page":"755-776","source":"Crossref","is-referenced-by-count":16,"title":["Automatic classification of older electronic texts into the Universal Decimal Classification\u2013UDC"],"prefix":"10.1108","volume":"77","author":[{"given":"Matja\u017e","family":"Kragelj","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4608-9090","authenticated-orcid":false,"given":"Mirjana","family":"Kljaji\u0107 Bor\u0161tnar","sequence":"additional","affiliation":[]}],"member":"140","published-online":{"date-parts":[[2020,12,8]]},"reference":[{"key":"key2022092814221241100_ref001","doi-asserted-by":"publisher","DOI":"10.1016\/j.measurement.2018.01.022","article-title":"A machine learning model for improving healthcare services on cloud computing environment","volume":"119","year":"2018","journal-title":"Measurement: Journal of the International Measurement Confederation"},{"key":"key2022092814221241100_ref002","first-page":"149","article-title":"Homogeneous multi-classifier system for moving vehicles noise classification based on multilayer perceptron","volume":"29","year":"2015"},{"key":"key2022092814221241100_ref003","doi-asserted-by":"crossref","unstructured":"Aggarwal, C.C. and Zhai, C. (2012), \u201cA survey of text clustering algorithms\u201d, in Mining Text Data, Springer US. doi: 10.1007\/978-1-4614-3223-4_4.","DOI":"10.1007\/978-1-4614-3223-4"},{"key":"key2022092814221241100_ref004","doi-asserted-by":"publisher","first-page":"541","DOI":"10.1186\/s12859-018-2496-4","article-title":"Fast and scalable neural embedding models for biomedical sentence classification","volume":"19","year":"2018","journal-title":"BMC Bioinformatics"},{"key":"key2022092814221241100_ref005","doi-asserted-by":"publisher","first-page":"1129","DOI":"10.1016\/j.ipm.2018.08.001","article-title":"Semantic text classification: a survey of past and recent advances","volume-title":"Information Processing and Management","year":"2018"},{"key":"key2022092814221241100_ref006","first-page":"658","article-title":"Automatic news articles classification in Indonesian language by using Naive Bayes Classifier method","year":"2009"},{"issue":"1","key":"key2022092814221241100_ref007","first-page":"4","article-title":"A review of machine learning algorithms for text-documents classification","volume":"1","year":"2010","journal-title":"Journal of Advances in Information Technology"},{"key":"key2022092814221241100_ref008","doi-asserted-by":"publisher","first-page":"9324","DOI":"10.1109\/ACCESS.2018.2890388","article-title":"Scientific paper Recommendation: a survey","volume":"7","year":"2019","journal-title":"IEEE Access"},{"key":"key2022092814221241100_ref009","article-title":"Mr. DLib: recommendations-as-a-service (RaaS) for academia","year":"2017"},{"key":"key2022092814221241100_ref010","doi-asserted-by":"publisher","volume-title":"New Review of Hypermedia and Multimedia an efficient scheme for automatic web pages categorisation using the support vector machine","year":"2016","DOI":"10.1080\/13614568.2016.1152316"},{"key":"key2022092814221241100_ref011","doi-asserted-by":"crossref","first-page":"614","DOI":"10.1016\/j.future.2018.04.054","article-title":"Classification of compressed and uncompressed text documents","volume":"88","year":"2018","journal-title":"Future Generation Computer Systems"},{"key":"key2022092814221241100_ref012","volume-title":"Natural Language Processing with Python","year":"2009"},{"key":"key2022092814221241100_ref013","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1016\/j.csl.2018.12.001","article-title":"Automatic classification of speech overlaps: feature representation and algorithms","volume":"55","year":"2019","journal-title":"Computer Speech and Language"},{"key":"key2022092814221241100_ref014","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1007\/978-0-387-34747-9_18","article-title":"Comparison of SVM and some older classification algorithms in text classification tasks","volume":"217","year":"2006","journal-title":"IFIP International Federation for Information Processing"},{"issue":"4","key":"key2022092814221241100_ref015","doi-asserted-by":"crossref","first-page":"1037","DOI":"10.1016\/j.joi.2016.07.009","article-title":"Clustering citation histories in the physical review","volume":"10","year":"2016","journal-title":"Journal of Informetrics"},{"issue":"4","key":"key2022092814221241100_ref016","doi-asserted-by":"crossref","first-page":"305","DOI":"10.1177\/0340035211430195","article-title":"Udc on the internet: theory and project in evolution for use of indexing and retrieval systems","volume":"37","year":"2011","journal-title":"IFLA Journal"},{"key":"key2022092814221241100_ref017","doi-asserted-by":"publisher","first-page":"23","DOI":"10.1145\/1015330.1015415","article-title":"Links between perceptrons, MLPs and SVMs","volume-title":"Proceedings of the Twenty-first International Conference on Machine Learning (ICML \u201904)","year":"2004"},{"key":"key2022092814221241100_ref018","first-page":"273","article-title":"Support-vector networks","volume":"297","year":"1995"},{"issue":"1","key":"key2022092814221241100_ref019","first-page":"48","article-title":"A nineteenth-century Cameo: Melvil Dewey in 1890","volume":"13","year":"1978","journal-title":"The Journal of Library History"},{"key":"key2022092814221241100_ref020","first-page":"195","article-title":"Automatic text classification algorithm based on Gauss improved convolutional neural network","volume-title":"Journal of Computational Science","year":"2017"},{"issue":"9\u201310","key":"key2022092814221241100_ref021","first-page":"2013","article-title":"Bringing order to digital libraries: from keyphrase extraction to index term assignment","volume":"19","year":"2013","journal-title":"D-lib Magazine"},{"key":"key2022092814221241100_ref022","first-page":"529","article-title":"Automatic free-text-tagging of online news archives","volume":"215","year":"2010","journal-title":"Frontiers in Artificial Intelligence and Applications"},{"key":"key2022092814221241100_ref023","doi-asserted-by":"crossref","unstructured":"Healthy, C. and Survey, K. (2014), \u201cPredicting methamphetamine use of homeless youths attending high school: comparison of decision rules and logistic regression classi fi cation algorithms\u201d, Vol. 5 No. 2, doi: 10.1086\/676830.","DOI":"10.1086\/676830"},{"issue":"1","key":"key2022092814221241100_ref024","doi-asserted-by":"crossref","first-page":"75","DOI":"10.2307\/25148625","article-title":"Design science IN information systems research","volume":"28","year":"2004","journal-title":"MIS Quarterly"},{"key":"key2022092814221241100_ref025","first-page":"966","article-title":"Text classification using machine learning techniques","volume":"4","year":"2005","journal-title":"WSEAS Transactions on Computers"},{"issue":"8","key":"key2022092814221241100_ref026","doi-asserted-by":"crossref","first-page":"651","DOI":"10.1016\/j.patrec.2009.09.011","article-title":"Data clustering: 50 years beyond K-means","volume":"31","year":"2010","journal-title":"Pattern Recognition Letters"},{"issue":"7","key":"key2022092814221241100_ref027","doi-asserted-by":"crossref","first-page":"42","DOI":"10.9781\/ijimai.2016.376","article-title":"Comparative study of clustering algorithms in text mining context","volume":"3","year":"2016","journal-title":"International Journal of Interactive Multimedia and Artificial Intelligence"},{"issue":"May 2018","key":"key2022092814221241100_ref028","first-page":"1","article-title":"Towards a big data framework for analysing social media content","volume":"44","year":"2019","journal-title":"International Journal of Information Management"},{"issue":"8","key":"key2022092814221241100_ref029","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0071246","article-title":"Hidden Markov models: the best models for forager movements?","volume":"8","year":"2013","journal-title":"PLoS ONE"},{"key":"key2022092814221241100_ref030","doi-asserted-by":"crossref","unstructured":"Karras, D.A. and Mertzios, B.G. (2002), in McKay, B. and Slaney, J. (Eds), A Robust Meaning Extraction Methodology Using Supervised Neural Networks BT - AI 2002: Advances in Artificial Intelligence, Springer Berlin Heidelberg, Berlin, Heidelberg, pp. 498-510.","DOI":"10.1007\/3-540-36187-1_44"},{"issue":"2","key":"key2022092814221241100_ref031","first-page":"51","article-title":"Text mining-scope and applications","volume":"5","year":"2013"},{"issue":"3\/4","key":"key2022092814221241100_ref032","first-page":"52","article-title":"Melvil Dewey, compulsive innovator","volume":"45","year":"2014","journal-title":"American Libraries"},{"issue":"6","key":"key2022092814221241100_ref033","doi-asserted-by":"crossref","first-page":"673","DOI":"10.1016\/j.bushor.2016.06.001","article-title":"Managerial work in the realm of the digital universe: the role of the data triad","volume":"59","year":"2016","journal-title":"Business Horizons"},{"issue":"5","key":"key2022092814221241100_ref034","doi-asserted-by":"crossref","first-page":"976","DOI":"10.1108\/JD-07-2014-0103","article-title":"Augmenting Dublin core digital library metadata with Dewey decimal classification","volume":"71","year":"2015","journal-title":"Journal of Documentation"},{"issue":"4","key":"key2022092814221241100_ref035","doi-asserted-by":"crossref","first-page":"317","DOI":"10.1080\/08839519308949993","article-title":"Inductive and bayesian learning in medical diagnosis 1 introduction","volume":"7","year":"1993","journal-title":"Applied Artificial Intelligence"},{"issue":"1","key":"key2022092814221241100_ref200","doi-asserted-by":"publisher","DOI":"10.1504\/jdr.2008.019897","article-title":"The emergence of design research in information systems in North America","volume":"7","year":"2008","journal-title":"Journal of Design Research"},{"key":"key2022092814221241100_ref036","first-page":"369","volume-title":"A Library's Information Retrieval System (In)effectiveness: Case Study","year":"2015"},{"issue":"6","key":"key2022092814221241100_ref037","doi-asserted-by":"crossref","first-page":"1343","DOI":"10.1108\/JD-02-2017-0025","article-title":"The relationship between classification research and information retrieval research , 1952 to 1970","volume":"73","year":"2017","journal-title":"Journal of Documentation"},{"issue":"1","key":"key2022092814221241100_ref038","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1007\/s13042-012-0068-x","article-title":"Comparative study on classification performance between support vector machine and logistic regression","volume":"4","year":"2013","journal-title":"International Journal of Machine Learning and Cybernetics"},{"key":"key2022092814221241100_ref039","doi-asserted-by":"crossref","unstructured":"Na, J., Indra, D. and Santony, J. (2019), \u201cAn artificial neural network approach for detecting skin cancer\u201d, Vol. 17 No. 2, doi: 10.12928\/TELKOMNIKA.v17i2.9547.","DOI":"10.12928\/telkomnika.v17i2.9547"},{"key":"key2022092814221241100_ref040","first-page":"841","article-title":"Regression and naive Bayes","volume":"14","year":"2001","journal-title":"Advances in Neural Information Processing Systems"},{"issue":"October","key":"key2022092814221241100_ref041","first-page":"412","article-title":"Crowdsourcing for botanical data collection towards to automatic plant identification: a review","volume":"155","year":"2018","journal-title":"Computers and Electronics in Agriculture"},{"issue":"3","key":"key2022092814221241100_ref042","doi-asserted-by":"crossref","first-page":"22","DOI":"10.6017\/ital.v34i3.5889","article-title":"Evaluation of semi-automatic metadata generation tools: a survey of the current state of the art","volume":"34","year":"2015","journal-title":"Information Technology and Libraries"},{"issue":"10","key":"key2022092814221241100_ref043","doi-asserted-by":"crossref","first-page":"12520","DOI":"10.1016\/j.eswa.2009.04.038","article-title":"Expert Systems with Applications A multi-disciplinar recommender system to advice research resources in University Digital Libraries","volume":"36","year":"2009","journal-title":"Expert Systems with Applications"},{"key":"key2022092814221241100_ref044","first-page":"1","volume-title":"Document Classification for Newspaper Articles","year":"2009"},{"issue":"3","key":"key2022092814221241100_ref045","first-page":"436","article-title":"A survey on classification techniques in internet environment","volume":"2","year":"2016","journal-title":"International Conference on Advanced Computing and Communication Systems (ICACCS)"},{"key":"key2022092814221241100_ref046","first-page":"7","article-title":"Research of neural networks application efficiency in automatic scientific articles classification according to UDC","year":"2016"},{"key":"key2022092814221241100_ref047","first-page":"84","article-title":"Need to categorize: A comparative look at the categories of universal decimal classification system and Wikipedia","volume-title":"Leonardo","year":"2012"},{"key":"key2022092814221241100_ref048","doi-asserted-by":"publisher","first-page":"125","DOI":"10.3897\/zookeys.480.8803","article-title":"Crowdsourcing the identification of organisms: a case-study of iSpot","volume":"480","year":"2015","journal-title":"ZooKeys"},{"issue":"2","key":"key2022092814221241100_ref049","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1108\/00220410810858029","article-title":"Use of the universal decimal classification: a worldwide survey","volume":"64","year":"2008","journal-title":"Journal of Documentation"},{"issue":"4","key":"key2022092814221241100_ref051","doi-asserted-by":"publisher","DOI":"10.20855\/ijav.2015.20.4387","article-title":"Ball bearing fault diagnosis using supervised and unsupervised machine learning methods","volume":"20","year":"2015","journal-title":"The International Journal of Acoustics and Vibration"},{"issue":"1","key":"key2022092814221241100_ref052","first-page":"1","article-title":"A hybrid approach to assignment of library of congress subject headings 1 introduction 2 related work","volume":"4","year":"2018"},{"key":"key2022092814221241100_ref053","article-title":"Text classification using a hidden Markov model","year":"2005"},{"issue":"4","key":"key2022092814221241100_ref054","first-page":"78","article-title":"Automated text classification using library classification schemes: trends, issues, and challenges","volume":"36","year":"2007","journal-title":"International Cataloguing and Bibliographic Control"},{"issue":"3","key":"key2022092814221241100_ref055","first-page":"177","article-title":"Support vector machines (SVMs) versus multilayer perception (MLP) in data classification","volume":"13","year":"2012","journal-title":"Egyptian Informatics Journal, Ministry of Higher Education and Scientific Research"},{"key":"key2022092814221241100_ref056","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1016\/j.techfore.2016.01.015","article-title":"Topic analysis and forecasting for science, technology and innovation: methodology with a case study focusing on big data research","volume":"105","year":"2016","journal-title":"Technological Forecasting and Social Change"}],"container-title":["Journal of Documentation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/JD-06-2020-0092\/full\/xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.emerald.com\/insight\/content\/doi\/10.1108\/JD-06-2020-0092\/full\/html","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,24]],"date-time":"2025-07-24T22:34:18Z","timestamp":1753396458000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.emerald.com\/jd\/article\/77\/3\/755-776\/195792"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,12,8]]},"references-count":56,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2020,12,8]]},"published-print":{"date-parts":[[2021,4,8]]}},"alternative-id":["10.1108\/JD-06-2020-0092"],"URL":"https:\/\/doi.org\/10.1108\/jd-06-2020-0092","relation":{},"ISSN":["0022-0418"],"issn-type":[{"value":"0022-0418","type":"print"}],"subject":[],"published":{"date-parts":[[2020,12,8]]}}}