{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,4]],"date-time":"2026-05-04T00:27:29Z","timestamp":1777854449596,"version":"3.51.4"},"reference-count":23,"publisher":"SAGE Publications","issue":"4","license":[{"start":{"date-parts":[[2011,7,15]],"date-time":"2011-07-15T00:00:00Z","timestamp":1310688000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Journal of Information Science"],"published-print":{"date-parts":[[2011,8]]},"abstract":"<jats:p>This paper proposes strategies for feature selection of digital news articles that allow an effective implementation of learning algorithms for the unsupervised classification of news articles. With the appropriate selection of a small subset of features a correct identification of related news can be achieved, thus enabling organizations and individual users to keep track of current events. The paper defines a quality measure of the discriminatory power of each feature and verifies that the selection of a feature subset with higher quality values allows obtaining good classification results. A Particle Swarm Optimization (PSO) based selection method is also proposed. Both proposals are validated on two collections of press clippings collated from news search services in digital media. Experimental results reveal that good classification accuracy can be achieved with small subsets of between 3 per cent and 6 per cent of the features.<\/jats:p>","DOI":"10.1177\/0165551511412028","type":"journal-article","created":{"date-parts":[[2011,7,16]],"date-time":"2011-07-16T03:25:34Z","timestamp":1310786734000},"page":"418-428","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":4,"title":["Feature selection strategies for automated classification of digital media content"],"prefix":"10.1177","volume":"37","author":[{"given":"Roc\u00edo","family":"Rocha","sequence":"first","affiliation":[{"name":"Department of Business Administration, University of Cantabria, Spain,"}]},{"given":"\u00c1ngel","family":"Cobo","sequence":"additional","affiliation":[{"name":"Department of Applied Mathematics and Computational Sciences, University of Cantabria, Spain"}]}],"member":"179","published-online":{"date-parts":[[2011,7,15]]},"reference":[{"key":"atypb1","volume-title":"Text mining. Predictive methods for analyzing unstructured information","author":"Weiss SM","year":"2005"},{"key":"atypb2","volume-title":"A platform for multilingual news summarization. Computer Science Technical Report","author":"Evans D.","year":"2003"},{"key":"atypb3","volume-title":"Introduction to modern information retrieval","author":"Salton G.","year":"1986"},{"key":"atypb4","volume-title":"Modern information retrieval","author":"Baeza R.","year":"1999"},{"key":"atypb5","first-page":"1","author":"Lee S.","year":"2010","journal-title":"Journal of Computer Information Systems"},{"key":"atypb6","doi-asserted-by":"publisher","DOI":"10.1016\/0306-4573(88)90021-0"},{"key":"atypb7","doi-asserted-by":"publisher","DOI":"10.1016\/S0306-4573(01)00051-6"},{"key":"atypb8","volume-title":"Proceedings of the 42nd annual meeting of Association for Computational Linguistics ACL","author":"Gao J."},{"key":"atypb9","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2010.12.160"},{"key":"atypb10","first-page":"1","author":"Mathieu B.","year":"2004","journal-title":"Avignon"},{"key":"atypb11","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2008.08.022"},{"key":"atypb12","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2010.12.156"},{"key":"atypb13","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4757-4305-0_4"},{"key":"atypb14","doi-asserted-by":"publisher","DOI":"10.1016\/S0004-3702(97)00043-X"},{"key":"atypb15","first-page":"1289","volume":"3","author":"Forman G.","year":"2003","journal-title":"Journal of Machine Learning"},{"key":"atypb16","first-page":"69","volume":"7","author":"Zahran BM","year":"2009","journal-title":"World Applied Science Journal"},{"key":"atypb17","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-36456-0_68"},{"key":"atypb18","volume-title":"Proceedings of workshop on mathematical formal methods in information retrieval. 25th ACM SIGIR Conference","author":"Friburger N."},{"key":"atypb19","doi-asserted-by":"publisher","DOI":"10.2498\/cit.2005.04.01"},{"key":"atypb20","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4757-4305-0_5"},{"key":"atypb21","doi-asserted-by":"publisher","DOI":"10.1002\/j.1538-7305.1948.tb01338.x"},{"key":"atypb22","volume-title":"Information Retrieval","author":"van Rijsbergen CJ","year":"1979","edition":"2"},{"key":"atypb23","volume-title":"Proceedings of the IEEE international conference on neural networks - ICNN Perth","author":"Kennedy J."}],"container-title":["Journal of Information Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/0165551511412028","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/0165551511412028","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T23:08:06Z","timestamp":1777504086000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/0165551511412028"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,7,15]]},"references-count":23,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2011,8]]}},"alternative-id":["10.1177\/0165551511412028"],"URL":"https:\/\/doi.org\/10.1177\/0165551511412028","relation":{},"ISSN":["0165-5515","1741-6485"],"issn-type":[{"value":"0165-5515","type":"print"},{"value":"1741-6485","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,7,15]]}}}