{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,13]],"date-time":"2026-06-13T04:10:20Z","timestamp":1781323820525,"version":"3.54.1"},"reference-count":17,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2015,11,16]],"date-time":"2015-11-16T00:00:00Z","timestamp":1447632000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Informatics"],"abstract":"<jats:p>Through recognizing the importance of a qualified workforce, skills research has become one of the focal points in economics, sociology, and education. Great effort is dedicated to analyzing labor demand and supply, and actions are taken at many levels to match one with the other. In this work we concentrate on skills needs, a dynamic variable dependent on many aspects such as geography, time, or the type of industry. Historically, skills in demand were easy to evaluate since transitions in that area were fairly slow, gradual, and easy to adjust to. In contrast, current changes are occurring rapidly and might take an unexpected turn. Therefore, we introduce a relatively simple yet effective method of monitoring skills needs straight from the source\u2014as expressed by potential employers in their job advertisements. We employ open source tools such as RapidMiner and R as well as easily accessible online vacancy data. We demonstrate selected techniques, namely classification with k-NN and information extraction from a textual dataset, to determine effective ways of discovering knowledge from a given collection of vacancies.<\/jats:p>","DOI":"10.3390\/informatics2040031","type":"journal-article","created":{"date-parts":[[2015,11,16]],"date-time":"2015-11-16T10:17:10Z","timestamp":1447669030000},"page":"31-49","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":45,"title":["Skills and Vacancy Analysis with Data Mining Techniques"],"prefix":"10.3390","volume":"2","author":[{"given":"Izabela","family":"Wowczko","sequence":"first","affiliation":[{"name":"Institute of Technology Blanchardstown, Blanchardstown Rd North, Dublin 15, Ireland"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2015,11,16]]},"reference":[{"key":"ref_1","unstructured":"The UK Commission for Employment and Skills (2014). The Labour Market Story: Skills For the Future."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Handel, M. Trends in Job Skill Demands in OECD Countries. Available online: http:\/\/dx.doi.org\/10.1787\/5k8zk8pcq6td-en.","DOI":"10.1787\/5k8zk8pcq6td-en"},{"key":"ref_3","unstructured":"Cedefop (2013). User Guide to Developing an Employer Survey on Skill Needs, Publications Office of the European Union."},{"key":"ref_4","unstructured":"Manacorda, M., and Manning, A. (1999). Just Can\u2019t Get Enough: More on Skill-Biassed Change and Labour Market Performance, London School of Economics and Political Science."},{"key":"ref_5","unstructured":"EGFSN (2007). Tomorrow\u2019s Skills. Towards a National Skills Strategy, Expert Group on Future Skills Needs."},{"key":"ref_6","unstructured":"UNESCO (2012). International Standard Classification of Education ISCED 2011, UNESCO Institute for Statistics."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1109\/MS.2009.150","article-title":"Mining for Computing Jobs","volume":"27","author":"Litecky","year":"2010","journal-title":"IEEE Softw."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1109\/MITP.2012.7","article-title":"Evaluating the Demand for Soft Skills in Software Development","volume":"14","author":"Ahmed","year":"2012","journal-title":"IEEE IT Prof."},{"key":"ref_9","unstructured":"Kurekova, L., Haita, C., and Beblavy, M. (2012). Qualifications or Soft Skills? Studying Demand for Low-Skilled from Job Advertisements, NEUJOBS. NEUJOBS Working Paper No. 4.3.3."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1528","DOI":"10.1016\/j.proeng.2012.01.167","article-title":"Job Opportunity Finding by Text Classification","volume":"29","author":"Zhang","year":"2012","journal-title":"Procedia Eng."},{"key":"ref_11","unstructured":"Jiang, W., Huang, L., Liu, O., and Lu, Y. (2008, January 15\u201320). A Cascaded Linear Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging. Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics, Columbus, OH, USA."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Weiss, S.M., Indurkhya, N., and Zhang, T. (2010). Texts in Computer Science. Fundamentals of Predictive Text Mining, Springer.","DOI":"10.1007\/978-1-84996-226-1"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1007\/s12599-014-0344-2","article-title":"Comparing Business Intelligence and Big Data Skills\u2014A Text Mining Study Using Job Advertisements","volume":"6","author":"Debortoli","year":"2014","journal-title":"Bus. Inf. Syst. Eng."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1080\/01638539809545028","article-title":"Introduction to Latent Semantic Analysis","volume":"25","author":"Landauer","year":"1998","journal-title":"Discourse Process."},{"key":"ref_15","unstructured":"Albright, R. Taming Text with the SVD. Available online: ftp:\/\/ftp.dataflux.com\/techsup\/download\/EMiner\/TamingTextwiththeSVD.pdf."},{"key":"ref_16","unstructured":"Cedefop (2012). Methodological Framework, Publications Office of the European Union. Research Paper No. 25."},{"key":"ref_17","unstructured":"McNaboe, J., Cordon, N., Milicevic, I., Hogan, A., and Wowczko, I. Vacancy Overview 2014. Available online: http:\/\/www.solas.ie\/docs\/VacancyOverviewReport2015.pdf."}],"container-title":["Informatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2227-9709\/2\/4\/31\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T20:52:10Z","timestamp":1760215930000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2227-9709\/2\/4\/31"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,11,16]]},"references-count":17,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2015,12]]}},"alternative-id":["informatics2040031"],"URL":"https:\/\/doi.org\/10.3390\/informatics2040031","relation":{},"ISSN":["2227-9709"],"issn-type":[{"value":"2227-9709","type":"electronic"}],"subject":[],"published":{"date-parts":[[2015,11,16]]}}}