{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,27]],"date-time":"2025-03-27T04:31:27Z","timestamp":1743049887663,"version":"3.40.3"},"publisher-location":"Cham","reference-count":26,"publisher":"Springer International Publishing","isbn-type":[{"type":"print","value":"9783030615260"},{"type":"electronic","value":"9783030615277"}],"license":[{"start":{"date-parts":[[2020,1,1]],"date-time":"2020-01-01T00:00:00Z","timestamp":1577836800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,10,15]],"date-time":"2020-10-15T00:00:00Z","timestamp":1602720000000},"content-version":"vor","delay-in-days":288,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>With the pervasiveness of data mining (DM) in many areas of our society, the management of digital data, readily available for analysis, has become increasingly important. Consequently, nearly all community accepted guidelines and principles (e.g. FAIR and TRUST) for publishing such data in the digital ecosystem, stress the importance of semantic data enhancement. Having rich semantic annotation of DM datasets would support the data mining process at various choice points, such as data understanding, automatic identification of the analysis task, and reasoning over the obtained results. In this paper, we report on the developments of an ontology-based annotation schema for semantic description of DM datasets. The annotation schema combines three different aspects of semantic annotation, i.e., annotation of provenance, data mining specific, and domain-specific information. We demonstrate the utility of these annotations in two use cases: semantic annotation of remote sensing data and data about neurodegenerative diseases.<\/jats:p>","DOI":"10.1007\/978-3-030-61527-7_10","type":"book-chapter","created":{"date-parts":[[2020,10,14]],"date-time":"2020-10-14T18:04:39Z","timestamp":1602698679000},"page":"140-155","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["Semantic Description of Data Mining Datasets: An Ontology-Based Annotation Schema"],"prefix":"10.1007","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5983-7169","authenticated-orcid":false,"given":"Ana","family":"Kostovska","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2363-712X","authenticated-orcid":false,"given":"Sa\u0161o","family":"D\u017eeroski","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7685-9140","authenticated-orcid":false,"given":"Pan\u010de","family":"Panov","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2020,10,15]]},"reference":[{"key":"10_CR1","unstructured":"The Data Catalog vocabulary (DCAT) vocabulary (2019). https:\/\/www.w3.org\/TR\/vocab-dcat\/"},{"key":"10_CR2","unstructured":"The PROV Ontology (PROV-O) (2019). https:\/\/www.w3.org\/TR\/prov-o\/"},{"key":"10_CR3","unstructured":"The Schema.org vocabulary (2019). https:\/\/schema.org\/"},{"key":"10_CR4","unstructured":"PPMI website (2020). http:\/\/www.ppmi-info.org\/publications-presentations\/"},{"key":"10_CR5","unstructured":"Chapman, P., et al.: Crisp-DM 1.0 step-by-step data mining guide. Technical report, The CRISP-DM consortium, August 2000"},{"key":"10_CR6","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1016\/j.websem.2012.05.003","volume":"17","author":"M Compton","year":"2012","unstructured":"Compton, M., et al.: The SSN ontology of the W3C semantic sensor network incubator group. Web Semant. Sci. Serv. Agents World Wide Web 17, 25\u201332 (2012)","journal-title":"Web Semant. Sci. Serv. Agents World Wide Web"},{"key":"10_CR7","unstructured":"Esteves, D., Lawrynowicz, A., Panov, P., Soldatova, L., Soru, T., Vanschoren, J.: Ml schema core specification. W3C (2016). http:\/\/www.w3.org\/2016\/10\/mls"},{"key":"10_CR8","doi-asserted-by":"crossref","unstructured":"Esteves, D., et al.: Mex vocabulary: a lightweight interchange format for machine learning experiments. In: Proceedings of the 11th International Conference on Semantic Systems, pp. 169\u2013176 (2015)","DOI":"10.1145\/2814864.2814883"},{"issue":"5\u20136","key":"10_CR9","doi-asserted-by":"publisher","first-page":"907","DOI":"10.1006\/ijhc.1995.1081","volume":"43","author":"T Gruber","year":"1995","unstructured":"Gruber, T.: Toward principles for the design of ontologies used for knowledge sharing? Int. J. Hum. Comput. Stud. 43(5\u20136), 907\u2013928 (1995)","journal-title":"Int. J. Hum. Comput. Stud."},{"key":"10_CR10","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.websem.2018.06.003","volume":"56","author":"K Janowicz","year":"2019","unstructured":"Janowicz, K., Haller, A., Cox, S., Le Phuoc, D., Lefran\u00e7ois, M.: SOSA: a lightweight ontology for sensors, observations, samples, and actuators. J. Web Semant. 56, 1\u201310 (2019)","journal-title":"J. Web Semant."},{"key":"10_CR11","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1016\/j.websem.2015.01.001","volume":"32","author":"M Keet","year":"2015","unstructured":"Keet, M., et al.: The data mining optimization ontology. Web Semant. Sci. Serv. Agents World Wide Web 32, 43\u201353 (2015)","journal-title":"Web Semant. Sci. Serv. Agents World Wide Web"},{"key":"10_CR12","series-title":"Lecture Notes in Computer Science (Lecture Notes in Artificial Intelligence)","doi-asserted-by":"publisher","first-page":"235","DOI":"10.1007\/978-3-030-33778-0_19","volume-title":"Discovery Science","author":"A Kostovska","year":"2019","unstructured":"Kostovska, A., Tolovski, I., Maikore, F., Soldatova, L., Panov, P.: Neurodegenerative disease data ontology. In: Kralj Novak, P., \u0160muc, T., D\u017eeroski, S. (eds.) DS 2019. LNCS (LNAI), vol. 11828, pp. 235\u2013245. Springer, Cham (2019). https:\/\/doi.org\/10.1007\/978-3-030-33778-0_19"},{"issue":"1","key":"10_CR13","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41597-020-0486-7","volume":"7","author":"D Lin","year":"2020","unstructured":"Lin, D., et al.: The trust principles for digital repositories. Sci. Data 7(1), 1\u20135 (2020)","journal-title":"Sci. Data"},{"issue":"3","key":"10_CR14","doi-asserted-by":"publisher","first-page":"279","DOI":"10.1016\/j.ecoinf.2007.05.004","volume":"2","author":"J Madin","year":"2007","unstructured":"Madin, J., Bowers, S., Schildhauer, M., Krivov, S., Pennington, D., Villa, F.: An ontology for describing and synthesizing ecological observation data. Ecol. Inf. 2(3), 279\u2013296 (2007)","journal-title":"Ecol. Inf."},{"key":"10_CR15","unstructured":"Mileski, V., Kocev, D., Draganski, B., D\u017eeroski, S.: Multi-dimensional analysis of PPMI data. In: Proceedings of 8th Jo\u017eef Stefan International Postgraduate School Students Conference, pp. 175\u2013178. Jo\u017eef Stefan International Postgraduate School, Ljubljana, Slovenia (2016)"},{"key":"10_CR16","series-title":"Lecture Notes in Computer Science (Lecture Notes in Artificial Intelligence)","doi-asserted-by":"publisher","first-page":"126","DOI":"10.1007\/978-3-642-40897-7_9","volume-title":"Discovery Science","author":"P Panov","year":"2013","unstructured":"Panov, P., Soldatova, L., D\u017eeroski, S.: OntoDM-KDD: ontology for representing the knowledge discovery process. In: F\u00fcrnkranz, J., H\u00fcllermeier, E., Higuchi, T. (eds.) DS 2013. LNCS (LNAI), vol. 8140, pp. 126\u2013140. Springer, Heidelberg (2013). https:\/\/doi.org\/10.1007\/978-3-642-40897-7_9"},{"issue":"5","key":"10_CR17","doi-asserted-by":"publisher","first-page":"1222","DOI":"10.1007\/s10618-014-0363-0","volume":"28","author":"P Panov","year":"2014","unstructured":"Panov, P., Soldatova, L., D\u017eeroski, S.: Ontology of core data mining entities. Data Min. Knowl. Discov. 28(5), 1222\u20131265 (2014). https:\/\/doi.org\/10.1007\/s10618-014-0363-0","journal-title":"Data Min. Knowl. Discov."},{"key":"10_CR18","doi-asserted-by":"publisher","first-page":"900","DOI":"10.1016\/j.ins.2015.08.006","volume":"329","author":"P Panov","year":"2016","unstructured":"Panov, P., Soldatova, L., D\u017eeroski, S.: Generic ontology of datatypes. Inf. Sci. 329, 900\u2013920 (2016)","journal-title":"Inf. Sci."},{"issue":"3","key":"10_CR19","doi-asserted-by":"publisher","first-page":"201","DOI":"10.1212\/WNL.0b013e3181cb3e25","volume":"74","author":"RC Petersen","year":"2010","unstructured":"Petersen, R.C., et al.: Alzheimer\u2019s disease neuroimaging initiative (ADNI): clinical characterization. Neurology 74(3), 201\u2013209 (2010)","journal-title":"Neurology"},{"issue":"9","key":"10_CR20","doi-asserted-by":"publisher","first-page":"1119","DOI":"10.1016\/j.cageo.2004.12.004","volume":"31","author":"R Raskin","year":"2005","unstructured":"Raskin, R., Pan, M.: Knowledge representation in the semantic web for Earth and environmental terminology (SWEET). Comput. Geosci. 31(9), 1119\u20131125 (2005)","journal-title":"Comput. Geosci."},{"issue":"11","key":"10_CR21","doi-asserted-by":"publisher","first-page":"1251","DOI":"10.1038\/nbt1346","volume":"25","author":"B Smith","year":"2007","unstructured":"Smith, B., et al.: The obo foundry: coordinated evolution of ontologies to support biomedical data integration. Nat. Biotechnol. 25(11), 1251 (2007)","journal-title":"Nat. Biotechnol."},{"key":"10_CR22","unstructured":"Stojanova, D.: Estimating forest properties from remotely sensed data by using machine learning. Master\u2019s thesis, Jo\u017eef Stefan International Postgraduate School, Ljubljana, Slovenia (2009)"},{"issue":"4","key":"10_CR23","doi-asserted-by":"publisher","first-page":"256","DOI":"10.1016\/j.ecoinf.2010.03.004","volume":"5","author":"D Stojanova","year":"2010","unstructured":"Stojanova, D., Panov, P., Gjorgjioski, V., Kobler, A., D\u017eeroski, S.: Estimating vegetation height and canopy cover from remotely sensed data with machine learning. Ecol. Inf. 5(4), 256\u2013266 (2010)","journal-title":"Ecol. Inf."},{"key":"10_CR24","unstructured":"Vanschoren, J., Soldatova, L.: Expos\u00e9: an ontology for data mining experiments. In: International Workshop on Third Generation Data Mining: Towards Service-Oriented Knowledge Discovery (SoKD-2010), pp. 31\u201346 (2010)"},{"issue":"1","key":"10_CR25","doi-asserted-by":"publisher","first-page":"9","DOI":"10.1002\/bult.70","volume":"24","author":"S Weibel","year":"1997","unstructured":"Weibel, S.: The Dublin Core: a simple content description model for electronic resources. Bull. Assoc. Inf. Sci. Technol. 24(1), 9\u201311 (1997)","journal-title":"Bull. Assoc. Inf. Sci. Technol."},{"key":"10_CR26","doi-asserted-by":"publisher","first-page":"e0153507","DOI":"10.1038\/sdata.2016.18","volume":"3","author":"M Wilkinson","year":"2016","unstructured":"Wilkinson, M., et al.: The fair guiding principles for scientific data management and stewardship. Sci. Data 3, e0153507 (2016)","journal-title":"Sci. Data"}],"container-title":["Lecture Notes in Computer Science","Discovery Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-030-61527-7_10","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,3,7]],"date-time":"2024-03-07T11:54:43Z","timestamp":1709812483000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/978-3-030-61527-7_10"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020]]},"ISBN":["9783030615260","9783030615277"],"references-count":26,"URL":"https:\/\/doi.org\/10.1007\/978-3-030-61527-7_10","relation":{},"ISSN":["0302-9743","1611-3349"],"issn-type":[{"type":"print","value":"0302-9743"},{"type":"electronic","value":"1611-3349"}],"subject":[],"published":{"date-parts":[[2020]]},"assertion":[{"value":"15 October 2020","order":1,"name":"first_online","label":"First Online","group":{"name":"ChapterHistory","label":"Chapter History"}},{"value":"DS","order":1,"name":"conference_acronym","label":"Conference Acronym","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"International Conference on Discovery Science","order":2,"name":"conference_name","label":"Conference Name","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Thessaloniki","order":3,"name":"conference_city","label":"Conference City","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Greece","order":4,"name":"conference_country","label":"Conference Country","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"2020","order":5,"name":"conference_year","label":"Conference Year","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"19 October 2020","order":7,"name":"conference_start_date","label":"Conference Start Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"21 October 2020","order":8,"name":"conference_end_date","label":"Conference End Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"23","order":9,"name":"conference_number","label":"Conference Number","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"dis2020","order":10,"name":"conference_id","label":"Conference ID","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"https:\/\/ds2020.csd.auth.gr\/","order":11,"name":"conference_url","label":"Conference URL","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Single-blind","order":1,"name":"type","label":"Type","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"EasyChair","order":2,"name":"conference_management_system","label":"Conference Management System","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"76","order":3,"name":"number_of_submissions_sent_for_review","label":"Number of Submissions Sent for Review","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"26","order":4,"name":"number_of_full_papers_accepted","label":"Number of Full Papers Accepted","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"19","order":5,"name":"number_of_short_papers_accepted","label":"Number of Short Papers Accepted","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"34% - The value is computed by the equation \"Number of Full Papers Accepted \/ Number of Submissions Sent for Review * 100\" and then rounded to a whole number.","order":6,"name":"acceptance_rate_of_full_papers","label":"Acceptance Rate of Full Papers","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"3","order":7,"name":"average_number_of_reviews_per_paper","label":"Average Number of Reviews per Paper","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"4","order":8,"name":"average_number_of_papers_per_reviewer","label":"Average Number of Papers per Reviewer","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"Yes","order":9,"name":"external_reviewers_involved","label":"External Reviewers Involved","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}},{"value":"The conference took place virtually due to the COVID-19 pandemic","order":10,"name":"additional_info_on_review_process","label":"Additional Info on Review Process","group":{"name":"ConfEventPeerReviewInformation","label":"Peer Review Information (provided by the conference organizers)"}}]}}