{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,4]],"date-time":"2026-02-04T18:33:36Z","timestamp":1770230016515,"version":"3.49.0"},"publisher-location":"Cham","reference-count":22,"publisher":"Springer Nature Switzerland","isbn-type":[{"value":"9783031657931","type":"print"},{"value":"9783031657948","type":"electronic"}],"license":[{"start":{"date-parts":[[2024,1,1]],"date-time":"2024-01-01T00:00:00Z","timestamp":1704067200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,8,15]],"date-time":"2024-08-15T00:00:00Z","timestamp":1723680000000},"content-version":"vor","delay-in-days":227,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Software is a central part of the scientific process and involved in obtaining, analysing, visualising and processing research data. Understanding the provenance of research requires an understanding of the involved software. However, software citations in scientific publications often are informal, what creates challenges when aiming at understanding software adoption. This paper provides an overview of the Software Mention Detection (SOMD) shared task conducted as part of the 2024 Natural Scientific Language Processing Workshop, aiming at advancing the state-of-the-art with respect to NLP methods for detecting software mentions and additional information in scholarly publications. The SOMD shared task encompasses three subtasks, concerned with software mention recognition (subtask I), recognition of additional information (subtask II) and classification of involved relations (subtask III). We present an overview of the tasks, received submissions and used techniques. The best submissions achieved F1 scores of 0.74 (subtask I), 0.838 (subtask II) and 0.911 (subtask III) indicating both task feasibility but also potential for further performance gains.<\/jats:p>","DOI":"10.1007\/978-3-031-65794-8_17","type":"book-chapter","created":{"date-parts":[[2024,8,14]],"date-time":"2024-08-14T06:02:44Z","timestamp":1723615364000},"page":"247-256","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["SOMD@NSLP2024: Overview and\u00a0Insights from\u00a0the\u00a0Software Mention Detection Shared Task"],"prefix":"10.1007","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7925-3363","authenticated-orcid":false,"given":"Frank","family":"Kr\u00fcger","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0009-0007-0124-5316","authenticated-orcid":false,"given":"Saurav","family":"Karmakar","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0009-0001-4364-9243","authenticated-orcid":false,"given":"Stefan","family":"Dietze","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,8,15]]},"reference":[{"key":"17_CR1","unstructured":"Berners-Lee, T.: Is your linked open data 5 star? (2010). http:\/\/www.w3.org\/DesignIssues\/LinkedData#fivestar"},{"key":"17_CR2","doi-asserted-by":"publisher","unstructured":"Duck, G., Nenadic, G., Filannino, M., Brass, A., Robertson, D.L., Stevens, R.: A survey of bioinformatics database and software usage through mining the literature. PloS One 11(6), 1\u201325 (2016). https:\/\/doi.org\/10.1371\/journal.pone.0157989","DOI":"10.1371\/journal.pone.0157989"},{"key":"17_CR3","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"publisher","first-page":"98","DOI":"10.1007\/978-3-642-41338-4_7","volume-title":"The Semantic Web \u2013 ISWC 2013","author":"S Hellmann","year":"2013","unstructured":"Hellmann, S., Lehmann, J., Auer, S., Br\u00fcmmer, M.: Integrating NLP using linked data. In: Alani, H., et al. (eds.) ISWC 2013. LNCS, vol. 8219, pp. 98\u2013113. Springer, Heidelberg (2013). https:\/\/doi.org\/10.1007\/978-3-642-41338-4_7"},{"issue":"9","key":"17_CR4","first-page":"2137","volume":"67","author":"J Howison","year":"2016","unstructured":"Howison, J., Bullard, J.: Software in the scientific literature: problems with seeing, finding, and using software mentioned in the biology literature. J. Am. Soc. Inf. Sci. 67(9), 2137\u20132155 (2016)","journal-title":"J. Am. Soc. Inf. Sci."},{"key":"17_CR5","doi-asserted-by":"publisher","unstructured":"Istrate, A.M., Li, D., Taraborelli, D., Torkar, M., Veytsman, B., Williams, I.: A large dataset of software mentions in the biomedical literature (2022).https:\/\/doi.org\/10.48550\/ARXIV.2209.00693","DOI":"10.48550\/ARXIV.2209.00693"},{"key":"17_CR6","doi-asserted-by":"publisher","unstructured":"Katz, D., et\u00a0al.: Recognizing the value of software: a software citation guide. F1000Research 9, 1257 (2021).https:\/\/doi.org\/10.12688\/f1000research.26932.2","DOI":"10.12688\/f1000research.26932.2"},{"key":"17_CR7","doi-asserted-by":"publisher","unstructured":"Kr\u00fcger, F.: SOMD - SOftware Mention Detection (2024). https:\/\/doi.org\/10.5281\/zenodo.10472161","DOI":"10.5281\/zenodo.10472161"},{"issue":"1","key":"17_CR8","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1002\/pra2.2016.14505301072","volume":"53","author":"K Li","year":"2016","unstructured":"Li, K., Lin, X., Greenberg, J.: Software citation, reuse and metadata considerations: an exploratory study examining LAMMPS. Proc. Assoc. Inf. Sci. Technol. 53(1), 1\u201310 (2016)","journal-title":"Proc. Assoc. Inf. Sci. Technol."},{"issue":"4","key":"17_CR9","doi-asserted-by":"publisher","first-page":"989","DOI":"10.1016\/j.joi.2017.08.003","volume":"11","author":"K Li","year":"2017","unstructured":"Li, K., Yan, E., Feng, Y.: How is R cited in research outputs? Structure, impacts, and citation standard. J. Informet. 11(4), 989\u20131002 (2017)","journal-title":"J. Informet."},{"key":"17_CR10","doi-asserted-by":"publisher","unstructured":"Manghi, P., et al.: The OpenAIRE research graph data model (2019). https:\/\/doi.org\/10.5281\/ZENODO.2643199","DOI":"10.5281\/ZENODO.2643199"},{"key":"17_CR11","unstructured":"Nakayama, H.: seqeval: a python framework for sequence labeling evaluation (2018). https:\/\/github.com\/chakki-works\/seqeval"},{"key":"17_CR12","doi-asserted-by":"crossref","unstructured":"Nangia, U., Katz, D.S.: Understanding software in research: initial results from examining nature and a call for collaboration. In: 2017 IEEE 13th International Conference on e-Science (e-Science), pp. 486\u2013487. IEEE (2017)","DOI":"10.1109\/eScience.2017.78"},{"issue":"4","key":"17_CR13","doi-asserted-by":"publisher","first-page":"860","DOI":"10.1016\/j.joi.2015.07.012","volume":"9","author":"X Pan","year":"2015","unstructured":"Pan, X., Yan, E., Wang, Q., Hua, W.: Assessing the impact of software on science: a bootstrapped learning of software entities in full-text papers. J. Informet. 9(4), 860\u2013871 (2015)","journal-title":"J. Informet."},{"key":"17_CR14","unstructured":"Pavao, A., et al.: CodaLab competitions: an open source platform to organize scientific challenges. J. Mach. Learn. Res. 24(198), 1\u20136 (2023). http:\/\/jmlr.org\/papers\/v24\/21-1436.html"},{"key":"17_CR15","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825\u20132830 (2011)","journal-title":"J. Mach. Learn. Res."},{"key":"17_CR16","unstructured":"Ronallo, J.: Html5 microdata and schema. org. Code4Lib J. (16) (2012)"},{"key":"17_CR17","doi-asserted-by":"crossref","unstructured":"Schindler, D., Bensmann, F., Dietze, S., Kr\u00fcger, F.: The role of software in science: a knowledge graph-based analysis of software mentions in PubMed Central. PeerJ Comput. Sci. 8, e835 (2022)","DOI":"10.7717\/peerj-cs.835"},{"key":"17_CR18","doi-asserted-by":"publisher","unstructured":"Schindler, D., Bensmann, F., Dietze, S., Kr\u00fcger, F.: SoMeSci-A 5 star open data gold standard knowledge graph of software mentions in scientific articles. In: Proceedings of the 30th ACM International Conference on Information and Knowledge Management (CIKM \u201921). Association for Computing Machinery, Virtual Event, QLD, Australia (2021). https:\/\/doi.org\/10.1145\/3459637.3482017","DOI":"10.1145\/3459637.3482017"},{"key":"17_CR19","doi-asserted-by":"publisher","unstructured":"Schindler, D., Hossain, T., Spors, S., Kr\u00fcger, F.: A multi-level analysis of data quality for formal software citation. Quant. Sci. Stud., 1\u201331 (June 2024). https:\/\/doi.org\/10.48550\/arXiv.2306.17535","DOI":"10.48550\/arXiv.2306.17535"},{"key":"17_CR20","series-title":"Lecture Notes in Computer Science","doi-asserted-by":"publisher","first-page":"271","DOI":"10.1007\/978-3-030-49461-2_16","volume-title":"The Semantic Web","author":"D Schindler","year":"2020","unstructured":"Schindler, D., Zapilko, B., Kr\u00fcger, F.: Investigating software usage in the social sciences: a knowledge graph approach. In: Harth, A., et al. (eds.) ESWC 2020. LNCS, vol. 12123, pp. 271\u2013286. Springer, Cham (2020). https:\/\/doi.org\/10.1007\/978-3-030-49461-2_16"},{"key":"17_CR21","doi-asserted-by":"publisher","unstructured":"Smith, A.M., Katz, D.S., Niemeyer, K.E.: Software citation principles. PeerJ Comput. Sci. 2, e86 (2016). https:\/\/doi.org\/10.7717\/peerj-cs.86","DOI":"10.7717\/peerj-cs.86"},{"key":"17_CR22","doi-asserted-by":"crossref","unstructured":"Yu, Y., et\u00a0al.: Low-rank adaptation of large language model rescoring for parameter-efficient speech recognition. In: 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pp.\u00a01\u20138. IEEE (2023)","DOI":"10.1109\/ASRU57964.2023.10389632"}],"container-title":["Lecture Notes in Computer Science","Natural Scientific Language Processing and Research Knowledge Graphs"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-031-65794-8_17","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,14]],"date-time":"2024-08-14T06:05:48Z","timestamp":1723615548000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/978-3-031-65794-8_17"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024]]},"ISBN":["9783031657931","9783031657948"],"references-count":22,"URL":"https:\/\/doi.org\/10.1007\/978-3-031-65794-8_17","relation":{},"ISSN":["0302-9743","1611-3349"],"issn-type":[{"value":"0302-9743","type":"print"},{"value":"1611-3349","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024]]},"assertion":[{"value":"15 August 2024","order":1,"name":"first_online","label":"First Online","group":{"name":"ChapterHistory","label":"Chapter History"}},{"value":"NSLP","order":1,"name":"conference_acronym","label":"Conference Acronym","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"International Workshop on Natural Scientific Language Processing and Research Knowledge Graphs","order":2,"name":"conference_name","label":"Conference Name","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Hersonissos, Crete","order":3,"name":"conference_city","label":"Conference City","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"Greece","order":4,"name":"conference_country","label":"Conference Country","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"2024","order":5,"name":"conference_year","label":"Conference Year","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"26 May 2024","order":7,"name":"conference_start_date","label":"Conference Start Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"26 May 2024","order":8,"name":"conference_end_date","label":"Conference End Date","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"1","order":9,"name":"conference_number","label":"Conference Number","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"nslp2024","order":10,"name":"conference_id","label":"Conference ID","group":{"name":"ConferenceInfo","label":"Conference Information"}},{"value":"https:\/\/nfdi4ds.github.io\/nslp2024\/","order":11,"name":"conference_url","label":"Conference URL","group":{"name":"ConferenceInfo","label":"Conference Information"}}]}}