{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,17]],"date-time":"2026-02-17T12:14:15Z","timestamp":1771330455386,"version":"3.50.1"},"reference-count":67,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2024,12,7]],"date-time":"2024-12-07T00:00:00Z","timestamp":1733529600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,12,7]],"date-time":"2024-12-07T00:00:00Z","timestamp":1733529600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100002347","name":"Bundesministerium f\u00fcr Bildung und Forschung","doi-asserted-by":"publisher","award":["03WKDA1A"],"award-info":[{"award-number":["03WKDA1A"]}],"id":[{"id":"10.13039\/501100002347","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002347","name":"Bundesministerium f\u00fcr Bildung und Forschung","doi-asserted-by":"publisher","award":["03COV03E"],"award-info":[{"award-number":["03COV03E"]}],"id":[{"id":"10.13039\/501100002347","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002765","name":"Bundesministerium f\u00fcr Wirtschaft und Technologie","doi-asserted-by":"publisher","award":["01MK19011"],"award-info":[{"award-number":["01MK19011"]}],"id":[{"id":"10.13039\/501100002765","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100010669","name":"H2020 LEIT Information and Communication Technologies","doi-asserted-by":"publisher","award":["825627"],"award-info":[{"award-number":["825627"]}],"id":[{"id":"10.13039\/100010669","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100006211","name":"Humboldt-Universit\u00e4t zu Berlin","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100006211","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Lang Resources &amp; Evaluation"],"published-print":{"date-parts":[[2025,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Time and again we are faced, in a number of collaborative research projects, with the challenge of interconnecting various language processing tools to implement certain industry-driven use cases focusing, for the most part, upon digital content curation processes. In this paper we first describe several of the relevant projects and their technology platforms, followed by a description of the corresponding use cases and their requirements. The content curation platform we focus upon in this article and which has been implemented as a prototype makes use of a large number of NLP services, which we also build upon for other use cases and prototypes. In addition to the implemented NLP services, the article presents a workflow manager for the flexible creation and customisation of processing workflows that make use of the above mentioned NLP services. Based on the four key principles of generality, flexibility, scalability and efficiency, we present the first version of the workflow manager by providing details on its custom definition language, explaining the communication components and the general system architecture and setup. The paper also addresses challenges in interoperability across different NLP tasks and hardware-based resource use.<\/jats:p>","DOI":"10.1007\/s10579-024-09774-4","type":"journal-article","created":{"date-parts":[[2024,12,7]],"date-time":"2024-12-07T09:37:38Z","timestamp":1733564258000},"page":"4469-4492","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Rapidly developing NLP applications for content curation"],"prefix":"10.1007","volume":"59","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1418-9935","authenticated-orcid":false,"given":"Julian","family":"Moreno-Schneider","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2077-9907","authenticated-orcid":false,"given":"Malte","family":"Ostendorff","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3261-9735","authenticated-orcid":false,"given":"Konstantin","family":"Schulz","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5395-5463","authenticated-orcid":false,"given":"Karolina","family":"Zaczynska","sequence":"additional","affiliation":[]},{"given":"Florian","family":"Kintzel","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7800-1893","authenticated-orcid":false,"given":"Georg","family":"Rehm","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,12,7]]},"reference":[{"key":"9774_CR1","unstructured":"Aksenov, D., Moreno-Schneider, J., Bourgonje, P., Schwarzenberg, R., Hennig, L., & Rehm, G. (2020) Abstractive text summarization based on language model conditioning and locality modeling. In N. Calzolari, F. B\u00e9chet, P. Blache, C. Cieri, K. Choukri, T. Declerck, H. Isahara, B. Maegaard, J. Mariani, A. Moreno, J. Odijk, & S. Piperidis (Eds.), Proceedings of the 12th Language Resources and Evaluation Conference (LREC\u00a02020), European Language Resources Association (ELRA), Marseille, France, accepted for publication. Submitted version available as preprint."},{"key":"9774_CR2","doi-asserted-by":"crossref","unstructured":"Aksenov, D., Bourgonje, P., Zaczynska, K., Ostendorff, M., Moreno-Schneider, J., & Rehm, G. (2021) Fine-grained Classification of Political Bias in German News: A Data Set and Initial Experiments. In: Mostafazadeh\u00a0Davani A, Kiela D, Lambert M, Vidgen B, Prabhakaran V, Waseem Z (eds) Proceedings of the 5th Workshop on Online Abuse and Harms (WOAH\u00a02021), Association for Computational Linguistics (ACL), Bangkok, Thailand, pp 121\u2013131, co-located with ACL-IJCNLP\u00a02021. 1-6 August 2021","DOI":"10.18653\/v1\/2021.woah-1.13"},{"issue":"1","key":"9774_CR3","doi-asserted-by":"publisher","first-page":"22","DOI":"10.1016\/j.outlook.2020.09.001","volume":"69","author":"A Amit Aharon","year":"2021","unstructured":"Amit Aharon, A., Ruban, A., & Dubovi, I. (2021). Knowledge and information credibility evaluation strategies regarding COVID-19: A cross-sectional study. Nursing Outlook, 69(1), 22\u201331. https:\/\/doi.org\/10.1016\/j.outlook.2020.09.001","journal-title":"Nursing Outlook"},{"key":"9774_CR4","doi-asserted-by":"crossref","unstructured":"Avil\u00e9s Podgurski, LV., Zaczynska, K., & Rehm, G. (2022) Evaluating Web Content Using the W3C Credibility Signals. In A. Dimou, S. Neumaier, T. Pellegrini, & S. Vahdati (Eds.), Towards a Knowledge-Aware AI. SEMANTiCS 2022\u2014Proceedings of the 18th International Conference on Semantic Systems, 13-15 September 2022, Vienna, Austria, IOS Press, Amsterdam, no.\u00a055 in Studies on the Semantic Web, 3\u201320, 13-15 September 2022","DOI":"10.3233\/SSW220005"},{"key":"9774_CR5","doi-asserted-by":"crossref","unstructured":"Bourgonje, P., Moreno-Schneider ,J,. Nehring, J., Rehm, G,. Sasaki, F., & Srivastava, A. (2016) Towards a Platform for Curation Technologies: Enriching Text Collections with a Semantic-Web Layer. In H. Sack, G. Rizzo, N. Steinmetz, D. Mladeni\u0107, S. Auer, & C. Lange (Eds.), The Semantic Web, Springer, no. 9989 in Lecture Notes in Computer Science, 65\u201368, eSWC 2016 Satellite Events. Heraklion, Crete, Greece, May 29 \u2013 June 2, 2016 Revised Selected Papers","DOI":"10.1007\/978-3-319-47602-5_14"},{"key":"9774_CR6","unstructured":"Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V., Aswani, N., Roberts, I., Gorrell, G., Funk, A., Roberts, A., Damljanovic, D., Heitz, T., Greenwood, MA., Saggion, H., Petrak, J., Li, Y., & Peters, W. (2011). Text Processing with GATE (Version 6). http:\/\/tinyurl.com\/gatebook"},{"key":"9774_CR7","first-page":"3","volume-title":"Handbook of natural language processing","author":"R Dale","year":"2010","unstructured":"Dale, R. (2010). Classical approaches to natural language processing. In N. Indurkhya & F. J. Damerau (Eds.), Handbook of natural language processing (2nd ed., pp. 3\u20137). CRC Press, Taylor & Francis Group.","edition":"2"},{"key":"9774_CR8","doi-asserted-by":"publisher","unstructured":"Devlin, J., Chang, MW., Lee, K., & Toutanova, K. (2019) BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Association for Computational Linguistics, Minneapolis, Minnesota pp 4171\u2013418https:\/\/doi.org\/10.18653\/v1\/N19-1423, https:\/\/www.aclweb.org\/anthology\/N19-1423","DOI":"10.18653\/v1\/N19-1423"},{"key":"9774_CR9","unstructured":"Doran, D., Schulz, S., & Besold, TR. (2018) What Does Explainable AI Really Mean? A New Conceptualization of Perspectives. In: Besold TR, Kutz O (eds) Proceedings of the First International Workshop on Comprehensibility and Explanation in AI and ML 2017 Co-Located with 16th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2017), 1\u20138, 1710.00794"},{"issue":"3\u20134","key":"9774_CR10","doi-asserted-by":"publisher","first-page":"327","DOI":"10.1017\/S1351324904003523","volume":"10","author":"D Ferrucci","year":"2004","unstructured":"Ferrucci, D., & Lally, A. (2004). UIMA: An architectural approach to unstructured information processing in the corporate research environment. Natural Language Engineering, 10(3\u20134), 327\u201334. https:\/\/doi.org\/10.1017\/S1351324904003523","journal-title":"Natural Language Engineering"},{"key":"9774_CR11","doi-asserted-by":"publisher","unstructured":"Fi\u0161er, D., & Witt, A. (2022). CLARIN: The Infrastructure for Language Resources. De Gruyter, Berlin, Bosto d. https:\/\/doi.org\/10.1515\/9783110767377","DOI":"10.1515\/9783110767377"},{"issue":"12","key":"9774_CR12","doi-asserted-by":"publisher","first-page":"1285","DOI":"10.1038\/s41562-020-00994-6","volume":"4","author":"R Gallotti","year":"2020","unstructured":"Gallotti, R., Valle, F., Castaldo, N., Sacco, P., & De Domenico, M. (2020). Assessing the risks of infodemics in response to COVID-19 epidemics. Nature Human Behaviour, 4(12), 1285\u20131293. https:\/\/doi.org\/10.1038\/s41562-020-00994-6","journal-title":"Nature Human Behaviour"},{"key":"9774_CR13","unstructured":"Gonzalez Garcia, M., Schneider, JM., Ostendorff, M., &Rehm, G. (2023) Integration of a semantic storytelling recommender system in speech assistants. In R. Campos, A. Jorge, A. Jatowt, S. Bhatia, & M. Litvak (Eds.), Proceedings of Text2Story \u2013 Sixth International Workshop on Narrative Extraction from Texts held in conjunction with the 45th European Conference on Information Retrieval (ECIR 2023), Dublin, Ireland (pp.\u00a05\u201311). cEUR Workshop Proceedings, Volume 3370. 02 April 2023"},{"key":"9774_CR14","unstructured":"Gurevych, I., M\u00fchlh\u00e4user, M., M\u00fcller, C., Steimle, J., Weimer, M., & Zesch, T. (2007) Darmstadt Knowledge Processing Repository based on UIMA. In Proceedings of the First Workshop on Unstructured Information Management Architecture at Biannual Conference of the Society for Computational Linguistics and Language Technology, T\u00fcbingen, Germany (p.\u00a089)"},{"key":"9774_CR15","doi-asserted-by":"crossref","unstructured":"Hellmann, S., Lehmann, J., Auer, S., & Br\u00fcmmer, M. (2013). Integrating NLP using Linked Data. In The Semantic Web \u2013 ISWC 2013. 12th International Semantic Web Conference, 21-25 October 2013, Sydney, Australia, no. 8219 in Lecture Notes in Computer Science (pp.\u00a098\u2013113).","DOI":"10.1007\/978-3-642-41338-4_7"},{"key":"9774_CR16","unstructured":"Hinrichs, E., Hinrichs, M., & Zastrow, T. (2010). WebLicht: Web-based LRT services for German. In Proceedings of the ACL 2010 System Demonstrations, Association for Computational Linguistics, Uppsala, Sweden (pp.\u00a025\u201329). https:\/\/aclanthology.org\/P10-4005"},{"key":"9774_CR17","unstructured":"Ide, N., Pustejovsky, J., Cieri, C., Nyberg, E., Wang, D., Suderman, K., Verhagen, M., & Wright, J. (2014). The language application grid. In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC\u201914), European Language Resources Association (ELRA), Reykjavik, Iceland. http:\/\/www.lrec-conf.org\/proceedings\/lrec2014\/pdf\/926_Paper.pdf"},{"key":"9774_CR18","doi-asserted-by":"publisher","unstructured":"Junczys-Dowmunt, M., Grundkiewicz, R., Dwojak, T., Hoang, H., Heafield, K., Neckermann, T., Seide, F., Germann, U., Aji, AF., Bogoychev, N., Martins, A. F. T., & Birch, A. (2018) Marian: Fast Neural Machine Translation in C++. In: Proceedings of ACL2018, System Demonstrations, Association for Computational Linguistics, Melbourne, Australia, pp 116\u201312https:\/\/doi.org\/10.18653\/v1\/P18-4020, https:\/\/aclanthology.org\/P18-4020","DOI":"10.18653\/v1\/P18-4020"},{"key":"9774_CR19","doi-asserted-by":"crossref","unstructured":"Klein, G., Kim, Y., Deng, Y., Senellart, J., & Rush, A. (2017). OpenNMT: Open-source toolkit for neural machine translation. In Proceedings of ACL 2017, System Demonstrations, Association for Computational Linguistics, Vancouver, Canada (pp.\u00a067\u201372). https:\/\/aclanthology.org\/P17-4012","DOI":"10.18653\/v1\/P17-4012"},{"key":"9774_CR20","unstructured":"Labropoulou, P., Galanis, D., Lempesis, A., Greenwood, M., Knoth, P., Eckart\u00a0de Castilho, R., Sachtouris, S., Georgantopoulos, B., Martziou, S., Anastasiou, L., Gkirtzou, K., Manola, N., & Piperidis, S. (2018). OpenMinTeD: A platform facilitating text mining of scholarly content. In WOSP 2018 Workshop Proceedings, Eleventh International Conference on Language Resources and Evaluation (LREC 2018), European Language Resources Association (ELRA), Miyazaki, Japan (pp.\u00a07\u201312). http:\/\/lrec-conf.org\/workshops\/lrec2018\/W24\/pdf\/13_W24.pdf"},{"key":"9774_CR21","unstructured":"Labropoulou, P., Gkirtzou, K., Gavriilidou, M., Deligiannis, M., Galanis, D., Piperidis, S., Rehm, G., Berger, M., Mapelli, V., Rigault, M., Arranz, V., Choukri, K., Backfried, G., P\u00e9rez, JMG., &Garcia-Silva, A. (2020) Making Metadata Fit for Next Generation Language Technology Platforms: The Metadata Schema of the European Language Grid. In: Calzolari N, B\u00e9chet F, Blache P, Cieri C, Choukri K, Declerck T, Isahara H, Maegaard B, Mariani J, Moreno A, Odijk J, Piperidis S (eds) Proceedings of the 12th Language Resources and Evaluation Conference (LREC\u00a02020), European Language Resources Association (ELRA), Marseille, France, accepted for publication. Submitted version available as preprint."},{"key":"9774_CR22","doi-asserted-by":"publisher","unstructured":"Labropoulou, P., Piperidis, S., Deligiannis, M., Voukoutis, L., Giagkou, M., Ko\u0161arko, O., Haji\u010d, J., & Rehm, G. (2023) Interoperable Metadata Bridges to the wider Language Technology Ecosystem. In G. Rehm (Ed.), European Language Grid: A Language Technology Platform for Multilingual Europe, Cognitive Technologies. Springer International Publishing, Cham, Switzerland, pp. 107\u201312https:\/\/doi.org\/10.1007\/978-3-031-17258-8_6,","DOI":"10.1007\/978-3-031-17258-8_6"},{"issue":"1","key":"9774_CR23","doi-asserted-by":"publisher","first-page":"37","DOI":"10.3233\/DS-190026","volume":"3","author":"AL Lamprecht","year":"2020","unstructured":"Lamprecht, A. L., Garcia, L., Kuzak, M., Martinez, C., Arcila, R., Martin Del Pico, E., Dominguez Del Angel, V., Van De Sandt, S., Ison, J., & Martinez, P. A. (2020). Towards FAIR principles for research software. Data Science, 3(1), 37\u201359.","journal-title":"Data Science"},{"key":"9774_CR24","doi-asserted-by":"crossref","unstructured":"Leitner, E., Rehm, G., & Moreno-Schneider, J. (2019). Fine-grained named entity recognition in legal documents. In M. Acosta, P. Cudr\u00e9-Mauroux, M. Maleshkova, T. Pellegrini, H. Sack, & Y. Sure-Vetter (Eds.), Semantic Systems. The Power of AI and Knowledge Graphs. Proceedings of the 15th International Conference (SEMANTiCS 2019), Springer, Karlsruhe, Germany, no. 11702 in Lecture Notes in Computer Science, pp 272\u2013287, 10\/11 September 2019","DOI":"10.1007\/978-3-030-33220-4_20"},{"key":"9774_CR25","doi-asserted-by":"publisher","unstructured":"Liu, Y., & Lapata, M. (2019) Hierarchical transformers for multi-document summarization. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Florence, Italy (pp.\u00a05070\u201350). https:\/\/doi.org\/10.18653\/v1\/P19-1500, https:\/\/aclanthology.org\/P19-1500","DOI":"10.18653\/v1\/P19-1500"},{"key":"9774_CR26","doi-asserted-by":"crossref","unstructured":"Manning, C. D., Surdeanu, M., Bauer, J., Finkel, J., Bethard, S. J., & McClosky, D. (2014) The Stanford CoreNLP natural language processing toolkit. In Association for Computational Linguistics (ACL) System Demonstrations (pp.\u00a055\u201360). http:\/\/www.aclweb.org\/anthology\/P\/P14\/P14-5010","DOI":"10.3115\/v1\/P14-5010"},{"key":"9774_CR27","unstructured":"May, U., Zaczynska, K., Moreno-Schneider, J., & Rehm, G. (2021) Extraction and normalization of vague time expressions in German. In Proceedings of the 17th Conference on Natural Language Processing (KONVENS 2021), KONVENS 2021 Organizers, D\u00fcsseldorf, Germany, 114\u2013126, https:\/\/aclanthology.org\/2021.konvens-1.10"},{"key":"9774_CR28","unstructured":"Moreno-Schneider, J., & Rehm, G. (2018) Towards a Workflow Manager for Curation Technologies in the Legal Domain. In G. Rehm , V. Rodriguez-Doncel, & J. M. Schneider (Eds.), Proc. of the LREC 2018 Workshop on Language Resources and Technologies for the Legal Knowledge Graph, Miyazaki, Japan (pp.\u00a030\u201335)."},{"key":"9774_CR29","doi-asserted-by":"crossref","unstructured":"Moreno-Schneider, J., Srivastava, A., Bourgonje, P., Wabnitz, D., & Rehm, G. (2017). Semantic Storytelling, Cross-lingual Event Detection and other Semantic Services for a Newsroom Content Curation Dashboard. In Proc. of the Second Workshop on Natural Language Processing meets Journalism - EMNLP 2017 Workshop (NLPMJ 2017 (Ed.), Popescu O, Strapparava C (pp. 68\u201373). Copenhagen: Denmark.","DOI":"10.18653\/v1\/W17-4212"},{"key":"9774_CR30","unstructured":"Moreno-Schneider, J., Bourgonje, P., Kintzel, F., & Rehm, G. (2020) A Workflow Manager for Complex NLP and Content Curation Pipelines. In G. Rehm, K. Bontcheva, K. Choukri, J. Hajic, S. Piperidis, & A. Vasiljevs (Eds.), Proceedings of the 1st International Workshop on Language Technology Platforms (IWLTP 2020, co-located with LREC 2020), Marseille, France, pp 73\u201380, 16 May 2020"},{"key":"9774_CR31","doi-asserted-by":"publisher","unstructured":"Moreno-Schneider, J., Plakidis, M., & Rehm, G. (2021a) Annotation of fine-grained geographical entities in german texts. In D. Gromann, G. S\u00e9rasset, T. Declerck, J. P. McCrae, J. Gracia, J. Bosque-Gil, F. Bobillo, & B. Heinisch (Eds.), 3rd Conference on Language, Data and Knowledge, LDK 2021, September 1-3, 2021, Zaragoza, Spain, Schloss Dagstuhl - Leibniz-Zentrum f\u00fcr Informatik, OASIcs (Vol.\u00a093, pp.\u00a011:1\u201311). https:\/\/doi.org\/10.4230\/OASIcs.LDK.2021.11,","DOI":"10.4230\/OASIcs.LDK.2021.11"},{"key":"9774_CR32","doi-asserted-by":"crossref","unstructured":"Moreno-Schneider J, Rehm G, Montiel-Ponsoda E, Rodr\u00edguez-Doncel V, Mart\u00edn-Chozas P, Navas-Loro M, Kaltenb\u00f6ck M, Revenko A, Karampatakis S, Sageder C, Gracia J, Maganza F, Kernerman I, Lonke D, Lagzdins A, Gil JB, Verhoeven P, Diaz EG, Ballesteros PB (2021b) Lynx: A Knowledge-based AI Service Platform for Content Processing, Enrichment and Analysis for the Legal Domain. Information Systems p 101966, special Issue on Managing, Mining and Learning in the Legal Data Domain.","DOI":"10.1016\/j.is.2021.101966"},{"key":"#cr-split#-9774_CR33.1","unstructured":"Moreno-Schneider, J., Calizzano, R., Kintzel, F., Rehm, G., Galanis, D., & Roberts, I. (2022) Towards Practical Semantic Interoperability in NLP Platforms. In H. Bunt (Ed.), Proceedings of the 18th Joint ACL-ISO Workshop on Interoperable Semantic Annotation (ISA 2022"},{"key":"#cr-split#-9774_CR33.2","unstructured":"co-located with LREC 2022), Marseille, France (pp.\u00a0118-126) 20 June 2022"},{"key":"9774_CR34","doi-asserted-by":"publisher","first-page":"151","DOI":"10.1016\/j.artint.2012.03.006","volume":"194","author":"J Nothman","year":"2013","unstructured":"Nothman, J., Ringland, N., Radford, W., Murphy, T., & Curran, J. R. (2013). Learning multilingual named entity recognition from wikipedia. Artificial Intelligence, 194, 151\u2013175.","journal-title":"Artificial Intelligence"},{"key":"9774_CR35","unstructured":"Ostendorff, M., Bourgonje, P., Berger, M., Moreno-Schneider, J., & Rehm, G. (2019) Enriching BERT with Knowledge Graph Embeddings for Document Classification. In S. Remus, R. Aly, & C. Biemann (Eds.,) Proceedings of the GermEval Workshop 2019 \u2013 Shared Task on the Hierarchical Classification of Blurbs, Erlangen, Germany, 8 October 2019"},{"key":"9774_CR36","doi-asserted-by":"publisher","unstructured":"Ostendorff, M., Ruas, T., Schubotz, M., Rehm, G., & Gipp, B (2020) Pairwise multi-class document classification for semantic relations between wikipedia articles. In Proceedings of the ACM\/IEEE Joint Conference on Digital Libraries in 2020, Association for Computing Machinery, New York, NY, USA, JCDL \u201920 (pp.\u00a0127\u201313). https:\/\/doi.org\/10.1145\/3383583.3398525,","DOI":"10.1145\/3383583.3398525"},{"key":"9774_CR37","doi-asserted-by":"crossref","unstructured":"Pasi, G., De\u00a0Grandis, M., & Viviani, M. (2020) Decision making over multiple criteria to assess news credibility in microblogging sites. In 2020 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), IEEE (pp.\u00a01\u20138).","DOI":"10.1109\/FUZZ48607.2020.9177751"},{"key":"9774_CR38","doi-asserted-by":"publisher","first-page":"128","DOI":"10.1016\/j.scico.2016.01.001","volume":"121","author":"M Perov\u0161ek","year":"2016","unstructured":"Perov\u0161ek, M., Kranjc, J., Erjavec, T., Cestnik, B., & Lavra\u010d, N. (2016). Textflows: A visual programming platform for text mining and natural language processing. Science of Computer Programming, 121, 128\u2013152.","journal-title":"Science of Computer Programming"},{"key":"9774_CR39","unstructured":"Pietsch, M., Soni, T., Chan, B.,M\u00f6ller, T., & Kosti\u0107, B. (2020) Haystack. https:\/\/github.com\/deepset-ai\/haystack\/, version 0.5.0"},{"key":"9774_CR40","doi-asserted-by":"publisher","unstructured":"Piperidis, S., Labropoulou, P., Galanis, D., Deligiannis, M., & Rehm, G. (2023) The European Language Grid Platform: Basic Concepts. In G. Rehm (Ed.), European Language Grid: A Language Technology Platform for Multilingual Europe, Cognitive Technologies. Springer, Cham, Switzerland, pp.\u00a013\u20133https:\/\/doi.org\/10.1007\/978-3-031-17258-8_2,","DOI":"10.1007\/978-3-031-17258-8_2"},{"key":"9774_CR41","unstructured":"Raring, M., Ostendorff, M., &Rehm, G. (2022) Semantic Relations between Text Segments for Semantic Storytelling: Annotation Tool \u2013 Dataset \u2013 Evaluation. In: Calzolari N, B\u00e9chet F, Blache P, Cieri C, Choukri K, Declerck T, Isahara H, Maegaard B, Mariani J, Odijk J, Piperidis S (eds) Proceedings of the 13th Language Resources and Evaluation Conference (LREC\u00a02022), European Language Resources Association (ELRA), Marseille, France, pp 4923\u20134932, june 20-25, 2022"},{"key":"9774_CR42","doi-asserted-by":"crossref","unstructured":"Rehbein, M., & Fritze, C. (2015). Hands-On Teaching Digital Humanities: A Didactic Analysis of a Summer School Course on Digital Editing. In B. D. Hirsch (Ed.), Digital Humanities Pedagogy Practices. (47\u201378). Cambridge: Principles and Politics, Digital Humanities Series, Open Book Publishers.","DOI":"10.2307\/j.ctt5vjtt3.7"},{"key":"9774_CR43","doi-asserted-by":"crossref","unstructured":"Rehm, G. (Ed.). (2023). European Language Grid: A Language Technology Platform for Multilingual Europe. Springer, Cham, Switzerland: Cognitive Technologies.","DOI":"10.1007\/978-3-031-17258-8"},{"key":"9774_CR44","doi-asserted-by":"crossref","unstructured":"Rehm, G., Moreno-Schneider, J., Bourgonje, P., Srivastava, A., Nehring, J., Berger, A., K\u00f6nig, L., R\u00e4uchle, S., & Gerth, J. (2017). Event Detection and Semantic Storytelling: Generating a Travelogue from a large Collection of Personal Letters. In B. Miller, M. van Erp, P. Vossen, M. Palmer, E. Hovy, & T. Mitamura (Eds.), Caselli T (pp. 42\u201351). Association for Computational Linguistics, Vancouver, Canada: Proc. of the Events and Stories in the News Workshop.","DOI":"10.18653\/v1\/W17-2707"},{"key":"9774_CR45","doi-asserted-by":"crossref","unstructured":"Rehm, G., Lee, M., Moreno-Schneider, J., & Bourgonje, P. (2019a). Curation Technologies for a Cultural Heritage Archive: Analysing and transforming a heterogeneous data set into an interactive curation workbench. In A. Antonacopoulos, M. B\u00fcchler (Eds.), DATeCH 2019: Proceedings of the 3rd International Conference on Digital Access to Textual Cultural Heritage, Brussels, Belgium (pp.\u00a0117\u2013122) (2019).","DOI":"10.1145\/3322905.3322909"},{"key":"9774_CR46","unstructured":"Rehm, G., Zaczynska, K., & Schneider, JM. (2019b) Semantic Storytelling: Towards Identifying Storylines in Large Amounts of Text Content. In A. Jorge, R. Campos, A. Jatowt, & S. Bhatia (Eds.), Proc. of Text2Story \u2013 Second Workshop on Narrative Extraction From Texts co-located with 41th European Conf. on Information Retrieval (ECIR 2019), Cologne, Germany (pp.\u00a063\u201370) (2019)."},{"key":"9774_CR47","unstructured":"Rehm G, Bourgonje P, Hegele S, Kintzel F, Moreno-Schneider J, Ostendorff M, Zaczynska K, Berger A, Grill S, R\u00e4uchle S, Rauenbusch J, Rutenburg L, Schmidt A, Wild M, Hoffmann H, Fink J, Schulz S, Seva J, Quantz J, B\u00f6ttger J, Matthey J, Fricke R, Thomsen J, Paschke A, Qundus JA, Hoppe T, Karam N, Weichhardt F, Fillies C, Neudecker C, Gerber M, Labusch K, Rezanezhad V, Schaefer R, Zellh\u00f6fer D, Siewert D, Bunk P, Pintscher L, Aleynikova E, Heine F (2020a) QURATOR: Innovative Technologies for Content and Data Curation. In: Paschke A, Neudecker C, Rehm G, Qundus JA, Pintscher L (eds) Proceedings of QURATOR 2020 \u2013 The conference for intelligent content solutions, Berlin, Germany, cEUR Workshop Proceedings, Volume 2535. 20\/21 January 2020"},{"key":"9774_CR48","unstructured":"Rehm G, Galanis D, Labropoulou P, Piperidis S, Wel\u00df M, Usbeck R, K\u00f6hler J, Deligiannis M, Gkirtzou K, Fischer J, Chiarcos C, Feldhus N, Moreno-Schneider J, Kintzel F, Montiel E, Doncel VR, McCrae JP, Laqua D, Theile IP, Dittmar C, Bontcheva K, Roberts I, Vasiljevs A, Lagzdin\u0161 A (2020b) Towards an Interoperable Ecosystem of AI and LT Platforms: A Roadmap for the Implementation of Different Levels of Interoperability. In: Rehm G, Bontcheva K, Choukri K, Hajic J, Piperidis S, Vasiljevs A (eds) Proceedings of the 1st International Workshop on Language Technology Platforms (IWLTP 2020, co-located with LREC 2020), Marseille, France, pp 96\u2013107, 16 May 2020"},{"key":"9774_CR49","first-page":"240","volume-title":"Computational Analysis of Storylines: Making Sense of Events","author":"G Rehm","year":"2021","unstructured":"Rehm, G., Zaczynska, K., Bourgonje, P., Ostendorff, M., Moreno-Schneider, J., Berger, M., Rauenbusch, J., Schmidt, A., Wild, M., B\u00f6ttger, J., Quantz, J., Thomsen, J., & Fricke, R. (2021). Semantic Storytelling: From Experiments and Prototypes to a Technical Solution. In T. Caselli, E. Hovy, M. Palmer, & P. Vossen (Eds.), Computational Analysis of Storylines: Making Sense of Events (pp. 240\u2013259). Cambridge: Studies in Natural Language Processing, Cambridge University Press."},{"key":"9774_CR50","doi-asserted-by":"publisher","unstructured":"Reul, C., Christ, D., Hartelt, A., Balbach, N., Wehner, M., Springmann, U., Wick, C., Grundig, C., B\u00fcttner, A., & Puppe, F. (2019) OCR4all\u2014An open-source tool providing a (semi-) automatic OCR workflow for historical printings. Applied Sciences 9(22):1\u2013https:\/\/doi.org\/10.3390\/app9224853","DOI":"10.3390\/app9224853"},{"key":"9774_CR51","unstructured":"Richardson, L., Amundsen, M., & Ruby, S. (2013) RESTful Web APIs. O\u2019Reilly Media, Inc."},{"key":"9774_CR52","doi-asserted-by":"publisher","DOI":"10.1633\/JISTaP.2014.2.3.1","author":"SY Rieh","year":"2014","unstructured":"Rieh, S. Y. (2014). Credibility assessment of online information in context. Journal of Information Science Theory and Practice. https:\/\/doi.org\/10.1633\/JISTaP.2014.2.3.1","journal-title":"Journal of Information Science Theory and Practice"},{"key":"9774_CR53","doi-asserted-by":"publisher","unstructured":"Ro, Y., Lee, Y., & Kang, P. (2020) Multi$$^{2}$$OIE: Multilingual open information extraction based on multi-head attention with BERT. In: Findings of the Association for Computational Linguistics: EMNLP 2020, Association for Computational Linguistics, Online, pp 1107\u2013111https:\/\/doi.org\/10.18653\/v1\/2020.findings-emnlp.99, https:\/\/aclanthology.org\/2020.findings-emnlp.99","DOI":"10.18653\/v1\/2020.findings-emnlp.99"},{"key":"9774_CR54","doi-asserted-by":"crossref","unstructured":"Ruan, Q., Ostendorff, M., & Rehm, G. (2022) HiStruct+: Improving Extractive Text Summarization with Hierarchical Structure Information. In: Muresan S, Nakov P, Villavicencio A (eds) Findings of the Association for Computational Linguistics: ACL 2022, Association for Computational Linguistics, Dublin, Ireland, 1292\u20131308, https:\/\/aclanthology.org\/2022.findings-acl.102\/","DOI":"10.18653\/v1\/2022.findings-acl.102"},{"key":"9774_CR55","doi-asserted-by":"crossref","unstructured":"Schulz, K., Rauenbusch, J., Fillies, J., Rutenburg, L., Karvelas, D., & Rehm, G. (2022) User Experience Design for Automatic Credibility Assessment of News Content About COVID-19. In: Meiselwitz G, Moallem A, Zaphiris P, Ioannou A, Sottilare RA, Schwarz J, Fang X (eds) HCI International 2022 \u2013 Late Breaking Papers. Interaction in New Media, Learning and Games, Springer Nature, Cham, Switzerland, 142\u2013165, 26 June-01 July 2022","DOI":"10.1007\/978-3-031-22131-6_11"},{"key":"9774_CR56","unstructured":"Straka, M., Haji\u010d, J., & Strakov\u00e1, J. (2016) UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC\u201916), European Language Resources Association (ELRA), Portoro\u017e, Slovenia, 4290\u20134297, https:\/\/aclanthology.org\/L16-1680"},{"key":"9774_CR57","unstructured":"Str\u00f6tgen, J., & Gertz, M. (2010) HeidelTime: High Quality Rule-based Extraction and Normalization of Temporal Expressions. In: Proceedings of the 5th International Workshop on Semantic Evaluation, Association for Computational Linguistics, Stroudsburg, PA, USA, SemEval \u201910, 321\u2013324, http:\/\/dl.acm.org\/citation.cfm?id=1859664.1859735"},{"issue":"1\u20132","key":"9774_CR58","doi-asserted-by":"publisher","first-page":"1","DOI":"10.2991\/nlpr.d.200522.001","volume":"1","author":"Q Su","year":"2020","unstructured":"Su, Q., Wan, M., Liu, X., & Huang, C. R. (2020). Motivations, Methods and Metrics of Misinformation Detection: An NLP Perspective. Natural Language Processing Research, 1(1\u20132), 1\u20131. https:\/\/doi.org\/10.2991\/nlpr.d.200522.001","journal-title":"Natural Language Processing Research"},{"key":"9774_CR59","unstructured":"Tiedemann, J., & Thottingal, S. (2020) OPUS-MT \u2013 building open translation services for the world. In: Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, European Association for Machine Translation, Lisboa, Portugal, 479\u2013480, https:\/\/aclanthology.org\/2020.eamt-1.61"},{"key":"9774_CR60","doi-asserted-by":"publisher","unstructured":"Wiener, P., & Thoma, S. (2023) Streaming Language Processing in Manufacturing. In: Rehm G (ed) European Language Grid: A Language Technology Platform for Multilingual Europe, Cognitive Technologies, Springer International Publishing, Cham, Switzerland, 337\u201334https:\/\/doi.org\/10.1007\/978-3-031-17258-8_26,","DOI":"10.1007\/978-3-031-17258-8_26"},{"key":"9774_CR61","doi-asserted-by":"publisher","unstructured":"Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., Appleton, G., Axton, M., Baak, A., Blomberg, N., Boiten, J. W., da Silva Santos, L. B., Bourne, P. E., Bouwman, J., Brookes, A. J., Clark, T., Crosas, M., Dillo, I., Dumon, O., Edmunds, S., Evelo, C. T., Finkers, R., \u2026 \u2019t Hoen PAC, Hooft R, Kuhn T, Kok R, Kok J, Lusher SJ, Martone ME, Mons A, Packer AL, Persson B, Rocca-Serra P, Roos M, van Schaik R, Sansone SA, Schultes E, Sengstag T, Slater T, Strawn G, Swertz MA, Thompson M, van der Lei J, van Mulligen E, Velterop J, Waagmeester A, Wittenburg P, Wolstencroft K, Zhao J, Mons B,. (2016). The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data, 3(160018), 1. https:\/\/doi.org\/10.1038\/sdata.2016.18","DOI":"10.1038\/sdata.2016.18"},{"key":"9774_CR62","doi-asserted-by":"publisher","unstructured":"Wobbrock, JO., Hattatoglu, L., Hsu. AK., Burger, MA., & Magee, MJ. (2021) The Goldilocks zone: Young adults\u2019 credibility perceptions of online news articles based on visual appearance. New Review of Hypermedia and Multimedia 0(0):1\u20134https:\/\/doi.org\/10.1080\/13614568.2021.1889690","DOI":"10.1080\/13614568.2021.1889690"},{"key":"9774_CR63","doi-asserted-by":"crossref","unstructured":"Zhan, J., & Zhao, H. (2020) Span model for open information extraction on accurate corpus. In: The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020, The Thirty-Second Innovative Applications of Artificial Intelligence Conference, IAAI 2020, The Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2020, New York, NY, USA, February 7-12, 2020, AAAI Press, pp 9523\u20139530, https:\/\/ojs.aaai.org\/index.php\/AAAI\/article\/view\/6497","DOI":"10.1609\/aaai.v34i05.6497"},{"key":"9774_CR64","unstructured":"Zhang, J., Zhao, Y., Saleh, M., & Liu, PJ. (2020) Pegasus: Pre-training with extracted gap-sentences for abstractive summarization. In: Proceedings of the 37th International Conference on Machine Learning, JMLR.org, ICML\u201920"},{"key":"9774_CR65","doi-asserted-by":"crossref","unstructured":"Zhao, S., Talasila, M., Jacobson, G., Borcea, C., Aftab, SA., & Murray, JF. (2018) Packaging and Sharing Machine Learning Models via the Acumos AI Open Platform. In: 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA), IEEE, 841\u2013846","DOI":"10.1109\/ICMLA.2018.00135"},{"key":"9774_CR66","doi-asserted-by":"publisher","unstructured":"Zhou, X., Mulay, A,. Ferrara, E., & Zafarani, R. (2020) ReCOVery: A Multimodal Repository for COVID-19 News Credibility Research. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management, ACM, Virtual Event Ireland, 3205\u2013321https:\/\/doi.org\/10.1145\/3340531.3412880","DOI":"10.1145\/3340531.3412880"}],"container-title":["Language Resources and Evaluation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10579-024-09774-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10579-024-09774-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10579-024-09774-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,12,9]],"date-time":"2025-12-09T05:15:41Z","timestamp":1765257341000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10579-024-09774-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,12,7]]},"references-count":67,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025,12]]}},"alternative-id":["9774"],"URL":"https:\/\/doi.org\/10.1007\/s10579-024-09774-4","relation":{},"ISSN":["1574-020X","1574-0218"],"issn-type":[{"value":"1574-020X","type":"print"},{"value":"1574-0218","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,12,7]]},"assertion":[{"value":"27 June 2024","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 December 2024","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no Conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}