{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,9]],"date-time":"2025-12-09T06:12:46Z","timestamp":1765260766275,"version":"3.46.0"},"reference-count":36,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2024,10,12]],"date-time":"2024-10-12T00:00:00Z","timestamp":1728691200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,10,12]],"date-time":"2024-10-12T00:00:00Z","timestamp":1728691200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001871","name":"Funda\u00e7\u00e3o para a Ci\u00eancia e a Tecnologia","doi-asserted-by":"publisher","award":["PINFRA\/22117\/2016"],"award-info":[{"award-number":["PINFRA\/22117\/2016"]}],"id":[{"id":"10.13039\/501100001871","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100005765","name":"Universidade de Lisboa","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100005765","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Lang Resources &amp; Evaluation"],"published-print":{"date-parts":[[2025,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>While language processing services are key assets for the science and technology of language, the possible ways under which they may be made available to the widest range of their end users are critical to support an Open Science policy for this scientific domain. Although providing such processing services under some web-based interface, at large, offers itself as an immediate and cogent response to that challenge, turning this view into an effective access to language processing services is an undertaking deserving a clear conceptual direction and a corresponding robust empirical validation. Based on an extensive overview of major undertakings towards making language processing tools available and on the design principles worked out and implemented in the PORTULAN CLARIN infrastructure, in this paper we advocate for a Research-Infrastructure-as-a-Service (RIaaS) model. This model unleashes accessibility to language processing services in as many web-based interface modalities as the current stage of technological development permits to support, in order to serve as many types of end users as possible, from IT developers to Digital Humanities researchers, and including citizen scientists, teachers, students and digital artists among many others.<\/jats:p>","DOI":"10.1007\/s10579-024-09772-6","type":"journal-article","created":{"date-parts":[[2024,10,12]],"date-time":"2024-10-12T09:02:02Z","timestamp":1728723722000},"page":"4391-4420","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["From greatest simplicity to full power"],"prefix":"10.1007","volume":"59","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3119-4189","authenticated-orcid":false,"given":"Lu\u00eds","family":"Gomes","sequence":"first","affiliation":[]},{"given":"Ant\u00f3nio","family":"Branco","sequence":"additional","affiliation":[]},{"given":"Jo\u00e3o","family":"Silva","sequence":"additional","affiliation":[]},{"given":"Ruben","family":"Branco","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,10,12]]},"reference":[{"key":"9772_CR1","unstructured":"Barreto, F., Branco, A., Ferreira, E., Mendes, A., Bacelar do Nascimento, M. F., Nunes, F., & Silva, J. R. (2006). Open resources and tools for the shallow processing of Portuguese: The TagShare project. In Proceedings of the 5th international conference on language resources and evaluation (LREC) (pp. 1438\u20131443)."},{"key":"9772_CR2","unstructured":"Branco, A., Castro, S., Silva, F., & Costa, F. (2011). CINTIL DepBank handbook: Design options for the representation of grammatical dependencies. Technical Report DI-FCUL-TR-2011-03, University of Lisbon."},{"key":"9772_CR8","unstructured":"Branco, A., Costa, F., Martins, P., Nunes, F., Silva, J., & Silveira, S. (2008). LXService: Web services of language technology for Portuguese. In Proceedings of the 6th international conference on language resources and evaluation (LREC) (pp. 2577\u20132583)."},{"key":"9772_CR3","unstructured":"Branco, A., Costa, F., Silva, J., Silveira, S., Castro, S., Avel\u00e3s, M., Pinto, C., & Gra\u00e7a, J. (2010). Developing a deep linguistic databank supporting a collection of treebanks: the CINTIL DeepGramBank. In Proceedings of the 7th international conference on language resources and evaluation (LREC) (pp. 1810\u20131815)."},{"key":"9772_CR9","unstructured":"Branco, A., & Henriques, T. (2003). Aspects of verbal inflection and lemmatization: Generalizations and algorithms. In Proceedings of XVIII annual meeting of the portuguese association of linguistics (APL) (pp. 201\u2013210)."},{"key":"9772_CR4","unstructured":"Branco, A., Mendes, A., Quaresma, P., Gomes, L., Silva, J., & Teixeira, A. (2020). Infrastructure for the science and technology of language PORTULAN CLARIN. In Proceedings of the 1st international workshop on language technology platforms (pp. 1\u20137). European Language Resources Association."},{"key":"9772_CR5","doi-asserted-by":"crossref","unstructured":"Branco, A., & Nunes, F. (2012). Verb analysis in a highly inflective language with an MFF algorithm. In Proceedings of the 11th international conference on the computational processing of Portuguese (PROPOR), number 7243 in Lecture Notes in Artificial Intelligence (pp. 1\u201311). Springer.","DOI":"10.1007\/978-3-642-28885-2_1"},{"key":"9772_CR6","doi-asserted-by":"crossref","unstructured":"Branco, A., Rodrigues, J., Silva, J., Costa, F., & Vaz, R. (2014). Assessing automatic text classification for interactive language learning. In Proceedings of the IEEE international conference on information society (iSociety) (pp. 72\u201380).","DOI":"10.1109\/i-Society.2014.7009014"},{"key":"9772_CR7","doi-asserted-by":"crossref","unstructured":"Branco, A., & Silva, J. (2006). A suite of shallow processing tools for Portuguese: LX-Suite. In Proceedings of the 11th conference of the European chapter of the association for computational linguistics (EACL) (pp. 179\u2013182).","DOI":"10.3115\/1608974.1609003"},{"key":"9772_CR10","unstructured":"Costa, F., & Branco, A. (2012). Aspectual type and temporal relation classification. In Proceedings of the 13th conference of the European chapter of the association for computational linguistics (pp. 266\u2013275)."},{"key":"9772_CR11","doi-asserted-by":"crossref","unstructured":"Cruz, A. F., Rocha, G., & Cardoso, H. L.. (2018). Exploring Spanish corpora for Portuguese coreference resolution. In 2018 fifth international conference on social networks analysis, management and security (SNAMS) (pp. 290\u2013295).","DOI":"10.1109\/SNAMS.2018.8554705"},{"key":"9772_CR12","unstructured":"Dale, R. (2020). Text analytics APIs: A consumer guide. The Language Technology Group."},{"key":"9772_CR13","doi-asserted-by":"crossref","unstructured":"de\u00a0Jong, F., Van Uytvanck, D., Frontini, F., van\u00a0den Bosch, A., Fis\u030cer, D., & Witt, A. (2022). Language matters. The European research infrastructure clarin, today and tomorrow. In D. Fis\u030cer & A. Witt, (Eds.), CLARIN. The infrastructure for language resources (pp. 31 \u2013 57). de Gruyter.","DOI":"10.1515\/9783110767377-002"},{"key":"9772_CR14","unstructured":"Eskevich, M., de\u00a0Jong, F., K\u00f6nig, A., Fi\u0161er, D., Van\u00a0Uytvanck, D., Aalto, T., Borin, L., Gerassimenko, O., Hajic, J., van\u00a0den Heuvel, H., Kahusk, N., Liin, K., Matthiesen, M., Piperidis, S., & Vider, K. (2020). CLARIN: Distributed language resources and technology in a European infrastructure. In Proceedings of the 1st international workshop on language technology platforms (pp. 28\u201334). European Language Resources Association."},{"key":"9772_CR24","unstructured":"European Strategy\u00a0Forum on\u00a0Research Infrastructures\u00a0(ESFRI). (2020). Making science happen\u2014a new ambition for research infrastructures in the European research area. White paper."},{"key":"9772_CR25","unstructured":"European Strategy\u00a0Forum on\u00a0Research Infrastructures\u00a0(ESFRI). (2021). Roadmap 2021\u2014strategy report on research infrastructures. White paper."},{"key":"9772_CR15","unstructured":"Goosen, T., & Eckart, T. (2014). Virtual language observatory 3.0: What\u2019s new. In CLARIN annual conference."},{"key":"9772_CR16","doi-asserted-by":"crossref","unstructured":"Hajic\u0306, J., Hajic\u0306ov\u00e0, E., Hladk\u00e0, B., Mis\u030cutka, J., Kos\u030carko, O., & Stran\u0306\u00e0k, P. (2022). Lindat\/clariah-cz: Where we are and where we go. In D. Fis\u030cer & A. Witt (Eds.), CLARIN\u2014the infrastructure for language resources (pp. 61\u201382). De Gruyter.","DOI":"10.1515\/9783110767377-003"},{"key":"9772_CR17","unstructured":"Hinrichs, E. W., Hinrichs, M., & Zastrow, T. (2010). WebLicht: Web-based LRT services for German. In Proceedings of the ACL 2010 system demonstrations (pp. 25\u201329)."},{"volume-title":"The language grid: Service-oriented collective intelligence for language resource interoperability","year":"2011","key":"9772_CR18","unstructured":"Ishida, T. (Ed.). (2011). The language grid: Service-oriented collective intelligence for language resource interoperability. Springer."},{"key":"9772_CR19","unstructured":"Jones, S., Dimper, R., Dillo, I., Hanahoe, H., & Kurapati, S. (2021). Making the European Open Science Cloud (EOSC) work: Where to go from here?."},{"key":"9772_CR20","doi-asserted-by":"crossref","unstructured":"Jupyter, P., Bussonnier, M., Forde, J., Freeman, J., Granger, B., Head, T., Holdgraf, C., Kelley, K., Nalvarte, G., Osheroff, A., Pacer, M., Panda, Y., Perez, F., Ragan-Kelley, B., & Willing, C. (2018). Binder 2.0\u2014reproducible, interactive, sharable environments for science at scale. In F. Akici, D. Lippa, D. Niederhut, & M.\u00a0Pacer (Eds.), Proceedings of the 17th Python in science conference (pp. 113\u2013120).","DOI":"10.25080\/Majora-4af1f417-011"},{"issue":"2","key":"9772_CR21","doi-asserted-by":"publisher","first-page":"97","DOI":"10.1093\/comjnl\/27.2.97","volume":"27","author":"DE Knuth","year":"1984","unstructured":"Knuth, D. E. (1984). Literate programming. The Computer Journal, 27(2), 97\u2013111.","journal-title":"The Computer Journal"},{"key":"9772_CR22","doi-asserted-by":"crossref","unstructured":"Lagoze, C., Van\u00a0de Sompel, H., Nelson, M., & Warner, S. (2002). Open archives initiative-protocol for metadata harvesting-v. 2.0. http:\/\/www.openarchives.org\/OAI\/openarchivesprotocol.html","DOI":"10.1108\/07378830310479776"},{"key":"9772_CR23","unstructured":"Miranda, N., Raminhos, R., Seabra, P., Sequeira, J., Gon\u00e7alves, T., & Quaresma, P. (2011). Named entity recognition using machine learning techniques. In EPIA-11, 15th Portuguese conference on artificial intelligence (pp. 818\u2013831)."},{"key":"9772_CR26","unstructured":"Rehm, G., Berger, M., Elsholz, E., Hegele, S., Kintzel, F., Marheinecke, K., Piperidis, S., Deligiannis, M., Galanis, D., Gkirtzou, K., Labropoulou, P., Bontcheva, K., Jones, D., Roberts, I., Haji\u010d, J., Hamrlov\u00e1, J., Ka\u010dena, L., Choukri, K., Arranz, V., Vasil\u0327jevs, A., Anvari, O., Lagzdi\u0146\u0161, A., Mel\u0327\u0146ika, J., Backfried, G., Dikici, E., Janosik, M., Prinz, K., Prinz, C., Stampler, S., Thomas-Aniola, D., G\u00f3mez-P\u00e9rez, J. M., Silva, A. G., Berr\u00edo, C., Germann, U., Renals, S., & Klejch, O. (2020). European language grid: An overview. In Proceedings of the twelfth language resources and evaluation conference (pp. 3366\u20133380). European Language Resources Association."},{"key":"9772_CR27","unstructured":"Rehm, G., Marheinecke, K., Hegele, S., Piperidis, S., Bontcheva, K., Haji\u010d, J., Choukri, K., Vasil\u0327jevs, A., Backfried, G., Prinz, C., G\u00f3mez-P\u00e9rez, J. M., Meertens, L., Lukowicz, P., van Genabith, J., L\u00f6sch, A., Slusallek, P., Irgens, M., Gatellier, P., K\u00f6hler, J., Le\u00a0Bars, L., Anastasiou, D., Auksori\u016bt\u0117, A., Bel, N., Branco, A., Budin, G., Daelemans, W., De\u00a0Smedt, K., Garab\u00edk, R., Gavriilidou, M., Gromann, D., Koeva, S., Krek, S., Krstev, C., Lind\u00e9n, K., Magnini, B., Odijk, J., Ogrodniczuk, M., R\u00f6gnvaldsson, E., Rosner, M., Pedersen, B., Skadi\u0146a, I., Tadi\u0107, M., Tufis, D., V\u00e1radi, T., Vider, K., Way, A., & Yvon, F. (2020). The European language technology landscape in 2020: Language-centric and human-centric AI for cross-cultural communication in multilingual Europe. In Proceedings of the Twelfth Language Resources and Evaluation Conference (pp. 3322\u20133332). European Language Resources Association."},{"key":"9772_CR29","doi-asserted-by":"crossref","unstructured":"Rodrigues, J., Branco, A., Neale, S., & Silva, J. (2016). LX-DSemVectors: Distributional semantics models for the Portuguese language. In Proceedings of the 12th international conference on the computational processing of Portuguese (PROPOR\u201916) (pp. 259\u2013270).","DOI":"10.1007\/978-3-319-41552-9_27"},{"key":"9772_CR28","unstructured":"Rodrigues, J., Costa, F., Silva, J., & Branco, A. (2020). Automatic syllabification of Portuguese. Revista da Associa\u00e7\u00e3o Portuguesa de Lingu\u00edstica, (1)."},{"key":"9772_CR30","doi-asserted-by":"crossref","unstructured":"Santos, R., Silva, J., Branco, A., & Xiong, D. (2019). The direct path may not be the best: Portuguese-Chinese neural machine translation. In Proceedings of the 19th EPIA conference on artificial intelligence (pp. 757\u2013768).","DOI":"10.1007\/978-3-030-30244-3_62"},{"key":"9772_CR31","doi-asserted-by":"crossref","unstructured":"Silva, J., Branco, A., Castro, S., & Reis, R. (2009). Out-of-the-box robust parsing of Portuguese. In Proceedings of the 9th international conference on language resources and evaluation (LREC) (pp. 75\u201385).","DOI":"10.1007\/978-3-642-12320-7_10"},{"key":"9772_CR32","unstructured":"Van\u00a0Uytvanck, D., Stehouwer, H., & Lampen, L. (2012). Semantic metadata mapping in practice: the virtual language observatory. In LREC 2012: 8th international conference on language resources and evaluation, (pp. 1029\u20131034). European Language Resources Association (ELRA)."},{"key":"9772_CR33","unstructured":"Veiga, A., Candeias, S., & Perdig\u00e3o, F. (2011). Generating a pronunciation dictionary for European Portuguese using a joint-sequence model with embedded stress assignment. In Proceedings of the 8th Brazilian symposium in information and human language technology."},{"key":"9772_CR34","doi-asserted-by":"crossref","unstructured":"Windhouwer, M., & Goosen., T. (2022). Component metadata infrastructure. In D. Fis\u030cer & A. Witt, (Eds.), CLARIN\u2014the infrastructure for language resources (pp. 191\u2013222). De Gruyter .","DOI":"10.1515\/9783110767377-008"},{"issue":"4","key":"9772_CR35","doi-asserted-by":"publisher","first-page":"631","DOI":"10.1162\/coli_a_00329","volume":"44","author":"C Zinn","year":"2018","unstructured":"Zinn, C. (2018). The language resource switchboard. Computational Linguistics, 44(4), 631\u2013639.","journal-title":"Computational Linguistics"},{"key":"9772_CR36","doi-asserted-by":"crossref","unstructured":"Zinn, C., & Campbell, B. (2023). Weblicht-batch\u2014a web-based interface for batch processing large input with the weblicht workflow engine. In CLARIN annual conference (pp. 133\u2013141).","DOI":"10.3384\/ecp198013"}],"container-title":["Language Resources and Evaluation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10579-024-09772-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10579-024-09772-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10579-024-09772-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,12,9]],"date-time":"2025-12-09T05:15:27Z","timestamp":1765257327000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10579-024-09772-6"}},"subtitle":["Research-Infrastructure-as-a-Service for language science and technology"],"short-title":[],"issued":{"date-parts":[[2024,10,12]]},"references-count":36,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2025,12]]}},"alternative-id":["9772"],"URL":"https:\/\/doi.org\/10.1007\/s10579-024-09772-6","relation":{},"ISSN":["1574-020X","1574-0218"],"issn-type":[{"type":"print","value":"1574-020X"},{"type":"electronic","value":"1574-0218"}],"subject":[],"published":{"date-parts":[[2024,10,12]]},"assertion":[{"value":"26 August 2024","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 October 2024","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors are (or have been) part of the PORTULAN CLARIN staff, with the following roles: Lu\u00eds Gomes as technical manager, Ant\u00f3nio Branco as the director general, Jo\u00e3o Silva as scientific resources and users support manager, and Ruben Branco as developer.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}