{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,5,18]],"date-time":"2025-05-18T15:40:02Z","timestamp":1747582802772,"version":"3.40.5"},"reference-count":37,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2024,7,6]],"date-time":"2024-07-06T00:00:00Z","timestamp":1720224000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,7,6]],"date-time":"2024-07-06T00:00:00Z","timestamp":1720224000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"ILC - PISA"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Lang Resources &amp; Evaluation"],"published-print":{"date-parts":[[2025,6]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>The paper presents ParlaMint-It, a new treebank of Italian parliamentary debates, linguistically annotated based on the Universal Dependencies (UD) framework. The resource comprises 20,460 tokens and represents a hybrid language variety that is underrepresented in the UD initiative. ParlaMint-It results from a manual revision process that relies on a semi-automatic methodology able to identify sentences that are most likely to contain inconsistencies and recurrent error patterns generated by the automatic annotation. Such a method made the revision process faster and more efficient than revising the entire treebank. In addition, it allowed the identification and correction of annotation errors resulting from linguistic constructions inconsistently represented in UD treebanks and from characteristics specific to parliamentary speeches. Hence, the treebank is deemed as an 18-karat resource, since, although not fully manually revised, it is a valuable resource for researchers working on Italian language processing tasks.<\/jats:p>","DOI":"10.1007\/s10579-024-09748-6","type":"journal-article","created":{"date-parts":[[2024,7,6]],"date-time":"2024-07-06T12:01:38Z","timestamp":1720267298000},"page":"1659-1683","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Parlamint-it: an 18-karat UD treebank of Italian parliamentary speeches"],"prefix":"10.1007","volume":"59","author":[{"given":"Chiara","family":"Alzetta","sequence":"first","affiliation":[]},{"given":"Simonetta","family":"Montemagni","sequence":"additional","affiliation":[]},{"given":"Marta","family":"Sartor","sequence":"additional","affiliation":[]},{"given":"Giulia","family":"Venturi","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,7,6]]},"reference":[{"key":"9748_CR1","unstructured":"Agnoloni, T., Bartolini, R., & Frontini, F., et al. (2022). Making Italian parliamentary records machine-actionable: The construction of the parlamint-it corpus. In Proceedings of the workshop ParlaCLARIN III within the 13th language resources and evaluation conference. European Language Resources Association, Marseille, France, pp. 117\u2013124."},{"key":"9748_CR2","doi-asserted-by":"crossref","unstructured":"Agrawal, B., Agarwal, R., Husain, S., et\u00a0al. (2013). An automatic approach to treebank error detection using a dependency parser (pp. 294\u2013303). Springer.","DOI":"10.1007\/978-3-642-37247-6_24"},{"key":"9748_CR3","unstructured":"Alzetta, C., Dell\u2019Orletta, F., & Montemagni, S., et\u00a0al. (2017). Dangerous relations in dependency treebanks. In Proceedings of the 16th international workshop on treebanks and linguistic theories (pp 201\u2013210)."},{"issue":"6\u20132","key":"9748_CR4","doi-asserted-by":"publisher","first-page":"37","DOI":"10.4000\/ijcol.719","volume":"6","author":"C Alzetta","year":"2020","unstructured":"Alzetta, C., Dell\u2019Orletta, F., Montemagni, S., et al. (2020). Linguistically-driven selection of difficult-to-parse dependency structures. IJCoL Italian Journal of Computational Linguistics, 6(6\u20132), 37\u201360.","journal-title":"IJCoL Italian Journal of Computational Linguistics"},{"key":"9748_CR5","unstructured":"Ambati, B. R., Agarwal, R., & Gupta, M. et al. (2011). Error detection for treebank validation. In Proceedings of 9th international workshop on Asian Language Resources (ALR)."},{"key":"9748_CR6","unstructured":"Arnard\u00f3ttir \u00de, Hafsteinsson, H., & Sigur\u00f0sson, E. F., et al. (2020). A universal dependencies conversion pipeline for a Penn-format constituency treebank. In Proceedings of the fourth workshop on universal dependencies (UDW 2020). Association for Computational Linguistics, Barcelona, Spain (Online) (pp. 16\u201325)."},{"key":"9748_CR7","unstructured":"Bosco, C., Montemagni, S., & Simi, M. (2013). Converting Italian treebanks: Towards an Italian Stanford dependency treebank. In Proceedings of the 7th linguistic annotation workshop and interoperability with discourse. Association for Computational Linguistics, Sofia, Bulgaria (pp. 61\u201369)."},{"issue":"2","key":"9748_CR8","doi-asserted-by":"publisher","first-page":"113","DOI":"10.1007\/s11168-008-9051-9","volume":"6","author":"A Boyd","year":"2008","unstructured":"Boyd, A., Dickinson, M., & Meurers, W. D. (2008). On detecting errors in dependency treebanks. Research on Language & Computation, 6(2), 113\u2013137.","journal-title":"Research on Language & Computation"},{"key":"9748_CR9","unstructured":"Croft, W. B., Nordquist, D., & Looney, K., et al. (2017). Linguistic typology meets universal dependencies. In International workshop on treebanks and linguistic theories"},{"key":"9748_CR10","first-page":"125","volume":"2","author":"F Dell\u2019Orletta","year":"2013","unstructured":"Dell\u2019Orletta, F., Venturi, G., & Montemagni, S. (2013). Linguistically-driven selection of correct arcs for dependency parsing. Computaci\u00f2n y Sistemas, 2, 125\u2013136.","journal-title":"Computaci\u00f2n y Sistemas"},{"issue":"2","key":"9748_CR11","first-page":"125","volume":"17","author":"F Dell\u2019Orletta","year":"2013","unstructured":"Dell\u2019Orletta, F., Venturi, G., & Montemagni, S. (2013). Linguistically-driven selection of correct arcs for dependency parsing. Computaci\u00f3n y Sistemas, 17(2), 125\u2013136.","journal-title":"Computaci\u00f3n y Sistemas"},{"key":"9748_CR12","unstructured":"Dickinson, M., & Meurers, W. D. (2003). Detecting inconsistencies in treebank. In Proceedings of the second workshop on treebanks and linguistic theories (TLT 2003)."},{"key":"9748_CR13","doi-asserted-by":"crossref","unstructured":"Dickinson, M., & Meurers, W. D. (2005). Detecting errors in discontinuous structural annotation. In Proceedings of the 43rd annual meeting of the ACL (pp. 322\u2013329).","DOI":"10.3115\/1219840.1219880"},{"key":"9748_CR14","doi-asserted-by":"publisher","unstructured":"Erjavec, T., & Pan\u010dur, A. (2019). Parla-CLARIN TEI guidelines for corpora of parliamentary proceedings. https:\/\/doi.org\/10.5281\/zenodo.3446164","DOI":"10.5281\/zenodo.3446164"},{"key":"9748_CR15","doi-asserted-by":"publisher","DOI":"10.1007\/s10579-021-09574-0","author":"T Erjavec","year":"2022","unstructured":"Erjavec, T., Ogrodniczuk, M., Osenova, P., et al. (2022). The parlamint corpora of parliamentary proceedings. Language Resources and Evaluation. https:\/\/doi.org\/10.1007\/s10579-021-09574-0","journal-title":"Language Resources and Evaluation"},{"key":"9748_CR16","unstructured":"Fi\u0161er, D., Eskevich, M., de\u00a0Jong, F. (eds.) (2020). Proceedings of the second ParlaCLARIN Workshop, European Language Resources Association (ELRA), Marseille, France. https:\/\/www.aclweb.org\/anthology\/2020.parlaclarin-1.0"},{"key":"9748_CR17","unstructured":"Fi\u0161er, D., Eskevich, M., & Lenardi\u010d, J. et\u00a0al. (eds.). (2022). Proceedings of the workshop ParlaCLARIN III within the 13th language resources and evaluation conference. European Language Resources Association, Marseille, France. https:\/\/aclanthology.org\/2022.parlaclariniii-1"},{"key":"9748_CR18","unstructured":"Fi\u0161er, D., Eskevich, M., de\u00a0Jong, F. (eds.). (2018). Proceedings of LREC 2018 workshop ParlaCLARIN Creating and using parliamentary corpora, European Language Resources Association (ELRA), Paris, France. http:\/\/lrec-conf.org\/workshops\/lrec2018\/W2\/pdf\/book_of_proceedings.pdf"},{"key":"9748_CR19","first-page":"895","volume":"2012","author":"K Fort","year":"2012","unstructured":"Fort, K., Nazarenko, A., & Rosset, S. (2012). Modeling the complexity of manual annotation tasks: A grid of analysis. Proceedings of COLING, 2012, 895\u2013910.","journal-title":"Proceedings of COLING"},{"key":"9748_CR20","doi-asserted-by":"crossref","unstructured":"Hladk\u00e1, B., Hajic, J., Hana, J., et\u00a0al. (2008). The czech academic corpus 2.0 guide. The Prague Bulletin of Mathematical Linguistics, 89, 41.","DOI":"10.2478\/v10108-009-0003-9"},{"key":"9748_CR21","doi-asserted-by":"crossref","unstructured":"Ilie, C. (2015). Parliamentary discourse. The International Encyclopedia of language and social interaction (pp. 1\u201315).","DOI":"10.1002\/9781118611463.wbielsi201"},{"key":"9748_CR22","volume-title":"Speech and language processing (3rd edn)","author":"D Jurafsky","year":"2023","unstructured":"Jurafsky, D., & Martin, J. H. (2023). Speech and language processing (3rd edn). Prentice-Hall."},{"key":"9748_CR23","doi-asserted-by":"publisher","unstructured":"Kondratyuk, D., & Straka, M. (2019). 75 languages, 1 model: Parsing universal dependencies universally. In Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP). Association for Computational Linguistics, Hong Kong, China (pp. 2779\u20132795). https:\/\/doi.org\/10.18653\/v1\/D19-1279, https:\/\/aclanthology.org\/D19-1279","DOI":"10.18653\/v1\/D19-1279"},{"key":"9748_CR24","unstructured":"Kr\u00ed\u017e, V., Hladk\u00e1, B., & Ure\u0161ov\u00e1, Z. (2016) .Czech legal text treebank 1.0. In Proceedings of the tenth international conference on language resources and evaluation (LREC\u201916) (pp. 2387\u20132392)."},{"key":"9748_CR25","doi-asserted-by":"crossref","unstructured":"Kuhlmann, M., & Nivre, J. (2006). Mildly non-projective dependency structures. In Proceedings of the COLING\/ACL 2006 main conference poster sessions (pp. 507\u2013514).","DOI":"10.3115\/1273073.1273139"},{"key":"9748_CR26","unstructured":"Lynn, T., & Foster, J. (2016). Universal dependencies for Irish. In Proceedings of the second Celtic language technology workshop."},{"key":"9748_CR27","unstructured":"de\u00a0Marneffe, M., Grioni, M., & Kanerva, J., et\u00a0al. (2017). Assessing the annotation consistency of the universal dependencies corpora. In Proceedings of the 4th international conference on dependency linguistics (Depling 2007), Pisa, Italy (pp. 108\u2013115)."},{"issue":"2","key":"9748_CR28","doi-asserted-by":"publisher","first-page":"308","DOI":"10.1162\/coli_a_00402","volume":"47","author":"MC de Marneffe","year":"2021","unstructured":"de Marneffe, M. C., Manning, C. D., Nivre, J., et al. (2021). Universal dependencies. Computational Linguistics, 47(2), 308. https:\/\/doi.org\/10.1162\/coli_a_00402","journal-title":"Computational Linguistics"},{"key":"9748_CR29","unstructured":"McDonald, R., & Nivre, J. (2007). Characterizing the errors of data-driven dependency parsing models. In Proceedings of the 2007 joint conference on empirical methods in natural language processing and computational natural language learning (EMNLP-CoNLL). Association for Computational Linguistics, Prague, Czech Republic (pp. 122\u2013131). https:\/\/www.aclweb.org\/anthology\/D07-1013"},{"key":"9748_CR30","doi-asserted-by":"publisher","unstructured":"M\u00fcller-Eberstein, M., van\u00a0der Goot, R., & Plank, B. (2021a). Genre as weak supervision for cross-lingual dependency parsing. In Proceedings of the 2021 conference on empirical methods in natural language processing. Association for Computational Linguistics, Online and Punta Cana, Dominican Republic (pp. 4786\u20134802)https:\/\/doi.org\/10.18653\/v1\/2021.emnlp-main.393, https:\/\/aclanthology.org\/2021.emnlp-main.393","DOI":"10.18653\/v1\/2021.emnlp-main.393"},{"key":"9748_CR31","unstructured":"M\u00fcller-Eberstein, M., van\u00a0der Goot, R., & Plank, B. (2021b). How universal is genre in universal dependencies? In Proceedings of the 20th international workshop on treebanks and linguistic theories (TLT, SyntaxFest 2021). Association for Computational Linguistics, Sofia, Bulgaria (pp. 69\u201385). https:\/\/aclanthology.org\/2021.tlt-1.7"},{"key":"9748_CR32","unstructured":"Nencioni, G. (1976). Parlato-parlato, parlato-scritto, parlato-recitato. Strumenti critici (29)."},{"key":"9748_CR33","unstructured":"Nivre, J., de\u00a0Marneffe, M. C., Ginter, F., et\u00a0al. (2020). Universal dependencies v2: An evergrowing multilingual treebank collection. In Proceedings of the twelfth language resources and evaluation conference. European Language Resources Association, Marseille, France, pp. 4034\u20134043. https:\/\/aclanthology.org\/2020.lrec-1.497"},{"key":"9748_CR34","unstructured":"Pyysalo, S., Kanerva, J., & Missil\u00e4, A., et\u00a0al. (2015). Universal Dependencies for Finnish. In Proceedings of NoDaLiDa 2015. NEALT, pp 163\u2013172, https:\/\/aclweb.org\/anthology\/W\/W15\/W15-1821.pdf"},{"key":"9748_CR35","doi-asserted-by":"publisher","unstructured":"Qi, P., & Zhang, Y., Zhang, Y., et\u00a0al. (2020). Stanza: A python natural language processing toolkit for many human languages. In Proceedings of the 58th annual meeting of the Association for Computational Linguistics: System demonstrations. Association for Computational Linguistics, Online, pp. 101\u2013108, https:\/\/doi.org\/10.18653\/v1\/2020.acl-demos.14, https:\/\/aclanthology.org\/2020.acl-demos.14","DOI":"10.18653\/v1\/2020.acl-demos.14"},{"key":"9748_CR36","doi-asserted-by":"crossref","unstructured":"Sanguinetti, M., & Bosco, C. (2015). Parttut: The Turin University parallel treebank. In Italian natural language processing within the PARLI project","DOI":"10.1007\/978-3-319-14206-7_3"},{"key":"9748_CR37","unstructured":"Volokh, A, & Neumann, G. (2011). Automatic detection and correction of errors in dependency treebanks. In Proceedings of ACL-HLT 2011."}],"container-title":["Language Resources and Evaluation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10579-024-09748-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10579-024-09748-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10579-024-09748-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,5,18]],"date-time":"2025-05-18T15:03:29Z","timestamp":1747580609000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10579-024-09748-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,7,6]]},"references-count":37,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2025,6]]}},"alternative-id":["9748"],"URL":"https:\/\/doi.org\/10.1007\/s10579-024-09748-6","relation":{},"ISSN":["1574-020X","1574-0218"],"issn-type":[{"type":"print","value":"1574-020X"},{"type":"electronic","value":"1574-0218"}],"subject":[],"published":{"date-parts":[[2024,7,6]]},"assertion":[{"value":"30 April 2024","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 July 2024","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they do not have any conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}},{"value":"Our work has limited ethical implications since we mainly introduced a novel treebank enriched with morpho-syntactic annotations compliant with the Universal Dependencies standard. The ParlaMint treebank from which ParlaMint-It originates was used in compliance with the Terms of Use and the resources and materials produced during this study will be distributed in compliance with the license agreement of the UD project.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethical approval"}}]}}