{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T13:13:10Z","timestamp":1740143590114,"version":"3.37.3"},"reference-count":36,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,2,21]],"date-time":"2021-02-21T00:00:00Z","timestamp":1613865600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,2,21]],"date-time":"2021-02-21T00:00:00Z","timestamp":1613865600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100004569","name":"Ministerstwo Nauki i Szkolnictwa Wyzszego","doi-asserted-by":"crossref","award":["CLARIN-PL"],"award-info":[{"award-number":["CLARIN-PL"]}],"id":[{"id":"10.13039\/501100004569","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100004569","name":"Ministerstwo Nauki i Szkolnictwa Wyzszego","doi-asserted-by":"publisher","award":["CLARIN-PL"],"award-info":[{"award-number":["CLARIN-PL"]}],"id":[{"id":"10.13039\/501100004569","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Lang Resources &amp; Evaluation"],"published-print":{"date-parts":[[2021,3]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>This paper reports on the developments in three interrelated linguistic resources for Polish. The first is \u015awigra\u00a02\u2014a rule based constituency parser for Polish. The second is Sk\u0142adnica\u2014a treebank built using \u015awigra\u00a02. The third resource is valency dictionary Walenty, which became available when the work on the first two was already advanced. However, since the dictionary is much more comprehensive than the ad-hoc dictionary used previously with \u015awigra, a decision was made to switch the parser and the treebank to the new dictionary. The switch required several modifications to the \u015awigra\u00a02 parser, including implementation of unlike coordination, introducing semantically motivated phrases, and non-standard case values. A semi-automated procedure to upgrade previously disambiguated trees in Sk\u0142adnica was required as well. Modifications introduced in the treebank during the upgrade included systematic changes of notation and resolving newly introduced ambiguities resulting from the use of the more detailed distinctions made in the dictionary. The procedure for confronting Sk\u0142adnica with the trees generated with the new version of the \u015awigra\u00a02 parser using the Walenty dictionary allowed us to check all of these resources for consistency. This resulted in several corrections being introduced in both the treebank and the valency dictionary.<\/jats:p>","DOI":"10.1007\/s10579-020-09511-7","type":"journal-article","created":{"date-parts":[[2021,2,21]],"date-time":"2021-02-21T14:02:45Z","timestamp":1613916165000},"page":"209-239","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Sk\u0142adnica: a\u00a0constituency treebank of Polish harmonised with the Walenty valency dictionary"],"prefix":"10.1007","volume":"55","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7498-1484","authenticated-orcid":false,"given":"Marcin","family":"Woli\u0144ski","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9024-6191","authenticated-orcid":false,"given":"El\u017cbieta","family":"Hajnicz","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2021,2,21]]},"reference":[{"key":"9511_CR1","doi-asserted-by":"publisher","first-page":"103","DOI":"10.1007\/978-94-010-0201-1_7","volume-title":"Treebanks: Building and using parsed corpora, language and speech","author":"A B\u00f6hmov\u00e1","year":"2003","unstructured":"B\u00f6hmov\u00e1, A., Haji\u010dov\u00e1, E., Haji\u010d, J., & Hladk\u00e1, B. (2003). The Prague dependency treebank: A three-level annotation scenario. In A. Abeill\u00e9 (Ed.), Treebanks: Building and using parsed corpora, language and speech (pp. 103\u2013127). Dordrecht: Kluwer Academic Publishers."},{"issue":"3","key":"9511_CR2","doi-asserted-by":"publisher","first-page":"235","DOI":"10.1093\/ijl\/16.3.235","volume":"16","author":"CJ Fillmore","year":"2003","unstructured":"Fillmore, C. J., Johnson, C. R., & Petruck, M. R. L. (2003). Background to FrameNet. International Journal of Lexicography, 16(3), 235\u2013250.","journal-title":"International Journal of Lexicography"},{"key":"9511_CR3","first-page":"54","volume-title":"Insight into Slovak and Czech Corpus Linguistics","author":"J Haji\u010d","year":"2005","unstructured":"Haji\u010d, J. (2005). Complex corpus annotation: The Prague dependency treebank. In M. \u0160imkov\u00e1 (Ed.), Insight into Slovak and Czech Corpus Linguistics (pp. 54\u201373). Bratislava: Veda."},{"key":"9511_CR4","unstructured":"Hajnicz, E., Andrzejczuk, A., & Bartosiak, T. (2016a). Semantic layer of the valence dictionary of Polish Walenty. In N. Calzolari, K. Choukri, T. Declerck, M. Grobelnik, B. Maegaard, J. Mariani, A. Moreno, J. Odijk, & S. Piperidis (Eds.), Proceedings of the Tenth International Conference on Language Resources and Evaluation, LREC\u00a02016, ELRA (pp. 2625\u20132632). Portoro\u017e: European Language Resources Association (ELRA). http:\/\/www.lrec-conf.org\/proceedings\/lrec2016\/index.html."},{"key":"9511_CR5","unstructured":"Hajnicz, E., Patejuk, A., Przepi\u00f3rkowski, A., & Woli\u0144ski, M. (2016b). Walenty: s\u0142ownik walencyjny j\n                  \n                \n\n\n\n\n\n\n\n\n\nzyka polskiego z bogatym komponentem frazeologicznym. In K. Skwarska & E. Kaczmarska (Eds.), V\u00fdzkum slovesn\u00e9 valence ve slovansk\u00fdch zem\u00edch (pp. 71\u2013102). Prague: Slovansk\u00fd \u00fastav AV \u010cR."},{"key":"9511_CR6","unstructured":"Kettnerov\u00e1, V., Lopatkov\u00e1, M., & Bej\u010dek, E. (2012). The syntax-semantics interface of Czech verbs in the valency lexicon. In Proceedings of the 15th EURALEX International Congress (pp. 434\u2013443). Oslo: Department of Linguistics and Scandinavian Studies, University of Oslo."},{"key":"9511_CR7","unstructured":"Kingsbury, P., & Palmer, M. (2002). From TreeBank to PropBank. In Proceedings of the 3rd International Conference on Language Resources and Evaluation (LREC-2002) (pp. 1989\u20131993). Las Palmas, Spain"},{"issue":"1","key":"9511_CR8","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1007\/s10579-007-9048-2","volume":"42","author":"K Kipper","year":"2008","unstructured":"Kipper, K., Korhonen, A., Ryant, N., & Palmer, M. (2008). A large-scale classification of English verbs. Languge Resources and Evaluation Journal, 42(1), 21\u201340.","journal-title":"Languge Resources and Evaluation Journal"},{"key":"9511_CR9","first-page":"309","volume-title":"Travaux de slavistique : Actes du VIe congr\u00e8s de la Slavic Linguistic Society","author":"B Lewandowska-Tomaszczyk","year":"2013","unstructured":"Lewandowska-Tomaszczyk, B., G\u00f3rski, R., \u0141azi\u0144ski, M., & Przepi\u00f3rkowski, A. (2013). The National Corpus of Polish (NKJP). Language use and data analysis. In I. KorChahine & C. Zaremba (Eds.), Travaux de slavistique : Actes du VIe congr\u00e8s de la Slavic Linguistic Society (pp. 309\u2013319). Aix-en-Provence: Presses Universitaires de Provence."},{"key":"9511_CR10","doi-asserted-by":"publisher","unstructured":"Maier, W., & Lichte, T. (2011). Characterizing discontinuity in constituent treebanks. In P. Groote, M. Egg, & L. Kallmeyer (Eds.), Formal Grammar: 14th International Conference, FG 2009, Bordeaux, France, July 25\u201326, 2009, Revised Selected Papers (pp. 167\u2013182). Berlin: Springer. https:\/\/doi.org\/10.1007\/978-3-642-20169-1_11.","DOI":"10.1007\/978-3-642-20169-1_11"},{"issue":"2","key":"9511_CR11","first-page":"313","volume":"19","author":"MP Marcus","year":"1993","unstructured":"Marcus, M. P., Santorini, B., & Marcinkiewicz, M. A. (1993). Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, 19(2), 313\u2013330.","journal-title":"Computational Linguistics"},{"issue":"1","key":"9511_CR12","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1162\/0891201053630264","volume":"31","author":"M Palmer","year":"2005","unstructured":"Palmer, M., Kingsbury, P., & Gildea, D. J. (2005). The proposition bank: An annotated corpus of semantic roles. Computational Linguistics, 31(1), 71\u2013106.","journal-title":"Computational Linguistics"},{"key":"9511_CR13","unstructured":"Patejuk, A., & Przepi\u00f3rkowski, A. (2014). Synergistic development of grammatical resources: A valence dictionary, an LFG grammar, and an LFG structure bank for Polish. In V. Henrich, E. Hinrichs, D. de Kok, P. Osenova, & A. Przepi\u00f3rkowski (Eds.), Proceedings of the Thirteenth International Workshop on Treebanks and Linguistic Theories (TLT\u00a013) (pp. 113\u2013126). Department of Linguistics (SfS), University of T\u00fcbingen, T\u00fcbingen. http:\/\/tlt13.sfs.uni-tuebingen.de\/tlt13-proceedings.pdf."},{"key":"9511_CR14","doi-asserted-by":"crossref","unstructured":"Pereira, F., & Warren, D. H. D. (1980). Definite clause grammars for language analysis\u2014A survey of the formalism and a comparison with augmented transition networks. Artificial Intelligence, 13, 231\u2013278.","DOI":"10.1016\/0004-3702(80)90003-X"},{"key":"9511_CR15","unstructured":"Przepi\u00f3rkowski, A. (2004). O warto\u015bci przypadka podmiot\u00f3w liczebnikowych. Biuletyn Polskiego Towarzystwa J\n\n                  \n                zykoznawczego, LX, 133\u2013143."},{"key":"9511_CR16","doi-asserted-by":"publisher","first-page":"185","DOI":"10.1007\/s10579-018-9433-z","volume":"54","author":"A Przepi\u00f3rkowski","year":"2020","unstructured":"Przepi\u00f3rkowski, A., & Patejuk, A. (2020). From Lexical Functional Grammar to enhanced Universal Dependencies: The UD-LFG treebank of Polish. Language Resources and Evaluation, 54, 185\u2013221. https:\/\/doi.org\/10.1007\/s10579-018-9433-z.","journal-title":"Language Resources and Evaluation"},{"key":"9511_CR17","unstructured":"Przepi\u00f3rkowski, A., Ba\u0144ko, M., G\u00f3rski, R. L., & Lewandowska-Tomaszczyk, B. (Eds.). (2012). Narodowy Korpus J\n\n                  \n                zyka Polskiego. Warsaw: Wydawnictwo Naukowe PWN."},{"key":"9511_CR18","doi-asserted-by":"crossref","unstructured":"Przepi\u00f3rkowski, A., Hajnicz, E., Patejuk, A., & Woli\u0144ski, M. (2014a). Extended phraseological information in a valence dictionary for NLP applications. In Proceedings of the Workshop on Lexical and Grammatical Resources for Language Processing (LG-LP 2014) (pp. 83\u201391). Dublin: Association for Computational Linguistics and Dublin City University; http:\/\/www.aclweb.org\/anthology\/siglex.html#2014_0.","DOI":"10.3115\/v1\/W14-5811"},{"key":"9511_CR19","unstructured":"Przepi\u00f3rkowski, A., Hajnicz, E., Patejuk, A., Woli\u0144ski, M., Skwarski, F., & \u015awidzi\u0144ski, M. (2014b). Walenty: Towards a comprehensive valence dictionary of Polish. In N. Calzolari, K. Choukri, T. Declerck, H. Loftsson, B. Maegaard, J. Mariani, A. Moreno, J. Odijk, & S. Piperidis (Eds.), Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC\u00a02014, ELRA, Reykjav\u00edk, Iceland (pp. 2785\u20132792). http:\/\/www.lrec-conf.org\/proceedings\/lrec2014\/index.html."},{"key":"9511_CR20","unstructured":"Przepi\u00f3rkowski, A., Skwarski, F., Hajnicz, E., Patejuk, A., \u015awidzi\u0144ski, M., & Woli\u0144ski, M. (2014c). Modelowanie w\u0142asno\u015bci sk\u0142adniowych czasownik\u00f3w w nowym s\u0142owniku walencyjnym j\n                  \n                \n\n\n\n\n\n\n\n\n\nzyka polskiego. Polonica, XXXIII, 159\u2013178."},{"issue":"1","key":"9511_CR21","first-page":"1","volume":"30","author":"A Przepi\u00f3rkowski","year":"2017","unstructured":"Przepi\u00f3rkowski, A., Haji\u010d, J., Hajnicz, E., & Ure\u0161ov\u00e1, Z. (2017). Phraseology in two Slavic valency dictionaries: Limitations and perspectives. International Journal of Lexicography, 30(1), 1\u201338.","journal-title":"International Journal of Lexicography"},{"key":"9511_CR22","unstructured":"Seddah, D., Tsarfaty, R., K\u00fcbler, S., Candito, M., Choi, J. D., Farkas, R., et al. (2013). Overview of the SPMRL 2013 shared task: A cross-framework evaluation of parsing morphologically rich languages. In Proceedings of the Fourth Workshop on Statistical Parsing of Morphologically-Rich Languages (pp. 146\u2013182). Seattle: Association for Computational Linguistics."},{"key":"9511_CR23","volume-title":"The meaning of the sentence in its semantic and pragmatic aspects","author":"P Sgall","year":"1986","unstructured":"Sgall, P., Haji\u010dov\u00e1, E., & Panevov\u00e1, J. (1986). The meaning of the sentence in its semantic and pragmatic aspects. Dordrecht: D. Reidel."},{"key":"9511_CR24","unstructured":"\u015awidzi\u0144ski, M. (1992). Gramatyka formalna j\n\n                  \n                zyka polskiego. Rozprawy Uniwersytetu Warszawskiego, Wydawnictwa Uniwersytetu Warszawskiego, Warszawa"},{"key":"9511_CR25","unstructured":"\u015awidzi\u0144ski, M. (1994). Syntactic dictionary of Polish verbs, manuscript, Uniwersytet Warszawski and Universiteit van Amsterdam."},{"key":"9511_CR26","unstructured":"\u015awidzi\u0144ski, M., & Woli\u0144ski, M. (2010). Towards a bank of constituent parse trees for Polish. In P. Sojka, A. Hor\u00e1k, I. Kope\u010dek, & K. Pala (Eds.), Text, Speech and Dialogue: 13th International Conference, TSD\u00a02010, Brno, Czech Republic (pp. 197\u2013204). Heidelberg: Springer-Verlag. no. 6231 in Lecture Notes in Artificial Intelligence."},{"key":"9511_CR27","unstructured":"Ure\u0161ov\u00e1, Z. (2009). Building the PDT-Vallex valency lexicon. In Proceedings of the 5th Corpus Linguistics Conference, University of Liverpool."},{"key":"9511_CR28","unstructured":"Woli\u0144ski, M. (2004). Komputerowa weryfikacja gramatyki \u015awidzi\u0144skiego. Ph.D. dissertation, Institute of Computer Science, Polish Academy of Sciences, Warsaw."},{"key":"9511_CR29","unstructured":"Woli\u0144ski, M. (2010). Dendrarium - an open source tool for treebank building. In M. A. K\u0142opotek, M. Marciniak, A. Mykowiecka, W. Penczek, & S. T. Wierzcho\u0144 (Eds.), Proceedings of IIS\u20192010, Wydawnictwo Akademii Podlaskiej (pp. 193\u2013204)."},{"key":"9511_CR30","unstructured":"Woli\u0144ski, M. (2015). Deploying the new valency dictionary Walenty in a DCG parser of Polish. In M. Dickinson, E. Hinrichs, A. Patejuk, & A. Przepi\u00f3rkowski (Eds.), Proceedings of the Fourteenth International Workshop on Treebanks and Linguistic Theories (TLT\u00a014), Institute of Computer Science, Polish Academy of Sciences, Warsaw (pp. 221\u2013229). http:\/\/tlt14.ipipan.waw.pl\/proceedings\/."},{"key":"9511_CR31","doi-asserted-by":"crossref","unstructured":"Woli\u0144ski, M. (2019). Automatyczna analiza sk\u0142adnikowa j\n\n                  \n                zyka polskiego. Warszawa: Warsaw University Press.","DOI":"10.31338\/uw.9788323536147"},{"key":"9511_CR32","unstructured":"Woli\u0144ski, M., G\u0142owi\u0144ska, K., & \u015awidzi\u0144ski, M. (2011). A preliminary version of Sk\u0142adnica\u2014a treebank of Polish. In Z. Vetulani (Ed.) Proceedings of the 5th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics (pp. 299\u2013303). Poland: Pozna\u0144."},{"key":"9511_CR33","unstructured":"Wr\u00f3blewska, A. (2014). Polish dependency parser trained on an automatically induced dependency bank. Ph.D. dissertation, Institute of Computer Science, Polish Academy of Sciences, Warsaw."},{"key":"9511_CR34","doi-asserted-by":"publisher","unstructured":"Wr\u00f3blewska, A. (2018) Extended and enhanced Polish dependency bank in Universal Dependencies format. In Proceedings of the Second Workshop on Universal Dependencies (UDW 2018) (pp. 173\u2013182). Brussels: Association for Computational Linguistics. https:\/\/doi.org\/10.18653\/v1\/W18-6020, https:\/\/www.aclweb.org\/anthology\/W18-6020.","DOI":"10.18653\/v1\/W18-6020"},{"key":"9511_CR35","unstructured":"Wr\u00f3blewska, A., & Woli\u0144ski, M. (2012). Preliminary experiments in Polish dependency parsing. In P. Bouvry, M. A. K\u0142opotek, F. Leprevost, M. Marciniak, A. Mykowiecka, & H. Rybi\u0144ski (Eds.), Security and Intelligent Information Systems: International Joint Conference, SIIS 2011, Warsaw, Poland, June 13-14, 2011, Revised Selected Papers, Springer-Verlag, no. 7053 in Lecture Notes in Computer Science (pp. 279\u2013292). http:\/\/www.springer.com\/computer\/communication+networks\/book\/978-3-642-25260-0."},{"key":"9511_CR36","first-page":"41","volume":"87","author":"Z \u017dabokrtsk\u00fd","year":"2007","unstructured":"\u017dabokrtsk\u00fd, Z., & Lopatkov\u00e1, M. (2007). Valency information in VALLEX 2.0: Logical structure of the lexicon. The Prague Bulletin of Mathematical Linguistics, 87, 41\u201360.","journal-title":"The Prague Bulletin of Mathematical Linguistics"}],"container-title":["Language Resources and Evaluation"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10579-020-09511-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s10579-020-09511-7\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10579-020-09511-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,4,2]],"date-time":"2021-04-02T19:08:30Z","timestamp":1617390510000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10579-020-09511-7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,2,21]]},"references-count":36,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,3]]}},"alternative-id":["9511"],"URL":"https:\/\/doi.org\/10.1007\/s10579-020-09511-7","relation":{},"ISSN":["1574-020X","1574-0218"],"issn-type":[{"type":"print","value":"1574-020X"},{"type":"electronic","value":"1574-0218"}],"subject":[],"published":{"date-parts":[[2021,2,21]]},"assertion":[{"value":"1 October 2020","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 February 2021","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}