{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,25]],"date-time":"2025-06-25T14:12:33Z","timestamp":1750860753491},"reference-count":86,"publisher":"MIT Press - Journals","issue":"1","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Computational Linguistics"],"published-print":{"date-parts":[[2019,3]]},"abstract":"<jats:p> Nominal compounds such as red wine and nut case display a continuum of compositionality, with varying contributions from the components of the compound to its semantics. This article proposes a framework for compound compositionality prediction using distributional semantic models, evaluating to what extent they capture idiomaticity compared to human judgments. For evaluation, we introduce data sets containing human judgments in three languages: English, French, and Portuguese. The results obtained reveal a high agreement between the models and human predictions, suggesting that they are able to incorporate information about idiomaticity. We also present an in-depth evaluation of various factors that can affect prediction, such as model and corpus parameters and compositionality operations. General crosslingual analyses reveal the impact of morphological variation and corpus size in the ability of the model to predict compositionality, and of a uniform combination of the components for best results. <\/jats:p>","DOI":"10.1162\/coli_a_00341","type":"journal-article","created":{"date-parts":[[2018,12,21]],"date-time":"2018-12-21T16:12:19Z","timestamp":1545408739000},"page":"1-57","source":"Crossref","is-referenced-by-count":16,"title":["Unsupervised Compositionality Prediction of Nominal Compounds"],"prefix":"10.1162","volume":"45","author":[{"given":"Silvio","family":"Cordeiro","sequence":"first","affiliation":[{"name":"Federal University of Rio Grande do Sul and Aix Marseille University, CNRS, LIS."}]},{"given":"Aline","family":"Villavicencio","sequence":"additional","affiliation":[{"name":"University of Essex and Federal University of Rio Grande do Sul."}]},{"given":"Marco","family":"Idiart","sequence":"additional","affiliation":[{"name":"Federal University of Rio Grande do Sul."}]},{"given":"Carlos","family":"Ramisch","sequence":"additional","affiliation":[{"name":"Aix Marseille University, CNRS, LIS."}]}],"member":"281","reference":[{"key":"bib1","doi-asserted-by":"publisher","DOI":"10.3115\/1620754.1620758"},{"key":"bib2","doi-asserted-by":"publisher","DOI":"10.1162\/coli.07-034-R2"},{"key":"bib3","first-page":"267","volume-title":"Handbook of Natural Language Processing","author":"Baldwin Timothy","year":"2010","edition":"2"},{"key":"bib4","doi-asserted-by":"publisher","DOI":"10.3115\/1119282.1119291"},{"key":"bib5","doi-asserted-by":"publisher","DOI":"10.1007\/s10579-009-9081-4"},{"key":"bib6","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P14-1023"},{"key":"bib7","doi-asserted-by":"publisher","DOI":"10.1162\/coli_a_00016"},{"key":"bib8","unstructured":"Bick, Eckhard. 2000. The Parsing System \u201cpalavras\u201d: Automatic Grammatical Analysis of Portuguese in a Constraint Grammar Framework. Ph.D. thesis, University of Aarhus."},{"key":"bib9","first-page":"728","volume-title":"Proceedings of the Conference on Language Resources and Evaluation 2014","author":"Boos Rodrigo","year":"2014"},{"key":"bib10","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P15-1028"},{"key":"bib11","doi-asserted-by":"publisher","DOI":"10.3758\/s13428-011-0183-8"},{"key":"bib12","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P15-2001"},{"key":"bib13","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W15-0903"},{"key":"bib14","first-page":"242","volume-title":"Proceedings of NAACL\/HLT 2010","author":"Carpuat Marine","year":"2010"},{"issue":"1","key":"bib15","first-page":"22","volume":"16","author":"Church Kenneth Ward","year":"1990","journal-title":"Computational Linguistics"},{"key":"bib16","doi-asserted-by":"publisher","DOI":"10.1177\/001316446002000104"},{"key":"bib17","doi-asserted-by":"publisher","DOI":"10.1162\/COLI_a_00302"},{"key":"bib18","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P16-1187"},{"key":"bib19","first-page":"1221","volume-title":"Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)","author":"Cordeiro Silvio","year":"2016"},{"key":"bib20","first-page":"231","volume-title":"Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics","author":"Curran James R.","year":"2002"},{"key":"bib21","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9"},{"key":"bib22","unstructured":"Evert, Stefan. 2004. The Statistics of Word Cooccurrences: Word Pairs and Collocations. Ph.D. thesis, Institut f\u00fcr maschinelle Sprachverarbeitung, University of Stuttgart, Stuttgart, Germany."},{"key":"bib23","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W15-0904"},{"key":"bib24","doi-asserted-by":"publisher","DOI":"10.1162\/coli.08-010-R1-07-048"},{"key":"bib25","first-page":"561","volume-title":"Association for Computational Linguistics (1)","author":"Ferret Olivier","year":"2013"},{"key":"bib26","first-page":"20","volume-title":"Proceedings of the Association for Computational Linguistics 2011 Workshop on MWEs","author":"Finlayson Mark","year":"2011"},{"key":"bib27","first-page":"168","volume-title":"Selected Papers of J. R. Firth","author":"Firth John R","year":"1957"},{"key":"bib28","doi-asserted-by":"publisher","DOI":"10.1177\/001316447303300309"},{"key":"bib29","first-page":"25","volume":"100","author":"Frege Gottlob","year":"1892","journal-title":"Zeitschrift f\u00fcr Philosophie und philosophische Kritik"},{"key":"bib30","doi-asserted-by":"publisher","DOI":"10.3115\/1706543.1706548"},{"key":"bib31","doi-asserted-by":"publisher","DOI":"10.1016\/j.csl.2005.02.006"},{"key":"bib32","volume-title":"Compositionality","author":"Goldberg Adele E.","year":"2015"},{"key":"bib33","series-title":"IWCS \u201911","first-page":"135","volume-title":"Proceedings of the Ninth International Conference on Computational Semantics","author":"Guevara Emiliano","year":"2011"},{"key":"bib34","doi-asserted-by":"publisher","DOI":"10.1080\/00437956.1954.11659520"},{"key":"bib35","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/E17-1006"},{"key":"bib36","first-page":"138","volume-title":"Proceedings of *SEM 2013 (Volume 2 \u2014 SemEval)","author":"Hendrickx Iris","year":"2013"},{"key":"bib37","first-page":"82","volume-title":"Proceedings of the LAW 2010","author":"Hwang Jena D.","year":"2010"},{"key":"bib38","first-page":"33","volume-title":"Proceedings of NAACL Student Research Workshop","author":"Jagfeld Glorianna","year":"2015"},{"key":"bib39","volume-title":"Speech and Language Processing","author":"Jurafsky Daniel","year":"2009","edition":"2"},{"key":"bib40","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W14-1503"},{"key":"bib41","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N16-1039"},{"key":"bib42","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/S14-1021"},{"key":"bib43","doi-asserted-by":"publisher","DOI":"10.1080\/01638539809545028"},{"key":"bib44","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00201"},{"key":"bib45","first-page":"394","author":"Lapesa Gabriella","year":"2017","journal-title":"EACL 2017"},{"key":"bib46","author":"Lauer Mark","year":"1995","journal-title":"CoRR"},{"key":"bib47","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00134"},{"key":"bib48","first-page":"768","volume-title":"Proceedings of the 17th International Conference on Computational Linguistics (Volume 2)","author":"Lin Dekang","year":"1998"},{"key":"bib49","doi-asserted-by":"publisher","DOI":"10.3115\/1034678.1034730"},{"key":"bib50","doi-asserted-by":"publisher","DOI":"10.3115\/1119282.1119292"},{"key":"bib51","first-page":"3111","volume-title":"Advances in Neural Information Processing Systems","author":"Mikolov Tomas","year":"2013"},{"key":"bib52","first-page":"746","volume-title":"HLT-NAACL","author":"Mikolov Tomas","year":"2013"},{"key":"bib53","first-page":"236","volume-title":"Association for Computational Linguistics","author":"Mitchell Jeff","year":"2008"},{"key":"bib54","doi-asserted-by":"publisher","DOI":"10.1111\/j.1551-6709.2010.01106.x"},{"key":"bib55","author":"Mohammad Saif","year":"2012","journal-title":"CoRR"},{"key":"bib56","first-page":"46","volume-title":"Proceedings of the LREC Workshop Towards a Shared Task for MWEs","author":"Nakov Preslav","year":"2008"},{"key":"bib57","doi-asserted-by":"publisher","DOI":"10.1017\/S1351324913000065"},{"key":"bib58","first-page":"2216","volume-title":"Proceedings of the Conference on Language Resources and Evaluation (Volume 6)","author":"Nivre Joakim","year":"2006"},{"key":"bib59","doi-asserted-by":"publisher","DOI":"10.3115\/1075096.1075113"},{"key":"bib60","doi-asserted-by":"publisher","DOI":"10.1162\/coli.2007.33.2.161"},{"key":"bib61","first-page":"2964","volume-title":"Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)","author":"Padr\u00f3 Muntsa","year":"2014"},{"key":"bib62","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1047"},{"key":"bib63","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1162"},{"key":"bib64","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P16-2026"},{"key":"bib65","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/W16-1804"},{"key":"bib66","first-page":"210","volume-title":"Proceedings of the 5th International Joint Conference on Natural Language Processing 2011 (IJCNLP 2011)","author":"Reddy Siva","year":"2011"},{"key":"bib67","doi-asserted-by":"publisher","DOI":"10.3115\/1698239.1698249"},{"key":"bib68","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1290"},{"key":"bib69","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W14-0818"},{"key":"bib70","first-page":"32","volume-title":"Proceedings of the 9th Workshop on Multiword Expressions","author":"Roller Stephen","year":"2013"},{"key":"bib71","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-45715-1_1"},{"key":"bib72","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/E14-1050"},{"key":"bib73","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/N15-1099"},{"key":"bib74","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/W15-0909"},{"key":"bib75","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P16-2068"},{"key":"bib76","first-page":"28","volume":"43","author":"Schmid Helmut","year":"1995","journal-title":"Institut f\u00fcr Maschinelle Sprachverarbeitung, Universit\u00e4t Stuttgart"},{"key":"bib77","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/S16-1084"},{"key":"bib78","first-page":"100","volume-title":"Proceedings of Empirical Methods in Natural Language Processing","author":"Schone Patrick","year":"2001"},{"key":"bib79","first-page":"2285","volume-title":"Proceedings of the Conference on Language Resources and Evaluation","author":"Schulte im Walde Sabine","year":"2016"},{"key":"bib80","first-page":"255","volume-title":"Proceedings of *SEM 2013 (Volume 1)","author":"Schulte im Walde Sabine","year":"2013"},{"key":"bib81","first-page":"1201","volume-title":"Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning","author":"Socher Richard","year":"2012"},{"key":"bib82","doi-asserted-by":"publisher","DOI":"10.1162\/COLI_a_00162"},{"key":"bib83","doi-asserted-by":"publisher","DOI":"10.1017\/S1351324912000101"},{"key":"bib84","doi-asserted-by":"publisher","DOI":"10.1613\/jair.2934"},{"key":"bib85","first-page":"2703","volume-title":"COLING 2012","author":"Van de Cruys Tim","year":"2012"},{"key":"bib86","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1201"}],"container-title":["Computational Linguistics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mitpressjournals.org\/doi\/pdf\/10.1162\/coli_a_00341","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,3,12]],"date-time":"2021-03-12T21:28:20Z","timestamp":1615584500000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/coli\/article\/45\/1\/1-57\/1621"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,3]]},"references-count":86,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2019,3]]}},"alternative-id":["10.1162\/coli_a_00341"],"URL":"https:\/\/doi.org\/10.1162\/coli_a_00341","relation":{},"ISSN":["0891-2017","1530-9312"],"issn-type":[{"value":"0891-2017","type":"print"},{"value":"1530-9312","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,3]]}}}