{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,16]],"date-time":"2026-02-16T20:55:12Z","timestamp":1771275312373,"version":"3.50.1"},"reference-count":11,"publisher":"MIT Press - Journals","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["TACL"],"published-print":{"date-parts":[[2014,12]]},"abstract":"<jats:p> We present a novel representation, evaluation measure, and supervised models for the task of identifying the multiword expressions (MWEs) in a sentence, resulting in a lexical semantic segmentation. Our approach generalizes a standard chunking representation to encode MWEs containing gaps, thereby enabling efficient sequence tagging algorithms for feature-rich discriminative models. Experiments on a new dataset of English web text offer the first linguistically-driven evaluation of MWE identification with truly heterogeneous expression types. Our statistical sequence model greatly outperforms a lookup-based segmentation procedure, achieving nearly 60% F<jats:sub>1<\/jats:sub> for MWE identification. <\/jats:p>","DOI":"10.1162\/tacl_a_00176","type":"journal-article","created":{"date-parts":[[2018,12,28]],"date-time":"2018-12-28T15:43:26Z","timestamp":1546011806000},"page":"193-206","source":"Crossref","is-referenced-by-count":17,"title":["Discriminative Lexical Semantic Segmentation with Gaps: Running the                     MWE Gamut"],"prefix":"10.1162","volume":"2","author":[{"given":"Nathan","family":"Schneider","sequence":"first","affiliation":[{"name":"School of Computer Science, Carnegie Mellon University, Pittsburgh, PA                         15213, USA,"}]},{"given":"Emily","family":"Danchik","sequence":"additional","affiliation":[{"name":"School of Computer Science, Carnegie Mellon University, Pittsburgh, PA                         15213, USA,"}]},{"given":"Chris","family":"Dyer","sequence":"additional","affiliation":[{"name":"School of Computer Science, Carnegie Mellon University, Pittsburgh, PA                         15213, USA,"}]},{"given":"Noah A.","family":"Smith","sequence":"additional","affiliation":[{"name":"School of Computer Science, Carnegie Mellon University, Pittsburgh, PA                         15213, USA,"}]}],"member":"281","reference":[{"issue":"4","key":"p_11","first-page":"467","volume":"18","author":"Brown Peter F.","year":"1992","journal-title":"Computational Linguistics"},{"issue":"3","key":"p_21","doi-asserted-by":"crossref","first-page":"375","DOI":"10.1002\/j.1545-7249.2008.tb00137.x","volume":"42","author":"Ellis Nick C.","year":"2008","journal-title":"TESOL Quarterly"},{"issue":"3","key":"p_23","doi-asserted-by":"crossref","first-page":"501","DOI":"10.2307\/414531","volume":"64","author":"Fillmore Charles J.","year":"1988","journal-title":"Language"},{"issue":"3","key":"p_24","doi-asserted-by":"crossref","first-page":"277","DOI":"10.1023\/A:1007662407062","volume":"37","author":"Freund Yoav","year":"1999","journal-title":"Machine Learning"},{"issue":"1","key":"p_31","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1162\/COLI_a_00139","volume":"39","author":"Green Spence","year":"2012","journal-title":"Computational Linguistics"},{"issue":"2","key":"p_39","first-page":"313","volume":"19","author":"Marcus Mitchell P.","year":"1993","journal-title":"Computational Linguistics"},{"issue":"3","key":"p_47","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1353\/lan.1994.0007","volume":"70","author":"Nunberg Geoffrey","year":"1994","journal-title":"Language"},{"issue":"1","key":"p_50","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1007\/s10579-009-9101-4","volume":"44","author":"Pecina Pavel","year":"2010","journal-title":"Language Resources and Evaluation"},{"issue":"04","key":"p_56","doi-asserted-by":"crossref","first-page":"485","DOI":"10.1017\/S135132491000029X","volume":"17","author":"Recasens Marta","year":"2011","journal-title":"Natural Language Engineering"},{"issue":"4","key":"p_61","doi-asserted-by":"crossref","first-page":"317","DOI":"10.1016\/S0022-0000(67)80022-9","volume":"1","author":"Thatcher James W.","year":"1967","journal-title":"Journal of Computer and System Sciences"},{"issue":"2","key":"p_71","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2483691.2483695","volume":"10","author":"Vincze Veronika","year":"2013","journal-title":"ACM Transactions on Speech and Language Processing"}],"container-title":["Transactions of the Association for Computational Linguistics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mitpressjournals.org\/doi\/pdf\/10.1162\/tacl_a_00176","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,3,12]],"date-time":"2021-03-12T21:38:54Z","timestamp":1615585134000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/tacl\/article\/43301"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,12]]},"references-count":11,"alternative-id":["10.1162\/tacl_a_00176"],"URL":"https:\/\/doi.org\/10.1162\/tacl_a_00176","relation":{},"ISSN":["2307-387X"],"issn-type":[{"value":"2307-387X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2014,12]]}}}