{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,17]],"date-time":"2026-03-17T09:40:04Z","timestamp":1773740404328,"version":"3.50.1"},"reference-count":3,"publisher":"MIT Press - Journals","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["TACL"],"published-print":{"date-parts":[[2015,12]]},"abstract":"<jats:p> We show how to train the fast dependency parser of Smith and Eisner (2008) for improved accuracy. This parser can consider higher-order interactions among edges while retaining O( n<jats:sup>3<\/jats:sup>) runtime. It outputs the parse with maximum expected recall\u2014but for speed, this expectation is taken under a posterior distribution that is constructed only approximately, using loopy belief propagation through structured factors. We show how to adjust the model parameters to compensate for the errors introduced by this approximation, by following the gradient of the actual loss on training data. We find this gradient by back-propagation. That is, we treat the entire parser (approximations and all) as a differentiable circuit, as others have done for loopy CRFs (Domke, 2010; Stoyanov et al., 2011; Domke, 2011; Stoyanov and Eisner, 2012). The resulting parser obtains higher accuracy with fewer iterations of belief propagation than one trained by conditional log-likelihood. <\/jats:p>","DOI":"10.1162\/tacl_a_00153","type":"journal-article","created":{"date-parts":[[2018,12,28]],"date-time":"2018-12-28T15:44:35Z","timestamp":1546011875000},"page":"489-501","source":"Crossref","is-referenced-by-count":5,"title":["Approximation-Aware Dependency Parsing by Belief                     Propagation"],"prefix":"10.1162","volume":"3","author":[{"given":"Matthew R.","family":"Gormley","sequence":"first","affiliation":[{"name":"Human Language Technology Center of Excellence, Center for Language and                         Speech Processing, Department of Computer Science, Johns Hopkins University,                         Baltimore, MD,"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mark","family":"Dredze","sequence":"additional","affiliation":[{"name":"Human Language Technology Center of Excellence, Center for Language and                         Speech Processing, Department of Computer Science, Johns Hopkins University,                         Baltimore, MD,"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jason","family":"Eisner","sequence":"additional","affiliation":[{"name":"Human Language Technology Center of Excellence, Center for Language and                         Speech Processing, Department of Computer Science, Johns Hopkins University,                         Baltimore, MD,"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"281","reference":[{"key":"p_11","author":"Duchi John","year":"2011","journal-title":"The Journal of Machine Learning Research."},{"key":"p_26","author":"Kschischang Frank R.","year":"2001","journal-title":"IEEE Transactions on Information Theory, 47(2)."},{"key":"p_46","author":"Wainwright Martin J.","year":"2006","journal-title":"The Journal of Machine Learning Research, 7."}],"container-title":["Transactions of the Association for Computational Linguistics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mitpressjournals.org\/doi\/pdf\/10.1162\/tacl_a_00153","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,3,12]],"date-time":"2021-03-12T21:38:47Z","timestamp":1615585127000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/tacl\/article\/43280"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,12]]},"references-count":3,"alternative-id":["10.1162\/tacl_a_00153"],"URL":"https:\/\/doi.org\/10.1162\/tacl_a_00153","relation":{},"ISSN":["2307-387X"],"issn-type":[{"value":"2307-387X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2015,12]]}}}