{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,2]],"date-time":"2026-04-02T05:57:41Z","timestamp":1775109461075,"version":"3.50.1"},"reference-count":39,"publisher":"Springer Science and Business Media LLC","issue":"2-3","license":[{"start":{"date-parts":[[2009,4,11]],"date-time":"2009-04-11T00:00:00Z","timestamp":1239408000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Mach Learn"],"published-print":{"date-parts":[[2009,12]]},"DOI":"10.1007\/s10994-009-5110-1","type":"journal-article","created":{"date-parts":[[2009,4,10]],"date-time":"2009-04-10T14:38:46Z","timestamp":1239374326000},"page":"303-337","source":"Crossref","is-referenced-by-count":32,"title":["Training parsers by inverse reinforcement learning"],"prefix":"10.1007","volume":"77","author":[{"given":"Gergely","family":"Neu","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Csaba","family":"Szepesv\u00e1ri","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2009,4,11]]},"reference":[{"key":"5110_CR1","doi-asserted-by":"crossref","unstructured":"Abbeel, P., & Ng, A. (2004). Apprenticeship learning via inverse reinforcement learning. In ICML\u201904 (pp.\u00a01\u20138).","DOI":"10.1145\/1015330.1015430"},{"key":"5110_CR2","doi-asserted-by":"crossref","DOI":"10.7551\/mitpress\/7443.001.0001","volume-title":"Predicting structured data (neural information processing)","author":"G. H. Bakir","year":"2007","unstructured":"Bakir, G. H., Hofmann, T., Sch\u00f6lkopf, B., Smola, A. J., Taskar, B., & Vishwanathan, S. V. N. (2007). Predicting structured data (neural information processing). Cambridge: MIT Press."},{"key":"5110_CR3","first-page":"113","volume-title":"Advances in neural information processing systems","author":"P. L. Bartlett","year":"2005","unstructured":"Bartlett, P. L., Collins, M., Taskar, B., & McAllester, D. (2005). Exponentiated gradient algorithms for large-margin structured classification. In Advances in neural information processing systems (Vol.\u00a017, pp.\u00a0113\u2013120). Cambridge: MIT Press."},{"key":"5110_CR4","volume-title":"Neuro-dynamic programming","author":"D. P. Bertsekas","year":"1996","unstructured":"Bertsekas, D. P., & Tsitsiklis, J. N. (1996). Neuro-dynamic programming. Belmont: Athena Scientific."},{"key":"5110_CR5","unstructured":"Black, E. (1992). Meeting of interest group on evaluation of broad-coverage parsers of English. In LINGUIST list\u00a03.587. http:\/\/www.linguistlist.org\/issues\/3\/3-587.html ."},{"key":"5110_CR6","series-title":"Studies in applied mathematics","doi-asserted-by":"crossref","DOI":"10.1137\/1.9781611970777","volume-title":"Linear matrix inequalities in system and control theory","author":"S. Boyd","year":"1994","unstructured":"Boyd, S., El Ghaoui, L., Feron, E., & Balakrishnan, V. (1994). Studies in applied mathematics: Vol.\u00a015. Linear matrix inequalities in system and control theory. Philadelphia: SIAM."},{"key":"5110_CR7","doi-asserted-by":"crossref","first-page":"548","DOI":"10.1145\/1150402.1150466","volume-title":"KDD\u201906","author":"V. R. Carvalho","year":"2006","unstructured":"Carvalho, V. R., & Cohen, W. W. (2006). Single-pass online learning: performance, voting schemes and online feature selection. In KDD\u201906 (pp.\u00a0548\u2013553). New York: ACM."},{"key":"5110_CR8","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511546921","volume-title":"Prediction, learning, and games","author":"N. Cesa-Bianchi","year":"2006","unstructured":"Cesa-Bianchi, N., & Lugosi, G. (2006). Prediction, learning, and games. Cambridge: Cambridge University Press."},{"key":"5110_CR9","first-page":"173","volume-title":"ACL\u00a0\u201905: Proceedings of the 43rd annual meeting on association for computational linguistics","author":"E. Charniak","year":"2005","unstructured":"Charniak, E., & Johnson, M. (2005). Coarse-to-fine n-best parsing and MaxEnt discriminative reranking. In ACL\u00a0\u201905: Proceedings of the 43rd annual meeting on association for computational linguistics (pp.\u00a0173\u2013180). Morristown: Association for Computational Linguistics."},{"key":"5110_CR10","unstructured":"Collins, M. (1999). Head-driven statistical models for natural language processing. Ph.D.\u00a0thesis, University of Pennsylvania."},{"key":"5110_CR11","unstructured":"Collins, M. (2000). Discriminative reranking for natural language parsing. In ICML\u201900 (pp.\u00a0175\u2013182)."},{"key":"5110_CR12","first-page":"1","volume-title":"EMNLP\u00a0\u201902: Proceedings of the ACL-02 conference on Empirical methods in natural language processing","author":"M. Collins","year":"2002","unstructured":"Collins, M. (2002). Discriminative training methods for hidden Markov models: theory and experiments with perceptron algorithms. In EMNLP\u00a0\u201902: Proceedings of the ACL-02 conference on Empirical methods in natural language processing (pp.\u00a01\u20138). Morristown: Association for Computational Linguistics."},{"key":"5110_CR13","first-page":"111","volume-title":"ACL\u00a0\u201904: Proceedings of the 42nd annual meeting on association for computational linguistics","author":"M. Collins","year":"2004","unstructured":"Collins, M., & Roark, B. (2004). Incremental parsing with the perceptron algorithm. In ACL\u00a0\u201904: Proceedings of the 42nd annual meeting on association for computational linguistics (pp.\u00a0111\u2013118). Morristown: Association for Computational Linguistics."},{"key":"5110_CR14","unstructured":"Daum\u00e9 III, H. (2006). Practical structured learning techniques for natural language processing. Ph.D.\u00a0thesis, University of Southern California, Los Angeles, CA."},{"key":"5110_CR15","doi-asserted-by":"crossref","unstructured":"Elliott, H., Derin, H., Cristi, R., & Geman, D. (1984). Application of the Gibbs distribution to image segmentation. In Proc. 1984 int. conf. acoust., speech, signal processing, ICASSP\u201984 (pp.\u00a032.5.1\u201332.5.4).","DOI":"10.1109\/ICASSP.1984.1172637"},{"key":"5110_CR16","first-page":"959","volume-title":"ACL\u00a008","author":"J. R. Finkel","year":"2008","unstructured":"Finkel, J. R., Kleeman, A., & Manning, C. D. (2008). Efficient, feature-based, conditional random field parsing. In ACL\u00a008 (pp.\u00a0959\u2013967). Morristown: Association for Computational Linguistics."},{"issue":"3","key":"5110_CR17","doi-asserted-by":"crossref","first-page":"277","DOI":"10.1023\/A:1007662407062","volume":"37","author":"Y. Freund","year":"1999","unstructured":"Freund, Y., & Schapire, R. E. (1999). Large margin classification using the perceptron algorithm. Machine Learning, 37(3), 277\u2013296.","journal-title":"Machine Learning"},{"key":"5110_CR18","doi-asserted-by":"crossref","first-page":"305","DOI":"10.1145\/1273496.1273535","volume-title":"ICML\u00a0\u201907: Proceedings of the 24th international conference on machine learning","author":"A. Globerson","year":"2007","unstructured":"Globerson, A., Koo, T. Y., Carreras, X., & Collins, M. (2007). Exponentiated gradient algorithms for log-linear structured prediction. In ICML\u00a0\u201907: Proceedings of the 24th international conference on machine learning (pp.\u00a0305\u2013312). New York: ACM."},{"key":"5110_CR19","first-page":"115","volume-title":"Proceedings of the second international ICSC symposium on neural computation (NC 2000)","author":"C. Igel","year":"2000","unstructured":"Igel, C., & H\u00fcsken, M. (2000). Improving the Rprop learning algorithm. In Proceedings of the second international ICSC symposium on neural computation (NC 2000) (pp.\u00a0115\u2013121). San Diego: Academic Press."},{"issue":"4","key":"5110_CR20","doi-asserted-by":"crossref","first-page":"620","DOI":"10.1103\/PhysRev.106.620","volume":"106","author":"E. T. Jaynes","year":"1957","unstructured":"Jaynes, E. T. (1957). Information theory and statistical mechanics. Physical Review, 106(4), 620\u2013630.","journal-title":"Physical Review"},{"key":"5110_CR21","first-page":"40","volume-title":"NAACL\u00a0\u201903: Proceedings of the 2003 conference of the North American chapter of the association for computational linguistics on human language technology","author":"D. Klein","year":"2003","unstructured":"Klein, D., & Manning, C. D. (2003). A * parsing: fast exact viterbi parse selection. In NAACL\u00a0\u201903: Proceedings of the 2003 conference of the North American chapter of the association for computational linguistics on human language technology (pp.\u00a040\u201347). Morristown: Association for Computational Linguistics."},{"key":"5110_CR22","first-page":"282","volume-title":"Proc. 18th international conf. on machine learning","author":"J. Lafferty","year":"2001","unstructured":"Lafferty, J., McCallum, A., & Pereira, F. (2001). Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proc. 18th international conf. on machine learning (pp.\u00a0282\u2013289). San Mateo: Morgan Kaufmann."},{"key":"5110_CR23","doi-asserted-by":"crossref","unstructured":"Maes, F., Denoyer, L., & Gallinari, P. (2007). Sequence labeling with reinforcement learning and ranking algorithms. In ECML (pp.\u00a0648\u2013657).","DOI":"10.1007\/978-3-540-74958-5_64"},{"key":"5110_CR24","volume-title":"Foundations of statistical natural language processing","author":"C. D. Manning","year":"1999","unstructured":"Manning, C. D., & Sch\u00fctze, H. (1999). Foundations of statistical natural language processing. Cambridge: MIT Press."},{"key":"5110_CR25","unstructured":"Neu, G., & Szepesv\u00e1ri, Cs. (2007). Apprenticeship learning using inverse reinforcement learning and gradient methods. In Conference on uncertainty in artificial intelligence (UAI) (pp.\u00a0295\u2013302)."},{"key":"5110_CR26","unstructured":"Ng, A., & Russell, S. (2000). Algorithms for inverse reinforcement learning. In ICML-2000 (pp.\u00a0663\u2013670)."},{"key":"5110_CR27","unstructured":"Ng, A. Y., & Jordan, M. I. (2001). On discriminative vs. generative classifiers: a comparison of logistic regression and naive bayes. In NIPS-14 (pp.\u00a0841\u2013848)."},{"key":"5110_CR28","unstructured":"Petrov, S., & Klein, D. (2007). Learning and inference for hierarchically split PCFGs. In AAAI 2007 (nectar track) (pp.\u00a01663\u20131666)."},{"key":"5110_CR29","doi-asserted-by":"crossref","unstructured":"Ratliff, N., Bagnell, J., & Zinkevich, M. (2006). Maximum margin planning. In. ICML\u201906 (pp.\u00a0729\u2013736).","DOI":"10.1145\/1143844.1143936"},{"key":"5110_CR30","unstructured":"Ratliff, N., Bagnell, J. D., & Zinkevich, M. (2007). Subgradient methods for structured prediction. In Eleventh international conference on artificial intelligence and statistics (AIStats) (pp.\u00a02:380\u2013387). (Online)."},{"issue":"5","key":"5110_CR31","doi-asserted-by":"crossref","first-page":"2053","DOI":"10.1006\/jmbi.1998.2436","volume":"285","author":"E. Rivas","year":"1999","unstructured":"Rivas, E., & Eddy, S. R. (1999). A dynamic programming algorithm for RNA structure prediction including pseudoknots. Journal of Molecular Biology, 285(5), 2053\u20132068.","journal-title":"Journal of Molecular Biology"},{"key":"5110_CR32","doi-asserted-by":"crossref","first-page":"807","DOI":"10.1145\/1273496.1273598","volume-title":"ICML\u00a0\u201907: Proceedings of the 24th international conference on machine learning","author":"S. Shalev-Shwartz","year":"2007","unstructured":"Shalev-Shwartz, S., Singer, Y., & Srebro, N. (2007). Pegasos: primal estimated sub-GrAdient SOlver for SVM. In ICML\u00a0\u201907: Proceedings of the 24th international conference on machine learning (pp.\u00a0807\u2013814). New York: ACM."},{"key":"5110_CR33","first-page":"1449","volume-title":"Advances in neural information processing systems","author":"U. Syed","year":"2008","unstructured":"Syed, U., & Schapire, R. (2008). A game-theoretic approach to apprenticeship learning. In Advances in neural information processing systems (Vol.\u00a020, pp.\u00a01449\u20131456). Cambridge: MIT Press."},{"key":"5110_CR34","unstructured":"Taskar, B., Klein, D., Collins, M., Koller, D., & Manning, C. (2004). Max-margin parsing. In Proceedings of the conference on empirical methods in natural language processing (EMNLP) (pp.\u00a01\u20138)."},{"key":"5110_CR35","doi-asserted-by":"crossref","first-page":"896","DOI":"10.1145\/1102351.1102464","volume-title":"ICML\u00a0\u201905: Proceedings of the 22nd international conference on machine learning","author":"B. Taskar","year":"2005","unstructured":"Taskar, B., Chatalbashev, V., Koller, D., & Guestrin, C. (2005). Learning structured prediction models: a large margin approach. In ICML\u00a0\u201905: Proceedings of the 22nd international conference on machine learning (pp.\u00a0896\u2013903). New York: ACM."},{"key":"5110_CR36","first-page":"632","volume-title":"Proceedings of the 45th annual meeting of the association of computational linguistics","author":"I. Titov","year":"2007","unstructured":"Titov, I., & Henderson, J. (2007). Constituent parsing with incremental sigmoid belief networks. In Proceedings of the 45th annual meeting of the association of computational linguistics (pp.\u00a0632\u2013639). Prague: Association for Computational Linguistics."},{"key":"5110_CR37","first-page":"873","volume-title":"ACL\u00a0\u201906: Proceedings of the 21st international conference on computational linguistics and the 44th annual meeting of the ACL","author":"J. Turian","year":"2006","unstructured":"Turian, J., & Melamed, I. D. (2006). Advances in discriminative parsing. In ACL\u00a0\u201906: Proceedings of the 21st international conference on computational linguistics and the 44th annual meeting of the ACL (pp.\u00a0873\u2013880). Morristown: Association for Computational Linguistics."},{"key":"5110_CR38","unstructured":"Warmuth, M. K., & Jagota, A. K. (1997). Continuous and discrete-time nonlinear gradient descent: relative loss bounds and convergence (Technical Report). Fifth International Symposium on Artificial Intelligence and Mathematics."},{"key":"5110_CR39","unstructured":"Ziebart, B., Maas, A. L., Bagnell, J. A., & Dey, A. K. (2008). Maximum entropy inverse reinforcement learning. In AAAI (pp.\u00a01433\u20131438)."}],"container-title":["Machine Learning"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10994-009-5110-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s10994-009-5110-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10994-009-5110-1","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,24]],"date-time":"2023-05-24T20:20:35Z","timestamp":1684959635000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10994-009-5110-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,4,11]]},"references-count":39,"journal-issue":{"issue":"2-3","published-print":{"date-parts":[[2009,12]]}},"alternative-id":["5110"],"URL":"https:\/\/doi.org\/10.1007\/s10994-009-5110-1","relation":{},"ISSN":["0885-6125","1573-0565"],"issn-type":[{"value":"0885-6125","type":"print"},{"value":"1573-0565","type":"electronic"}],"subject":[],"published":{"date-parts":[[2009,4,11]]}}}