{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:13:10Z","timestamp":1750306390311,"version":"3.41.0"},"reference-count":62,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2015,12,4]],"date-time":"2015-12-04T00:00:00Z","timestamp":1449187200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Intell. Syst. Technol."],"published-print":{"date-parts":[[2016,1,22]]},"abstract":"<jats:p>We consider the task of record extraction from text documents, where the goal is to automatically populate the fields of target relations, such as scientific seminars or corporate acquisition events. There are various inferences involved in the record-extraction process, including mention detection, unification, and field assignments. We use structured learning to find the appropriate field-value assignments. Unlike previous works, the proposed approach generates feature-rich models that enable the modeling of domain semantics and structural coherence at all levels and across fields. Given labeled examples, such an approach can, for instance, learn likely event durations and the fact that start times should come before end times. While the inference space is large, effective learning is achieved using a perceptron-style method and simple, greedy beam decoding. A main focus of this article is on practical aspects involved in implementing the proposed framework for real-world applications. We argue and demonstrate that this approach is favorable in conditions of data shift, a real-world setting in which models learned using a limited set of labeled examples are applied to examples drawn from a different data distribution. Much of the framework\u2019s robustness is attributed to the modeling of domain knowledge. We describe design and implementation details for the case study of seminar event extraction from email announcements, and discuss design adaptations across different domains and text genres.<\/jats:p>","DOI":"10.1145\/2801131","type":"journal-article","created":{"date-parts":[[2015,12,4]],"date-time":"2015-12-04T13:43:07Z","timestamp":1449236587000},"page":"1-34","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Event Extraction using Structured Learning and Rich Domain Knowledge"],"prefix":"10.1145","volume":"7","author":[{"given":"Einat","family":"Minkov","sequence":"first","affiliation":[{"name":"University of Haifa, Haifa, Israel"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2015,12,4]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/MIS.2003.1179189"},{"key":"e_1_2_1_2_1","doi-asserted-by":"crossref","unstructured":"Galia\n       \n      Angelova\n    . 2010.\n      \n  \n   \n  Use of domain knowledge in the automatic extraction of structured representations from patient-related texts\n  . \n  Conceptual Structures\n  : From Information to Intelligence Lecture Notes in Computer Science vol. \n  6208 Springer Berlin 14--27.   Galia Angelova. 2010. Use of domain knowledge in the automatic extraction of structured representations from patient-related texts. Conceptual Structures: From Information to Intelligence Lecture Notes in Computer Science vol. 6208 Springer Berlin 14--27.","DOI":"10.1007\/978-3-642-14197-3_6"},{"volume-title":"Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI).","year":"2008","author":"Banko Michele","key":"e_1_2_1_3_1"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/MIS.2003.1234765"},{"key":"e_1_2_1_5_1","unstructured":"Mary Elaine Califf and Raymond J. Mooney. 1999. Relational learning of pattern-match rules for information extraction. In AAAI\/IAAI.  Mary Elaine Califf and Raymond J. Mooney. 1999. Relational learning of pattern-match rules for information extraction. In AAAI\/IAAI."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1162\/153244304322972685"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/1718487.1718501"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10994-012-5296-5"},{"volume-title":"Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI).","year":"2005","author":"Cohen William W.","key":"e_1_2_1_9_1"},{"key":"e_1_2_1_10_1","unstructured":"William W. Cohen Pradeep Ravikumar and Stephen Fienberg. 2003. A comparison of string distance metrics for name-matching tasks. In IIWEB.  William W. Cohen Pradeep Ravikumar and Stephen Fienberg. 2003. A comparison of string distance metrics for name-matching tasks. In IIWEB."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.5555\/1622859.1622867"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.3115\/1118693.1118694"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1162\/0891201053630273"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.3115\/1218955.1218970"},{"volume-title":"First PASCAL Challenges Workshop.","year":"2005","author":"Cox Christopher","key":"e_1_2_1_15_1"},{"key":"e_1_2_1_16_1","unstructured":"Koby Crammer Alex Kulesza and Mark Dredze. 2009. Adaptive regularization of weight vectors. In Advances in Neural Information Processing Systems (NIPS).  Koby Crammer Alex Kulesza and Mark Dredze. 2009. Adaptive regularization of weight vectors. In Advances in Neural Information Processing Systems (NIPS)."},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2105-6-S1-S13"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btg452"},{"volume-title":"Proceedings of the Annual Meeting of the Association of Computational Linguistics (ACL).","year":"2007","author":"Hal Daum\u00e9","key":"e_1_2_1_19_1"},{"key":"e_1_2_1_20_1","unstructured":"M. C. de Marneffe B. Maccartney and C. D. Manning. 2006. Generating typed dependency parses from phrase structure parses. In LREC.  M. C. de Marneffe B. Maccartney and C. D. Manning. 2006. Generating typed dependency parses from phrase structure parses. In LREC."},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/1409360.1409378"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.3115\/1219840.1219885"},{"key":"e_1_2_1_23_1","unstructured":"Aidan Finn. 2006. A multi-level boundary classification approach to information extraction. In PhD thesis. University College Dublin Ireland.  Aidan Finn. 2006. A multi-level boundary classification approach to information extraction. In PhD thesis. University College Dublin Ireland."},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007601113994"},{"key":"e_1_2_1_25_1","unstructured":"Dayne Freitag and Andrew McCallum. 2000. Information extraction with HMM structures learned by stochastic optimization. In AAAI\/IAAI.   Dayne Freitag and Andrew McCallum. 2000. Information extraction with HMM structures learned by stochastic optimization. In AAAI\/IAAI."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007662407062"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.3115\/992628.992709"},{"volume-title":"Proceedings of ACL.","year":"2010","author":"Haghighi Aria","key":"e_1_2_1_28_1"},{"volume-title":"Proceedings of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT).","year":"2012","author":"Huang Liang","key":"e_1_2_1_29_1"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/bti1006"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10579-008-9079-3"},{"volume-title":"Proceedings of AAAI Conference on Artificial Intelligence (AAAI).","author":"Ling Xiao","key":"e_1_2_1_32_1"},{"key":"e_1_2_1_33_1","unstructured":"Inderjeet Mani. 2001. Automatic Summarization. John Benjamins Publishing Co. Philadelphia PA.  Inderjeet Mani. 2001. Automatic Summarization. John Benjamins Publishing Co. Philadelphia PA."},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.3115\/1219840.1219852"},{"volume-title":"Proceedings of the Annual Conference of the North American Chapter of the ACL (NAACL).","year":"2010","author":"McDonald Ryan","key":"e_1_2_1_35_1"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/1871437.1871542"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/1877766.1877770"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.3115\/1220575.1220631"},{"key":"e_1_2_1_39_1","doi-asserted-by":"crossref","unstructured":"Einat Minkov Richard C. Wang Anthony Tomasic and William W. Cohen. 2006. NER systems that suit user\u2019s preferences: Adjusting the recall-precision trade-off for entity extraction. In HLT\/NAACL.   Einat Minkov Richard C. Wang Anthony Tomasic and William W. Cohen. 2006. NER systems that suit user\u2019s preferences: Adjusting the recall-precision trade-off for entity extraction. In HLT\/NAACL.","DOI":"10.3115\/1614049.1614073"},{"volume-title":"Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL).","author":"Minkov Einat","key":"e_1_2_1_40_1"},{"key":"e_1_2_1_41_1","unstructured":"Minorthird. 2008. Methods for identifying names and ontological relations in text using heuristics for inducing regularities from data. Retrieved November 4 2015 from http:\/\/sourceforge.net\/projects\/minorthird\/.  Minorthird. 2008. Methods for identifying names and ontological relations in text using heuristics for inducing regularities from data. Retrieved November 4 2015 from http:\/\/sourceforge.net\/projects\/minorthird\/."},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.5555\/1690219.1690287"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/1089815.1089817"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2009.191"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/1814433.1814461"},{"volume-title":"Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI).","year":"2003","author":"Peshkin Leonid","key":"e_1_2_1_46_1"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.3115\/1220175.1220230"},{"key":"e_1_2_1_48_1","doi-asserted-by":"crossref","unstructured":"Joaquin Quionero-Candela Masashi Sugiyama Anton Schwaighofer and Neil D. Lawrence. 2009. Dataset Shift in Machine Learning. MIT Press Cambridge MA.   Joaquin Quionero-Candela Masashi Sugiyama Anton Schwaighofer and Neil D. Lawrence. 2009. Dataset Shift in Machine Learning. MIT Press Cambridge MA.","DOI":"10.7551\/mitpress\/9780262170055.001.0001"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1037\/h0042519"},{"volume-title":"Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI).","year":"2001","author":"Roth Dan","key":"e_1_2_1_50_1"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072228.1072379"},{"volume-title":"Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence (AAAI).","year":"2013","author":"Samadi Mehdi","key":"e_1_2_1_52_1"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1561\/1900000003"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-007-9019-4"},{"key":"e_1_2_1_55_1","unstructured":"Christian Siefkes. 2008. An Incrementally Trainable Statistical Approach to Information Extraction. VDM Verlag Saarbr\u00fccken Germany.   Christian Siefkes. 2008. An Incrementally Trainable Statistical Approach to Information Extraction. VDM Verlag Saarbr\u00fccken Germany."},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-11964-9_33"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/1132956.1132957"},{"volume-title":"Proceedings of EMNLP.","year":"2011","author":"Yao Limin","key":"e_1_2_1_59_1"},{"volume-title":"Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP (ACL-AFNLP).","author":"Luke","key":"e_1_2_1_60_1"},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/E14-1018"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1162\/coli_a_00037"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/1150402.1150457"}],"container-title":["ACM Transactions on Intelligent Systems and Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2801131","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2801131","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T05:07:13Z","timestamp":1750223233000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2801131"}},"subtitle":["Application across Domains and Data Sources"],"short-title":[],"issued":{"date-parts":[[2015,12,4]]},"references-count":62,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2016,1,22]]}},"alternative-id":["10.1145\/2801131"],"URL":"https:\/\/doi.org\/10.1145\/2801131","relation":{},"ISSN":["2157-6904","2157-6912"],"issn-type":[{"type":"print","value":"2157-6904"},{"type":"electronic","value":"2157-6912"}],"subject":[],"published":{"date-parts":[[2015,12,4]]},"assertion":[{"value":"2014-09-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2015-07-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2015-12-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}