{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,13]],"date-time":"2026-05-13T20:09:16Z","timestamp":1778702956272,"version":"3.51.4"},"reference-count":42,"publisher":"Cambridge University Press (CUP)","issue":"1","license":[{"start":{"date-parts":[[2014,10,8]],"date-time":"2014-10-08T00:00:00Z","timestamp":1412726400000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/www.cambridge.org\/core\/terms"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Nat. Lang. Eng."],"published-print":{"date-parts":[[2016,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Rule-based information extraction is an important approach for processing the increasingly available amount of unstructured data. The manual creation of rule-based applications is a time-consuming and tedious task, which requires qualified knowledge engineers. The costs of this process can be reduced by providing a suitable rule language and extensive tooling support. This paper presents UIMA Ruta, a tool for rule-based information extraction and text processing applications. The system was designed with focus on rapid development. The rule language and its matching paradigm facilitate the quick specification of comprehensible extraction knowledge. They support a compact representation while still providing a high level of expressiveness. These advantages are supplemented by the development environment UIMA Ruta Workbench. It provides, in addition to extensive editing support, essential assistance for explanation of rule execution, introspection, automatic validation, and rule induction. UIMA Ruta is a useful tool for academia and industry due to its open source license. We compare UIMA Ruta to related rule-based systems especially concerning the compactness of the rule representation, the expressiveness, and the provided tooling support. The competitiveness of the runtime performance is shown in relation to a popular and freely-available system. A selection of case studies implemented with UIMA Ruta illustrates the usefulness of the system in real-world scenarios.<\/jats:p>","DOI":"10.1017\/s1351324914000114","type":"journal-article","created":{"date-parts":[[2014,10,8]],"date-time":"2014-10-08T14:13:32Z","timestamp":1412777612000},"page":"1-40","source":"Crossref","is-referenced-by-count":47,"title":["UIMA Ruta: Rapid development of rule-based information extraction applications"],"prefix":"10.1017","volume":"22","author":[{"given":"PETER","family":"KLUEGL","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"MARTIN","family":"TOEPFER","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"PHILIP-DANIEL","family":"BECK","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"GEORG","family":"FETTE","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"FRANK","family":"PUPPE","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"56","published-online":{"date-parts":[[2014,10,8]]},"reference":[{"key":"S1351324914000114_ref036","doi-asserted-by":"publisher","DOI":"10.3115\/1610075.1610143"},{"key":"S1351324914000114_ref035","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-28569-1_2"},{"key":"S1351324914000114_ref030","first-page":"233","volume-title":"Proceedings of the Biennial GSCL Conference 2009, 2nd UIMA@GSCL Workshop","author":"Kluegl","year":"2009"},{"key":"S1351324914000114_ref029","first-page":"233","volume-title":"32nd Annual German Conference on Artificial Intelligence (KI 2009)","author":"Kluegl","year":"2009"},{"key":"S1351324914000114_ref024","volume-title":"Proceedings of the First Workshop on Unstructured Information Management Architecture at Biannual Conference of the Society for Computational Linguistics and Language Technology","author":"Gurevych","year":"2007"},{"key":"S1351324914000114_ref022","doi-asserted-by":"publisher","DOI":"10.1609\/aimag.v31i3.2303"},{"key":"S1351324914000114_ref021","doi-asserted-by":"publisher","DOI":"10.1017\/S1351324904003523"},{"key":"S1351324914000114_ref019","first-page":"199","volume-title":"Workshop Notes of the LWA 2011 - Learning, Knowledge, Adaptation","author":"Eckstein","year":"2011"},{"key":"S1351324914000114_ref017","volume-title":"Special Issue on Managing Information Extraction","author":"Doan","year":"2008"},{"key":"S1351324914000114_ref011","doi-asserted-by":"crossref","first-page":"827","DOI":"10.18653\/v1\/D13-1079","volume-title":"Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing","author":"Chiticariu","year":"2013"},{"key":"S1351324914000114_ref034","first-page":"564","volume-title":"ICSE '03: Proceedings of the 25th International Conference on Software Engineering","author":"Maximilien","year":"2003"},{"key":"S1351324914000114_ref005","unstructured":"Boguraev B. , and Neff M. 2006. An annotation-Based finite-state system for UIMA: pattern matching over annotations. Technical Report, IBM T.J. Watson Research Center."},{"key":"S1351324914000114_ref004","unstructured":"Black W. J. , McNaught J. , Vasilakopoulos A. , Zervanou K. , Theodoulidis B. , and Rinaldi F. 2005. CAFETIERE: conceptual annotations for facts, events, terms, individual entities and RElations. Technical Report TR\u2013U4.3.1. Parmenides Technical Report."},{"key":"S1351324914000114_ref018","first-page":"17","article-title":"Shallow processing with unification and typed feature structures - foundations and applications","volume":"18","author":"Drozdzynski","year":"2004","journal-title":"K\u00fcnstliche Intelligenz"},{"key":"S1351324914000114_ref008","first-page":"543","article-title":"Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging","volume":"21","author":"Brill","year":"1995","journal-title":"Computational Linguistics"},{"key":"S1351324914000114_ref037","doi-asserted-by":"publisher","DOI":"10.1136\/jamia.2009.001560"},{"key":"S1351324914000114_ref031","first-page":"40","volume-title":"33rd Annual German Conference on Artificial Intelligence (KI 2010)","author":"Kluegl","year":"2010"},{"key":"S1351324914000114_ref025","first-page":"37","volume-title":"Web Semantics: Science, Services and Agents on the World Wide Web","author":"IJntema","year":"2012"},{"key":"S1351324914000114_ref033","first-page":"109","volume-title":"Proceedings of the ACL 2012 System Demonstrations, ACL '12","author":"Li","year":"2012"},{"key":"S1351324914000114_ref039","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007562322031"},{"key":"S1351324914000114_ref009","doi-asserted-by":"publisher","DOI":"10.1145\/1989323.1989479"},{"key":"S1351324914000114_ref003","unstructured":"Beck P.-D. 2013. Identifikation und Klassifikation von Abschnitten in Arztbriefen (in German). Master Thesis, University of W\u00fcrzburg."},{"key":"S1351324914000114_ref028","first-page":"19","volume-title":"4th International Workshop on Knowledge Engineering and Software Engineering (KESE 2008), 31th German Conference on Artificial Intelligence (KI-2008)","author":"Kluegl","year":"2009"},{"key":"S1351324914000114_ref001","first-page":"23","volume-title":"Proceedings of a Workshop on Held at Baltimore, Maryland: October 13\u201315, 1998, (TIPSTER '98)","author":"Appelt","year":"1998"},{"key":"S1351324914000114_ref016","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1109\/MC.2005.314","article-title":"Test-driven development: concepts, taxonomy, and future direction","volume":"38","author":"David","year":"2005","journal-title":"Computer"},{"key":"S1351324914000114_ref015","unstructured":"Cunningham H. , Maynard D. , and Tablan V. 2000. JAPE: a java annotation patterns engine (Second Edition). Research Memorandum CS\u201300\u201310, Department of Computer Science, University of Sheffield, Sheffield."},{"key":"S1351324914000114_ref014","volume-title":"Text Processing with GATE (Version 6)","author":"Cunningham","year":"2011"},{"key":"S1351324914000114_ref012","unstructured":"Ciravegna F. 2003. (LP)2, rule induction for information extraction using linguistic constraints. Technical Report CS\u201303\u201307, Department of Computer Science, University of Sheffield."},{"key":"S1351324914000114_ref040","doi-asserted-by":"publisher","DOI":"10.1145\/1132956.1132957"},{"key":"S1351324914000114_ref002","first-page":"1","volume-title":"LWA-2008 (Special Track on Knowledge Discovery and Machine Learning)","author":"Atzmueller","year":"2008"},{"key":"S1351324914000114_ref041","unstructured":"Wittek A. , Toepfer M. , Fette G. , Kluegl P. , and Puppe F. 2013. Constraint-driven evaluation in UIMA Ruta. In Kluegl, Eckart de Castilho and Tomanek (ed.), UIMA@GSCL, CEUR Workshop Proceedings, vol. 1038. CEUR-WS.org, pp. 58\u201365."},{"key":"S1351324914000114_ref042","doi-asserted-by":"publisher","DOI":"10.1145\/2470654.2466212"},{"key":"S1351324914000114_ref013","first-page":"35","volume-title":"Recent Advances in Natural Language Processing IV: Selected papers from RANLP 2005","author":"Cunningham","year":"2007"},{"key":"S1351324914000114_ref032","first-page":"282","volume-title":"Proceedings 18th International Conference on Machine Learning","author":"Lafferty","year":"2001"},{"key":"S1351324914000114_ref026","first-page":"1624","volume-title":"ICDE","author":"Khaitan","year":"2008"},{"key":"S1351324914000114_ref027","first-page":"56","volume-title":"Proceedings LWA 2009 (KDML - Special Track on Knowledge Discovery and Machine Learning)","author":"Kluegl","year":"2009"},{"key":"S1351324914000114_ref020","doi-asserted-by":"publisher","DOI":"10.1145\/2463664.2463665"},{"key":"S1351324914000114_ref038","first-page":"1033","volume-title":"Proceedings of the 33rd International Conference on Very Large Data Bases, (VLDB '07)","author":"Shen","year":"2007"},{"key":"S1351324914000114_ref010","first-page":"128","volume-title":"Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics","author":"Chiticariu","year":"2010"},{"key":"S1351324914000114_ref007","doi-asserted-by":"publisher","DOI":"10.1145\/1519103.1519107"},{"key":"S1351324914000114_ref006","doi-asserted-by":"publisher","DOI":"10.1007\/s10579-010-9123-y"},{"key":"S1351324914000114_ref023","volume-title":"Proceedings of the 10th International Semantic Web Conference (ISWC2011)","author":"Greenwood","year":"2011"}],"container-title":["Natural Language Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.cambridge.org\/core\/services\/aop-cambridge-core\/content\/view\/S1351324914000114","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,5,5]],"date-time":"2025-05-05T02:01:42Z","timestamp":1746410502000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.cambridge.org\/core\/product\/identifier\/S1351324914000114\/type\/journal_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,10,8]]},"references-count":42,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2016,1]]}},"alternative-id":["S1351324914000114"],"URL":"https:\/\/doi.org\/10.1017\/s1351324914000114","relation":{},"ISSN":["1351-3249","1469-8110"],"issn-type":[{"value":"1351-3249","type":"print"},{"value":"1469-8110","type":"electronic"}],"subject":[],"published":{"date-parts":[[2014,10,8]]}}}