{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,11]],"date-time":"2026-03-11T13:51:10Z","timestamp":1773237070677,"version":"3.50.1"},"reference-count":45,"publisher":"Cambridge University Press (CUP)","issue":"3","license":[{"start":{"date-parts":[[2013,9,16]],"date-time":"2013-09-16T00:00:00Z","timestamp":1379289600000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/www.cambridge.org\/core\/terms"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Nat. Lang. Eng."],"published-print":{"date-parts":[[2015,5]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Paraphrase corpora are an essential but scarce resource in Natural Language Processing. In this paper, we present the Wikipedia-based Relational Paraphrase Acquisition (WRPA) method, which extracts relational paraphrases from Wikipedia, and the derived WRPA paraphrase corpus. The WRPA corpus currently covers person-related and authorship relations in English and Spanish, respectively, suggesting that, given adequate Wikipedia coverage, our method is independent of the language and the relation addressed. WRPA extracts entity pairs from structured information in Wikipedia applying distant learning and, based on the distributional hypothesis, uses them as anchor points for candidate paraphrase extraction from the free text in the body of Wikipedia articles. Focussing on relational paraphrasing and taking advantage of Wikipedia-structured information allows for an automatic and consistent evaluation of the results. The WRPA corpus characteristics distinguish it from other types of corpora that rely on string similarity or transformation operations. WRPA relies on distributional similarity and is the result of the free use of language outside any reformulation framework. Validation results show a high precision for the corpus.<\/jats:p>","DOI":"10.1017\/s1351324913000235","type":"journal-article","created":{"date-parts":[[2013,9,16]],"date-time":"2013-09-16T13:50:37Z","timestamp":1379339437000},"page":"355-389","source":"Crossref","is-referenced-by-count":5,"title":["Relational paraphrase acquisition from Wikipedia: The WRPA method and corpus"],"prefix":"10.1017","volume":"21","author":[{"given":"M.","family":"VILA","sequence":"first","affiliation":[]},{"given":"H.","family":"RODR\u00cdGUEZ","sequence":"additional","affiliation":[]},{"given":"M. A.","family":"MART\u00cd","sequence":"additional","affiliation":[]}],"member":"56","published-online":{"date-parts":[[2013,9,16]]},"reference":[{"key":"S1351324913000235_ref006","first-page":"50","volume-title":"Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics (ACL 2001)","author":"Barzilay","year":"2001"},{"key":"S1351324913000235_ref042","unstructured":"Wubben S. , van den Bosch A. , and Krahmer E. 2010. Paraphrase generation as monolingual translation: data and evaluation. In Proceedings of the 6th International Language Generation Conference (INLG 2010), pp. 203\u20137. Dublin: ACL."},{"key":"S1351324913000235_ref002","doi-asserted-by":"publisher","DOI":"10.1075\/ijcl.9.1.03are"},{"key":"S1351324913000235_ref015","doi-asserted-by":"publisher","DOI":"10.3115\/1599081.1599099"},{"key":"S1351324913000235_ref029","doi-asserted-by":"publisher","DOI":"10.1162\/coli_a_00002"},{"key":"S1351324913000235_ref022","volume-title":"Proceedings of the Fifth Text Analysis Conference (TAC 2012)","author":"Gonz\u00e0lez","year":"2012"},{"key":"S1351324913000235_ref007","first-page":"674","volume-title":"Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (HLT\/ACL 2008)","author":"Bhagat","year":"2008"},{"key":"S1351324913000235_ref019","first-page":"367","article-title":"CoCo, a web interface for corpora compilation","volume":"43","author":"Espa\u00f1a-Bonet","year":"2009","journal-title":"Procesamiento del Lenguaje Natural"},{"key":"S1351324913000235_ref009","doi-asserted-by":"publisher","DOI":"10.1145\/2483669.2483676"},{"key":"S1351324913000235_ref038","doi-asserted-by":"crossref","unstructured":"Ravichandran D. , and Hovy E. 2002. Learning surface text patterns for a question answering system. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL 2002), pp. 41\u20137. Philadelphia, PA: ACL.","DOI":"10.3115\/1073083.1073092"},{"key":"S1351324913000235_ref018","doi-asserted-by":"publisher","DOI":"10.3115\/1220355.1220406"},{"key":"S1351324913000235_ref011","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-58473-0_144"},{"key":"S1351324913000235_ref004","doi-asserted-by":"publisher","DOI":"10.1162\/COLI_a_00153"},{"key":"S1351324913000235_ref005","first-page":"16","volume-title":"Proceedings of the 4th Annual Meeting of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (HLT\/NAACL 2003)","author":"Barzilay","year":"2003"},{"key":"S1351324913000235_ref010","first-page":"217","volume-title":"Proceedings of the HLT\/NAACL 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk (CSLDAMT 2010)","author":"Buzek","year":"2010"},{"key":"S1351324913000235_ref013","doi-asserted-by":"publisher","DOI":"10.1007\/s10579-009-9112-1"},{"key":"S1351324913000235_ref031","first-page":"3143","volume-title":"Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010)","author":"Max","year":"2010"},{"key":"S1351324913000235_ref008","doi-asserted-by":"publisher","DOI":"10.1007\/10704656_11"},{"key":"S1351324913000235_ref012","first-page":"190","volume-title":"Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (HLT\/ACL 2011)","author":"Chen","year":"2011"},{"key":"S1351324913000235_ref023","doi-asserted-by":"publisher","DOI":"10.1145\/1656274.1656278"},{"key":"S1351324913000235_ref045","unstructured":"Zhu Z. , Bernhard D. , and Gurevych I. 2010. A monolingual tree-based translation method for sentence simplification. In Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010), pp. 1353\u201361. Beijing: International Committee on Computational Linguistics."},{"key":"S1351324913000235_ref024","doi-asserted-by":"publisher","DOI":"10.1080\/00437956.1954.11659520"},{"key":"S1351324913000235_ref025","first-page":"37","article-title":"Paraphrase extraction from validated question answering corpora in Spanish","volume":"39","author":"Herrera","year":"2007","journal-title":"Procesamiento del Lenguaje Natural"},{"key":"S1351324913000235_ref026","doi-asserted-by":"publisher","DOI":"10.1016\/S0004-3702(02)00222-9"},{"key":"S1351324913000235_ref035","unstructured":"Padr\u00f3 L. , Collado M. , Reese S. , Lloberes M. , and Castell\u00f3n I. 2010. Freeling 2.1: five years of open-source language processing tools. In Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), pp. 931\u20136. Valletta, Malta: European Language Resources Association."},{"key":"S1351324913000235_ref028","first-page":"323","volume-title":"Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2001)","author":"Lin","year":"2001"},{"key":"S1351324913000235_ref027","first-page":"42","volume-title":"Proceedings of the ACL 2010 System Demonstrations (ACLDemos 2010)","author":"Kouylekov","year":"2010"},{"key":"S1351324913000235_ref030","first-page":"2","volume-title":"Proceedings of the 13th Conference of the European Chapter on the Association for Computational Linguistics (EACL 2012)","author":"Martzoukos","year":"2012"},{"key":"S1351324913000235_ref032","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijhcs.2009.05.004"},{"key":"S1351324913000235_ref034","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-662-09438-9"},{"key":"S1351324913000235_ref041","first-page":"11","article-title":"WRPA: a system for relational paraphrase acquisition from Wikipedia","volume":"45","author":"Vila","year":"2010","journal-title":"Procesamiento del Lenguaje Natural"},{"key":"S1351324913000235_ref037","unstructured":"Potthast M. , Stein B. , Barr\u00f3n-Cede\u00f1o A. , and Rosso P. 2010. An evaluation framework for plagiarism detection. In Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010), pp. 997\u20131005. Beijing: International Committee on Computational Linguistics."},{"key":"S1351324913000235_ref014","doi-asserted-by":"publisher","DOI":"10.1162\/coli.08-003-R1-07-044"},{"key":"S1351324913000235_ref036","doi-asserted-by":"crossref","unstructured":"Pang B. , Knight K. , and Marcu D. 2003. Syntax-based alignment of multiple translations: extracting paraphrases and generating new sentences. In Proceedings of the 4th Annual Meeting of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (HLT\/NAACL 2003), pp. 102\u20139. Edmonton, Canada: ACL.","DOI":"10.3115\/1073445.1073469"},{"key":"S1351324913000235_ref020","first-page":"35","volume-title":"Directions in Corpus Linguistics. Proceedings of Nobel Symposium 82","author":"Fillmore","year":"1992"},{"key":"S1351324913000235_ref033","first-page":"1003","volume-title":"Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the 4th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL\/IJCNLP 2009)","author":"Mintz","year":"2009"},{"key":"S1351324913000235_ref044","unstructured":"Zesch T. , M\u00fcller C. , and Gurevych I. 2008. Extracting lexical semantic knowledge from Wikipedia and Wiktionary. In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008), pp. 1646\u201352. Marrakech, Morocco: European Language Resources Association."},{"key":"S1351324913000235_ref040","doi-asserted-by":"crossref","unstructured":"Vila M. , Bertran M. , Mart\u00ed M. A. , and Rodr\u00edguez H. 2013. Corpus annotation with paraphrase types: new annotation scheme and inter-annotator agreement measures (submitted).","DOI":"10.1007\/s10579-014-9272-5"},{"key":"S1351324913000235_ref039","unstructured":"Szpektor I. , Tanev H. , Dagan I. , and Coppola B. 2004. Scaling web-based acquisition of entailment relations. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP 2004), pp. 41\u20138. Barcelona, Spain: ACL."},{"key":"S1351324913000235_ref003","first-page":"597","volume-title":"Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005)","author":"Bannard","year":"2005"},{"key":"S1351324913000235_ref001","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1613\/jair.2985","article-title":"A survey of paraphrasing and textual entailment methods","volume":"38","author":"Androutsopoulos","year":"2010","journal-title":"Journal of Artificial Intelligence Research"},{"key":"S1351324913000235_ref017","first-page":"9","volume-title":"Proceedings of the 3rd International Workshop on Paraphrasing (IWP 2005)","author":"Dolan","year":"2005"},{"key":"S1351324913000235_ref016","first-page":"665","volume-title":"Proceeding of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (HLT\/ACL 2011)","author":"Coster","year":"2011"},{"key":"S1351324913000235_ref043","unstructured":"Yatskar M. , Pang B. , Danescu-Niculescu-Mizil C. , and Lee L. 2010. For the sake of simplicity: unsupervised extraction of lexical simplifications from Wikipedia. In Proceedings of the 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (HLT\/NAACL 2010), pp. 365\u20138. Los Angeles, CA: ACL."},{"key":"S1351324913000235_ref021","first-page":"25","volume-title":"Proceedings of the 3rd International Workshop on Paraphrasing (IWP 2005)","author":"Fujita","year":"2005"}],"container-title":["Natural Language Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.cambridge.org\/core\/services\/aop-cambridge-core\/content\/view\/S1351324913000235","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,7,24]],"date-time":"2019-07-24T04:11:13Z","timestamp":1563941473000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.cambridge.org\/core\/product\/identifier\/S1351324913000235\/type\/journal_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,9,16]]},"references-count":45,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2015,5]]}},"alternative-id":["S1351324913000235"],"URL":"https:\/\/doi.org\/10.1017\/s1351324913000235","relation":{},"ISSN":["1351-3249","1469-8110"],"issn-type":[{"value":"1351-3249","type":"print"},{"value":"1469-8110","type":"electronic"}],"subject":[],"published":{"date-parts":[[2013,9,16]]}}}