{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T11:01:18Z","timestamp":1740135678012,"version":"3.37.3"},"reference-count":79,"publisher":"Cambridge University Press (CUP)","issue":"1","license":[{"start":{"date-parts":[[2018,10,31]],"date-time":"2018-10-31T00:00:00Z","timestamp":1540944000000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/www.cambridge.org\/core\/terms"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Nat. Lang. Eng."],"published-print":{"date-parts":[[2019,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>This article presents a new method to automatically simplify English sentences. The approach is designed to reduce the number of compound clauses and nominally bound relative clauses in input sentences. The article provides an overview of a corpus annotated with information about various explicit signs of syntactic complexity and describes the two major components of a sentence simplification method that works by exploiting information on the signs occurring in the sentences of a text. The first component is a sign tagger which automatically classifies signs in accordance with the annotation scheme used to annotate the corpus. The second component is an iterative rule-based sentence transformation tool. Exploiting the sign tagger in conjunction with other NLP components, the sentence transformation tool automatically rewrites long sentences containing compound clauses and nominally bound relative clauses as sequences of shorter single-clause sentences. Evaluation of the different components reveals acceptable performance in rewriting sentences containing compound clauses but less accuracy when rewriting sentences containing nominally bound relative clauses. A detailed error analysis revealed that the major sources of error include inaccurate sign tagging, the relatively limited coverage of the rules used to rewrite sentences, and an inability to discriminate between various subtypes of clause coordination. Despite this, the system performed well in comparison with two baselines. This finding was reinforced by automatic estimations of the readability of system output and by surveys of readers\u2019 opinions about the accuracy, accessibility, and meaning of this output.<\/jats:p>","DOI":"10.1017\/s1351324918000384","type":"journal-article","created":{"date-parts":[[2018,10,31]],"date-time":"2018-10-31T10:48:48Z","timestamp":1540982928000},"page":"69-119","source":"Crossref","is-referenced-by-count":4,"title":["Identifying signs of syntactic complexity for rule-based sentence simplification"],"prefix":"10.1017","volume":"25","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1220-8605","authenticated-orcid":false,"given":"RICHARD","family":"EVANS","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"CONSTANTIN","family":"OR\u0102SAN","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"56","published-online":{"date-parts":[[2018,10,31]]},"reference":[{"key":"S1351324918000384_ref021","first-page":"20","volume-title":"Proceedings of the 11th Annual College of Education and GSN Research Conference","author":"DeFrancesco","year":"2012"},{"key":"S1351324918000384_ref018","first-page":"665","volume-title":"Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL-2011)","author":"Coster","year":"2011"},{"doi-asserted-by":"publisher","key":"S1351324918000384_ref011","DOI":"10.3758\/BRM.40.2.540"},{"key":"S1351324918000384_ref008","first-page":"75","volume-title":"Proceedings of the NAACL-HLT 2012 Workshop on Speech and Language Processing for Assistive Technologies (SLPAT)","author":"Bott","year":"2012"},{"key":"S1351324918000384_ref042","first-page":"707","article-title":"Binary codes capable of correcting deletions and insertions and reversals","volume":"10","author":"Levenshtein","year":"1966","journal-title":"Soviet Physics Doklady"},{"doi-asserted-by":"publisher","key":"S1351324918000384_ref001","DOI":"10.3115\/981967.981970"},{"doi-asserted-by":"publisher","key":"S1351324918000384_ref017","DOI":"10.1613\/jair.2655"},{"doi-asserted-by":"publisher","key":"S1351324918000384_ref065","DOI":"10.3115\/v1\/E14-1076"},{"doi-asserted-by":"publisher","key":"S1351324918000384_ref016","DOI":"10.1177\/001316446002000104"},{"doi-asserted-by":"publisher","key":"S1351324918000384_ref057","DOI":"10.3115\/974147.974173"},{"unstructured":"Scarton C. , Palmero Aprosio A. , Tonelli S. , Martin-Wanton T. , and Specia L. 2017. MUSST: a multilingual syntactic simplification tool. In The Companion Volume of the IJCNLP 2017 Proceedings: System Demonstrations, Taipei, Taiwan: AFNLP, pp. 25\u20138.","key":"S1351324918000384_ref059"},{"unstructured":"Max A. 2000. Syntactic Simplification \u2013 An Application to Text for Aphasic Readers. Mphil in Computer Speech and Language Processing, Wolfson College, University of Cambridge.","key":"S1351324918000384_ref046"},{"key":"S1351324918000384_ref022","first-page":"449","volume-title":"Proceedings of the International Conference on Language Resources and Evaluation (LREC)","author":"de Marneffe","year":"2006"},{"key":"S1351324918000384_ref005","first-page":"1996","volume-title":"Proceedings of the 25th International Conference on Computational Linguistics: Technical Papers (COLING 2014)","author":"Angrosh","year":"2014"},{"key":"S1351324918000384_ref054","first-page":"311","volume-title":"Proceedings of the 40th annual meeting for Computational Linguistics","author":"Papineni","year":"2002"},{"key":"S1351324918000384_ref038","first-page":"1528","volume-title":"Proceedings of North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2016)","author":"Klerke","year":"2016"},{"key":"S1351324918000384_ref064","first-page":"2","volume-title":"Proceedings of the 13th European Workshop on Natural Language Generation (ENLG \u201911)","author":"Siddharthan","year":"2011"},{"key":"S1351324918000384_ref010","first-page":"47","volume-title":"Proceedings of the 3rd Workshop on Predicting and Improving Text Readability for Target Reader Populations (PITR) at EACL 2014","author":"Brouwers","year":"2014"},{"key":"S1351324918000384_ref026","first-page":"1","volume-title":"Proceedings of the 2nd Workshop on Predicting and Improving Text Readability for Target Reader Populations","author":"Feblowitz","year":"2013"},{"volume-title":"The Psychology of Language","year":"2003","author":"Jay","key":"S1351324918000384_ref033"},{"key":"S1351324918000384_ref043","first-page":"166","volume-title":"Proceedings of the 6th Linguistic Annotation Workshop","author":"Maier","year":"2012"},{"doi-asserted-by":"publisher","key":"S1351324918000384_ref002","DOI":"10.1145\/1410140.1410191"},{"doi-asserted-by":"publisher","key":"S1351324918000384_ref048","DOI":"10.1017\/CBO9780511894664"},{"key":"S1351324918000384_ref079","first-page":"1353","volume-title":"Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)","author":"Zhu","year":"2010"},{"key":"S1351324918000384_ref015","first-page":"184","volume-title":"Readings in English Transformational Grammar","author":"Chomsky","year":"1970"},{"unstructured":"Canning Y. 2002. Syntactic Simplification of Text. Ph.d. thesis, University of Sunderland.","key":"S1351324918000384_ref012"},{"key":"S1351324918000384_ref019","first-page":"1045","volume-title":"Proceedings of the 4th International Conference on Language Resources and Evaluation (LREC-2004)","author":"Daelemans","year":"2004"},{"key":"S1351324918000384_ref070","first-page":"293","volume-title":"Proceedings of the 14th IEEE International Conference on Tools with Artificial Intelligence (ICTAI \u201902)","author":"Van Delden","year":"2002"},{"doi-asserted-by":"publisher","key":"S1351324918000384_ref006","DOI":"10.2307\/1131734"},{"doi-asserted-by":"publisher","key":"S1351324918000384_ref014","DOI":"10.3115\/993268.993361"},{"key":"S1351324918000384_ref056","first-page":"317","volume-title":"Proceedings of the 20th Nordic Conference of Computational Linguistics (NODALIDA 2015)","author":"Rennes","year":"2015"},{"key":"S1351324918000384_ref003","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1145\/1456536.1456540","volume-title":"Proceedings of the 26th Annual ACM International Conference on Design of Communication (SIGDOC \u201908)","author":"Aluisio","year":"2008"},{"doi-asserted-by":"publisher","key":"S1351324918000384_ref068","DOI":"10.1561\/2200000013"},{"key":"S1351324918000384_ref027","first-page":"191","article-title":"A web-based text simplification system for english","volume":"55","author":"Ferr\u00e9s","year":"2015","journal-title":"Procesamiento del Lenguaje Natural"},{"key":"S1351324918000384_ref020","first-page":"19","volume-title":"Proceedings of the SIGIR Workshop on Accessible Search Systems","author":"De Belder","year":"2010"},{"key":"S1351324918000384_ref007","first-page":"277","volume-title":"Proceedings of the 2008 Conference in Semantics in Text Processing","author":"Bos","year":"2008"},{"key":"S1351324918000384_ref004","first-page":"16","volume-title":"Proceedings of the 8th International Natural Language Generation Conference","author":"Angrosh","year":"2014"},{"key":"S1351324918000384_ref060","first-page":"4019","volume-title":"Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC\u201912)","author":"Seretan","year":"2012"},{"unstructured":"Martos J. , Freire S. , Gonz\u00e1lez A. , Gil D. , Evans R. , Jordanova V. , Cerga A. , Shishkova A. , and Orasan C. 2013. User preferences: Updated. Technical Report D2.2, Deletrea, Madrid, Spain.","key":"S1351324918000384_ref045"},{"key":"S1351324918000384_ref023","first-page":"221","volume-title":"Proceedings of the 9th International Conference on Recent Advances in Natural Language Processing (RANLP-2013)","author":"Dornescu","year":"2013"},{"doi-asserted-by":"publisher","key":"S1351324918000384_ref024","DOI":"10.1093\/llc\/fqr034"},{"key":"S1351324918000384_ref025","first-page":"92","volume-title":"Text, Speech and Dialogue. Proceedings of the 16th International Conference TSD 2013","author":"Evans","year":"2013"},{"key":"S1351324918000384_ref029","first-page":"71","volume-title":"Proceedings of the Student Workshop held in conjunction with RANLP 2013","author":"Glavas","year":"2013"},{"doi-asserted-by":"publisher","key":"S1351324918000384_ref030","DOI":"10.1007\/s10579-017-9407-6"},{"doi-asserted-by":"publisher","key":"S1351324918000384_ref078","DOI":"10.18653\/v1\/D17-1062"},{"key":"S1351324918000384_ref031","first-page":"1147","volume-title":"Proceedings of the 2nd International Conference on Language Resources and Evaluation","author":"Grover","year":"2000"},{"key":"S1351324918000384_ref032","first-page":"278","volume-title":"Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics","author":"Hepple","year":"2000"},{"unstructured":"Jel\u00ednek T. 2014. Improvements to dependency parsing using automatic simplification of data. In Proceedings of Language Resources and Evaluation (LREC-2014), Reykjavik, Iceland: European Language Resources Association, pp. 73\u20137.","key":"S1351324918000384_ref034"},{"key":"S1351324918000384_ref035","first-page":"177","volume-title":"Proceedings of NAACL HLT 2009: Short Papers","author":"Jonnalagadda","year":"2009"},{"doi-asserted-by":"crossref","unstructured":"Kincaid J. P. , Fishburne R. P. , Rogers R. L. , and Chissom B. S. 1975. Derivation of new readability formulas (Automatic readability index, fog count and flesch reading ease formula) for Navy enlisted personnel. CNTECHTRA Research Branch Report 8-75, CNTECHTRA.","key":"S1351324918000384_ref036","DOI":"10.21236\/ADA006655"},{"unstructured":"Kintsch W. , and Welsch D. M. 1991. The construction\u2013integration model: a framework for studying memory for text. In W. E. Hockley , and S. Lewandowsky (eds.), Relating Theory and Data: Essays on Human Memory, pp. 367\u201385. NJ, Erlbaum: Hillsdale.","key":"S1351324918000384_ref037"},{"unstructured":"Kudo T. 2005. Crf++: yet another crf toolkit. http:\/\/crfpp.sourceforge.net.","key":"S1351324918000384_ref039"},{"key":"S1351324918000384_ref040","first-page":"282","volume-title":"Proceedings of the 18th International Conference on Machine Learning","author":"Lafferty","year":"2001"},{"unstructured":"Lei C.-U. , Man K. L. , and Ting T. O. 2014. Using Coh-Metrix to analyse writing skills of students: a case study in a technological common core curriculum course. In Proceedings of the International MultiConference of Engineers and Computer Scientists 2014 Vol II (IMECS 2014), Hong Kong: IMECS, pp. 3\u20136.","key":"S1351324918000384_ref041"},{"key":"S1351324918000384_ref044","first-page":"313","article-title":"Building a large annotated corpus of english: the penn treebank","volume":"19","author":"Marcus","year":"1993","journal-title":"Computational Linguistics"},{"doi-asserted-by":"publisher","key":"S1351324918000384_ref047","DOI":"10.1162\/coli_a_00039"},{"key":"S1351324918000384_ref028","first-page":"214","volume-title":"Proceedings of Corpus Linguistics 2001 Conference","author":"Gaizauskas","year":"2001"},{"key":"S1351324918000384_ref050","first-page":"788","volume-title":"Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)","author":"Miwa","year":"2010"},{"key":"S1351324918000384_ref051","first-page":"435","volume-title":"Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics","author":"Narayan","year":"2014"},{"volume-title":"Basic English: A General Introduction with Rules and Grammar","year":"1932","author":"Ogden","key":"S1351324918000384_ref052"},{"key":"S1351324918000384_ref053","first-page":"116","volume-title":"Proceedings of the 9th Brazilian Symposium in Information and Human Language Technology","author":"Paetzold","year":"2013"},{"volume-title":"A Comprehensive Grammar of the English Language","year":"1985","author":"Quirk","key":"S1351324918000384_ref055"},{"key":"S1351324918000384_ref058","first-page":"14:1","article-title":"Making it simplext: implementation and evaluation of a text simplification system for Spanish","volume":"6","author":"Saggion","year":"2015","journal-title":"ACM Transactions on Accessible Computing (TACCESS) \u2013 Special Issue on Speech and Language Processing for AT (Part 2)"},{"key":"S1351324918000384_ref061","first-page":"41","volume-title":"Proceedings of the Workshop on Automatic Text Simplification: Methods and Applications in the Multilingual Society","author":"Sheremetyeva","year":"2014"},{"unstructured":"Siddharthan A. 2004. Syntactic Simplification and Text Cohesion. Ph.d. thesis, University of Cambridge.","key":"S1351324918000384_ref062"},{"doi-asserted-by":"publisher","key":"S1351324918000384_ref063","DOI":"10.1007\/s11168-006-9011-1"},{"key":"S1351324918000384_ref066","first-page":"618","volume-title":"Proceedings of Recent Advances in Natural Language Processing (RANLP-2015)","author":"S\u0306tajner","year":"2015"},{"key":"S1351324918000384_ref067","first-page":"279","volume-title":"Proceedings of the 13th Conference on Natural Language Processing (KONVENS 2016)","author":"Suter","year":"2016"},{"key":"S1351324918000384_ref071","first-page":"344","volume-title":"Proceedings of the Association for Computational Linguistics: Human Language Technologies (ACL \u201908: HLT)","author":"Vickrey","year":"2008"},{"doi-asserted-by":"publisher","key":"S1351324918000384_ref072","DOI":"10.1007\/978-3-319-05476-6_4"},{"key":"S1351324918000384_ref074","first-page":"1015","volume-title":"Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics","author":"Wubben","year":"2012"},{"key":"S1351324918000384_ref077","first-page":"365","volume-title":"Proceedings of Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the ACL","author":"Yatskar","year":"2010"},{"key":"S1351324918000384_ref009","first-page":"722","volume-title":"Proceedings of the 12th National Conference on Artificial Intelligence","author":"Brill","year":"1994"},{"volume-title":"Efficient Parsing for Natural Language: A Fast Algorithm for Practical Systems","year":"1985","author":"Tomita","key":"S1351324918000384_ref069"},{"key":"S1351324918000384_ref076","doi-asserted-by":"crossref","first-page":"401","DOI":"10.1162\/tacl_a_00107","article-title":"Optimizing statistical machine translation for text simplification","volume":"4","author":"Xu","year":"2016","journal-title":"Transactions of the Association for Computational Linguistics"},{"key":"S1351324918000384_ref075","doi-asserted-by":"crossref","first-page":"283","DOI":"10.1162\/tacl_a_00139","article-title":"Problems in current text simplification research: new data can help","volume":"3","author":"Xu","year":"2015","journal-title":"Transactions of the Association for Computational Linguistics"},{"doi-asserted-by":"crossref","unstructured":"Mishra K. , Soni A. , Sharma R. , and Sharma D. 2014. Exploring the effects of sentence simplification on Hindi to English machine translation system In Proceedings of the Workshop on Automatic Text Simplification: Methods and Applications in the Multilingual Society, Dublin, Ireland: Association for Computational Linguistics, pp. 21\u20139.","key":"S1351324918000384_ref049","DOI":"10.3115\/v1\/W14-5603"},{"doi-asserted-by":"publisher","key":"S1351324918000384_ref013","DOI":"10.1017\/S0140525X99001788"},{"key":"S1351324918000384_ref073","first-page":"409","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing","author":"Woodsend","year":"2011"}],"container-title":["Natural Language Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.cambridge.org\/core\/services\/aop-cambridge-core\/content\/view\/S1351324918000384","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,9,4]],"date-time":"2022-09-04T23:51:02Z","timestamp":1662335462000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.cambridge.org\/core\/product\/identifier\/S1351324918000384\/type\/journal_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,10,31]]},"references-count":79,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2019,1]]}},"alternative-id":["S1351324918000384"],"URL":"https:\/\/doi.org\/10.1017\/s1351324918000384","relation":{},"ISSN":["1351-3249","1469-8110"],"issn-type":[{"type":"print","value":"1351-3249"},{"type":"electronic","value":"1469-8110"}],"subject":[],"published":{"date-parts":[[2018,10,31]]}}}