{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,1]],"date-time":"2026-03-01T10:47:24Z","timestamp":1772362044858,"version":"3.50.1"},"reference-count":28,"publisher":"Cambridge University Press (CUP)","issue":"5","license":[{"start":{"date-parts":[[2017,2,21]],"date-time":"2017-02-21T00:00:00Z","timestamp":1487635200000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/www.cambridge.org\/core\/terms"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Nat. Lang. Eng."],"published-print":{"date-parts":[[2017,9]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Newspaper text can be broadly divided in the classes \u2018opinion\u2019 (editorials, commentary, letters to the editor) and \u2018neutral\u2019 (reports). We describe a classification system for performing this separation, which uses a set of linguistically motivated features. Working with various English newspaper corpora, we demonstrate that it significantly outperforms bag-of-lemma and PoS-tag models. We conclude that the linguistic features constitute the best method for achieving robustness against change of newspaper or domain.<\/jats:p>","DOI":"10.1017\/s1351324917000043","type":"journal-article","created":{"date-parts":[[2017,2,21]],"date-time":"2017-02-21T14:56:45Z","timestamp":1487689005000},"page":"687-707","source":"Crossref","is-referenced-by-count":13,"title":["Classifying news versus opinions in newspapers: Linguistic features for domain independence"],"prefix":"10.1017","volume":"23","author":[{"given":"K. R.","family":"KR\u00dcGER","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"A.","family":"LUKOWIAK","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"J.","family":"SONNTAG","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"S.","family":"WARZECHA","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"M.","family":"STEDE","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"56","published-online":{"date-parts":[[2017,2,21]]},"reference":[{"key":"S1351324917000043_ref022","unstructured":"Santini M. 2007. Automatic Identification of Genre in Web Pages. PhD thesis, University of Brighton, UK."},{"key":"S1351324917000043_ref016","volume-title":"Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference","author":"Pearl","year":"1988"},{"key":"S1351324917000043_ref017","doi-asserted-by":"publisher","DOI":"10.1162\/COLI_a_00052"},{"key":"S1351324917000043_ref007","volume-title":"Proceedings of the Workshop on Computational Approaches to Style Analysis and Synthesis at the International Joint Conference on Artificial Intelligence (IJCAI 2003)","author":"Finn","year":"2003"},{"key":"S1351324917000043_ref014","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P14-5010"},{"key":"S1351324917000043_ref012","doi-asserted-by":"publisher","DOI":"10.3115\/976909.979622"},{"key":"S1351324917000043_ref010","doi-asserted-by":"publisher","DOI":"10.1002\/9781118548387"},{"key":"S1351324917000043_ref019","unstructured":"Platt J. 1998. Sequential minimal optimization: a fast algorithm for training support vector machines. Technical Report msr-tr-98-14, Microsoft Research."},{"key":"S1351324917000043_ref008","doi-asserted-by":"publisher","DOI":"10.1145\/1164820.1164829"},{"key":"S1351324917000043_ref020","first-page":"2961","volume-title":"Proceedings of the 6th Conference on International Language Resources and Evaluation (LREC 2008)","author":"Prasad","year":"2008"},{"key":"S1351324917000043_ref002","volume-title":"Natural Language Processing with Python","author":"Bird","year":"2009"},{"key":"S1351324917000043_ref006","first-page":"4781","volume-title":"Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing","author":"Feldman","year":"2009"},{"key":"S1351324917000043_ref001","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511814358"},{"key":"S1351324917000043_ref005","first-page":"417","volume-title":"Proceedings of the 5th Conference on Language Resources and Evaluation (LREC 2006)","author":"Esuli","year":"2006"},{"key":"S1351324917000043_ref027","doi-asserted-by":"publisher","DOI":"10.3115\/1220575.1220619"},{"key":"S1351324917000043_ref026","doi-asserted-by":"publisher","DOI":"10.1162\/0891201041850885"},{"key":"S1351324917000043_ref023","first-page":"3063","volume-title":"Proceedings of the 7th Conference on International Language Resources and Evaluation (LREC 2010)","author":"Sharoff","year":"2010"},{"key":"S1351324917000043_ref013","doi-asserted-by":"publisher","DOI":"10.1109\/MASSP.1987.1165576"},{"key":"S1351324917000043_ref003","volume-title":"BLLIP 1987-89 WSJ Corpus Release 1 LDC2000T43","author":"Charniak","year":"2000"},{"key":"S1351324917000043_ref004","first-page":"449","volume-title":"Proceedings of the 5th Conference on International Language Resources and Evaluation (LREC 2006)","author":"de Marneffe","year":"2006"},{"key":"S1351324917000043_ref011","doi-asserted-by":"publisher","DOI":"10.3115\/991250.991324"},{"key":"S1351324917000043_ref018","unstructured":"Plank B. 2011. Corresponding genre sets based on the meta-data found in ACL\/DCI corpus. http:\/\/www.let.rug.nl\/~bplank\/metadata\/genre_files_updated.html. Accessed 2016-07-01."},{"key":"S1351324917000043_ref025","first-page":"674","volume-title":"Proceedings of the 47th Annual Meeting of the Association for Computational Linguistics (ACL 2009) and the 4th International Joint Conference on Natural Language Processing of the AFNLP","author":"Webber","year":"2009"},{"key":"S1351324917000043_ref024","first-page":"89","volume-title":"Proceedings of the D\u00c9fi Fouille de Textes (DEFT 2009) Text Mining Challenge","author":"Toprak","year":"2009"},{"key":"S1351324917000043_ref028","doi-asserted-by":"publisher","DOI":"10.3115\/1119355.1119372"},{"key":"S1351324917000043_ref009","doi-asserted-by":"publisher","DOI":"10.1145\/1656274.1656278"},{"key":"S1351324917000043_ref021","volume-title":"The New York Times Annotated Corpus LDC2008T19","author":"Sandhaus","year":"2008"},{"key":"S1351324917000043_ref015","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1613\/jair.453","article-title":"Cached sufficient statistics for efficient machine learning with large datasets","volume":"8","author":"Moore","year":"1998","journal-title":"Journal of Artificial Intelligence Research"}],"container-title":["Natural Language Engineering"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.cambridge.org\/core\/services\/aop-cambridge-core\/content\/view\/S1351324917000043","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,4,17]],"date-time":"2019-04-17T00:44:22Z","timestamp":1555461862000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.cambridge.org\/core\/product\/identifier\/S1351324917000043\/type\/journal_article"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,2,21]]},"references-count":28,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2017,9]]}},"alternative-id":["S1351324917000043"],"URL":"https:\/\/doi.org\/10.1017\/s1351324917000043","relation":{},"ISSN":["1351-3249","1469-8110"],"issn-type":[{"value":"1351-3249","type":"print"},{"value":"1469-8110","type":"electronic"}],"subject":[],"published":{"date-parts":[[2017,2,21]]}}}