{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,14]],"date-time":"2026-02-14T00:56:53Z","timestamp":1771030613526,"version":"3.50.1"},"reference-count":23,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2021,10,3]],"date-time":"2021-10-03T00:00:00Z","timestamp":1633219200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100002783","name":"Ministry of Higher Education","doi-asserted-by":"publisher","award":["13.1902.21.0016"],"award-info":[{"award-number":["13.1902.21.0016"]}],"id":[{"id":"10.13039\/501100002783","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Ministry of Science and Higher Education of Russia","award":["13.1902.21.0016"],"award-info":[{"award-number":["13.1902.21.0016"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Entropy"],"abstract":"<jats:p>We consider the problems of the authorship of literary texts in the framework of the quantitative study of literature. This article proposes a methodology for authorship attribution of literary texts based on the use of data compressors. Unlike other methods, the suggested one gives a possibility to make statistically verified results. 
This method is used to solve two problems of attribution in Russian literature.<\/jats:p>","DOI":"10.3390\/e23101302","type":"journal-article","created":{"date-parts":[[2021,10,4]],"date-time":"2021-10-04T09:24:15Z","timestamp":1633339455000},"page":"1302","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Using Data Compression to Build a Method for Statistically Verified Attribution of Literary Texts"],"prefix":"10.3390","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7232-9644","authenticated-orcid":false,"given":"Boris","family":"Ryabko","sequence":"first","affiliation":[{"name":"Federal Research Center for Information and Computational Technologies of SB RAS, 630090 Novosibirsk, Russia"},{"name":"Department of Information Technologies, Novosibirsk State University, 630090 Novosibirsk, Russia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9765-9714","authenticated-orcid":false,"given":"Nadezhda","family":"Savina","sequence":"additional","affiliation":[{"name":"Department of Information Technologies, Novosibirsk State University, 630090 Novosibirsk, Russia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2021,10,3]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"449","DOI":"10.1007\/BF01829876","article-title":"Shakespeare vs. Fletcher: A Stylometric Analysis by Radial Basis Functions","volume":"29","author":"Lowe","year":"1995","journal-title":"Comput. Humanit."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Ryabko, B., Astola, J., and Malyutov, M. (2016). Compression-Based Methods of Statistical Analysis and Prediction of Time Series, Springer International Publishing.","DOI":"10.1007\/978-3-319-32253-7"},{"key":"ref_3","unstructured":"Khmelev, D.V. (2021, July 17). 
Classification and Mark Up of Texts Using Data Compression Methods. All about Data, Image and Video Compression. Available online: https:\/\/www.compression.ru\/download\/articles\/classif\/intro.html."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1016\/j.forsciint.2013.02.025","article-title":"Comparing compression models for authorship attribution","volume":"228","author":"Oliveira","year":"2013","journal-title":"Forensic Sci. Int."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"172","DOI":"10.1023\/A:1010478226705","article-title":"Text authorship attribution using letters and grammatical information","volume":"37","author":"Kukushkina","year":"2001","journal-title":"Probl. Inf. Transm."},{"key":"ref_6","unstructured":"Gorshkov, S., Nered, M., Ilyushin, E., and Namiot, D. (2019, July 02). Using Machine Learning Methods to Establish Program Authorship. International Journal of Open Information Technologies. No1. Available online: https:\/\/cyberleninka.ru\/article\/n\/using-machine-learning-methodsto-establish-program-authorship."},{"key":"ref_7","unstructured":"Marusenko, M.A., Bessonov, B.A., Bogdanova, L.M., and Myasoedova, N.E. (2001). Search for the Lost Author, Attribution Etudes, Sankt Petersburg University."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1002\/j.1538-7305.1948.tb01338.x","article-title":"A mathematical theory of communication","volume":"27","author":"Shannon","year":"1948","journal-title":"Bell Syst. Tech. J."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"375","DOI":"10.1162\/089120100561746","article-title":"Using compression models to segment Chinese text","volume":"26","author":"Teahan","year":"2000","journal-title":"Comput. 
Linguist."},{"key":"ref_10","first-page":"83","article-title":"Using compression-based language models for text categorization","volume":"Volume 13","author":"Teahan","year":"2003","journal-title":"Language Modeling for Information Retrieval"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"1523","DOI":"10.1109\/TIT.2005.844059","article-title":"Clustering by compression","volume":"51","author":"Cilibrasi","year":"2005","journal-title":"IEEE Trans. Inf. Theory"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1162\/0148926042728449","article-title":"Algorithmic clustering of music based on string compression","volume":"28","author":"Cilibrasi","year":"2004","journal-title":"Comput. Music"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"294","DOI":"10.1134\/S0032946017030115","article-title":"Information-Theoretic Method for Classification of Texts","volume":"53","author":"Ryabko","year":"2017","journal-title":"Probl. Inf. Transm."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Ryabko, B.Y., Guskov, A.E., and Selivanova, I.V. (2017, January 25\u201330). Using data-compressors for statistical analysis of problems on homogeneity testing and classification. Proceedings of the 2017 IEEE International Symposium on Information Theory (ISIT), Aachen, Germany.","DOI":"10.1109\/ISIT.2017.8006502"},{"key":"ref_15","unstructured":"Amlinsky, I. (2013). 12 Chairs from Mikhail Bulgakov, Kirschner Verlag. (In Russian)."},{"key":"ref_16","unstructured":"Kozarovetskiy, V.A., and Moscow Baranki and Odessa Bubliki (2018, October 10). Who Wrote \u201c12 Chairs\u201d Literary Russia, No. 41. Available online: http:\/\/old.litrossia.ru\/2013\/41\/08347.html."},{"key":"ref_17","unstructured":"Khmelnitsky, D.S. (2019, March 18). In Defense of Ilf and Petrov Seven Arts. v. 52. Available online: http:\/\/7iskusstv.com\/2014\/Nomer5\/Chmelnicky1.php."},{"key":"ref_18","unstructured":"Freidgeym, L.I. (2019, March 18). 
Ilf and Petrov or Bulgakov \u2026 Round Table (Virtual Version) Seven Arts, v. 47. Available online: http:\/\/7iskusstv.com\/2013\/Nomer11\/Frejdgejm1.php."},{"key":"ref_19","unstructured":"Bessonov, B.L. (1978). On the Authorship of the Novel \u201cThree Countries of the World\u201d, Nauka. (In Russian)."},{"key":"ref_20","unstructured":"Panaeva, A.Y. (2019, March 18). Memories. Available online: http:\/\/az.lib.ru\/p\/panaewa."},{"key":"ref_21","unstructured":"Kendall, M., and Stjuart, A. (1961). The Advanced Theory of Statistics, Inference and Relationship."},{"key":"ref_22","unstructured":"Lansky, J., and Zemlicka, M. (2005, January 13\u201315). Text compression: Syllables. Proceedings of the Dateso 2005 Annual International Workshop on DAtabases, TExts, Specifications and Objects, Desna, Czech Republic."},{"key":"ref_23","unstructured":"Nekrasov, N.A. (1981). The Complete Collection of Works and Letters: In 15 Vols. Artistic Works, Nauka."}],"container-title":["Entropy"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1099-4300\/23\/10\/1302\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T07:09:03Z","timestamp":1760166543000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1099-4300\/23\/10\/1302"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,3]]},"references-count":23,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2021,10]]}},"alternative-id":["e23101302"],"URL":"https:\/\/doi.org\/10.3390\/e23101302","relation":{},"ISSN":["1099-4300"],"issn-type":[{"value":"1099-4300","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,10,3]]}}}