{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,17]],"date-time":"2026-03-17T23:26:43Z","timestamp":1773790003198,"version":"3.50.1"},"publisher-location":"Amsterdam","reference-count":43,"publisher":"John Benjamins Publishing Company","isbn-type":[{"value":"9789027210104","type":"print"},{"value":"9789027258380","type":"electronic"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,12,22]]},"abstract":"<jats:title>Abstract<\/jats:title>\n<jats:p>We present a study of the distinctiveness of random and non-random texts based on text characteristics of quantitative linguistics. We additionally experiment with text features that evaluate contiguity associations among sentences by means of BERT (Bidirectional Encoder Representations from Transformers). To this end, we experiment with generative models for random texts as currently discussed in the context of neural networks. The chapter contributes to the clarification of deficits of existing random text models and of the informativeness of quantitative text features.<\/jats:p>","DOI":"10.1075\/cilt.356.10kon","type":"book-chapter","created":{"date-parts":[[2021,11,8]],"date-time":"2021-11-08T14:37:53Z","timestamp":1636382273000},"page":"145-162","source":"Crossref","is-referenced-by-count":1,"title":["From distinguishability to informativity"],"prefix":"10.1075","author":[{"given":"Maxim","family":"Konca","sequence":"first","affiliation":[{"id":[{"id":"https:\/\/ror.org\/04cvxnb49","id-type":"ROR","asserted-by":"publisher"}],"name":"Goethe University Frankfurt"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alexander","family":"Mehler","sequence":"additional","affiliation":[{"id":[{"id":"https:\/\/ror.org\/04cvxnb49","id-type":"ROR","asserted-by":"publisher"}],"name":"Goethe University Frankfurt"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Daniel","family":"Baumartz","sequence":"additional","affiliation":[{"id":[{"id":"https:\/\/ror.org\/04cvxnb49","id-type":"ROR","asserted-by":"publisher"}],"name":"Goethe University Frankfurt"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wahed","family":"Hemati","sequence":"additional","affiliation":[{"id":[{"id":"https:\/\/ror.org\/04cvxnb49","id-type":"ROR","asserted-by":"publisher"}],"name":"Goethe University Frankfurt"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1757","published-online":{"date-parts":[[2021,11,8]]},"reference":[{"key":"CIT0185","volume-title":"Wiederholungen in Texten","author":"Altmann","year":"1988"},{"key":"CIT0186","doi-asserted-by":"publisher","DOI":"10.1093\/llc\/11.3.121"},{"key":"CIT0187","article-title":"Neural machine translation by jointly learning to align and translate","author":"Bahdanau","year":"2014","journal-title":"arXiv preprint arXiv:1409.0473"},{"key":"CIT0188","first-page":"1137","article-title":"A neural probabilistic language model","volume":"3","author":"Bengio","year":"2003","journal-title":"Journal of Machine Learning Research"},{"key":"CIT0189","first-page":"105","article-title":"A random text model for the generation of statistical language invariants","volume-title":"Human language technologies 2007: The conference of the North American chapter of the association for computational linguistics; proceedings of the main conference","author":"Biemann","year":"2007"},{"key":"CIT0190","doi-asserted-by":"publisher","DOI":"10.1145\/130385.130401"},{"key":"CIT0191","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010933404324"},{"key":"CIT0192","doi-asserted-by":"publisher","DOI":"10.1016\/j.envsoft.2006.10.004"},{"key":"CIT0193","doi-asserted-by":"publisher","DOI":"10.1515\/9783110362879-006"},{"key":"CIT0194","first-page":"4","article-title":"Methods of analysis of a thematic concentration of the text","volume":"3","author":"\u010cech","year":"2013","journal-title":"Czech and Slovak Linguistic Review"},{"key":"CIT0195","doi-asserted-by":"publisher","DOI":"10.1145\/1961189.1961199"},{"key":"CIT0196","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D16-1053"},{"key":"CIT0197","doi-asserted-by":"publisher","DOI":"10.1177\/001316446002000104"},{"key":"CIT0198","article-title":"Bert: Pre-training of deep bidirectional transformers for language understanding","author":"Devlin","year":"2018","journal-title":"arXiv preprint arXiv:1810.04805"},{"issue":"4","key":"CIT0199","first-page":"337","article-title":"A summary on entropy statistics","volume":"31","author":"Esteban","year":"1995","journal-title":"Kybernetika"},{"key":"CIT0200","first-page":"1301","article-title":"Overcoming the brittleness bottleneck using Wikipedia: Enhancing text categorization with encyclopedic knowledge","volume-title":"Proceedings of the twenty-first national conference on artificial intelligence","author":"Gabrilovich","year":"2006"},{"key":"CIT0201","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.0507655102"},{"key":"CIT0202","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"CIT0203","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4615-0907-3"},{"key":"CIT0204","volume-title":"Quita. Quantitative Index Text Analyzer","author":"Kub\u00e1t","year":"2014"},{"key":"CIT0205","doi-asserted-by":"publisher","DOI":"10.2307\/1932674"},{"key":"CIT0206","first-page":"325","article-title":"Eigenschaften der textuellen Einheiten und Systeme [Properties of textual units and systems]","volume-title":"Quantitative linguistik. ein internationales handbuch \/ quantitative linguistics. An international handbook","author":"Mehler","year":"2005"},{"issue":"2","key":"CIT0207","doi-asserted-by":"crossref","first-page":"51","DOI":"10.21248\/jlcl.22.2007.95","article-title":"Structural classifiers of text types: Towards a novel model of text representation","volume":"22","author":"Mehler","year":"2007","journal-title":"Journal for Language Technology and Computational Linguistics (JLCL)"},{"key":"CIT0208","first-page":"149","article-title":"VienNA: Auf dem Weg zu einer Infrastruktur f\u00fcr die verteilte interaktive evolution\u00e4re Verarbeitung nat\u00fcrlicher Sprache","volume-title":"Forschungsinfrastrukturen und digitale Informationssysteme in der germanistischen Sprachwissenschaft","author":"Mehler","year":"2018"},{"key":"CIT0209","doi-asserted-by":"publisher","DOI":"10.1515\/9783110573565-016"},{"key":"CIT0210","article-title":"Unrolled generative adversarial networks","author":"Metz","year":"2016","journal-title":"arXiv preprint arXiv:1611.02163"},{"key":"CIT0211","doi-asserted-by":"publisher","DOI":"10.1080\/00401706.1991.10484804"},{"key":"CIT0212","first-page":"383","article-title":"On spectral analysis with missing observations and amplitude modulation","author":"Parzen","year":"1963","journal-title":"Sankhy\u0101: The Indian Journal of Statistics, Series A"},{"key":"CIT0213","volume-title":"Word frequency studies","author":"Popescu","year":"2009"},{"key":"CIT0214","first-page":"23","article-title":"Some aspects of word frequencies","volume":"13","author":"Popescu","year":"2006","journal-title":"Glottometrics"},{"key":"CIT0215","first-page":"71","article-title":"Writer\u2019s view of text generation","volume":"15","author":"Popescu","year":"2007","journal-title":"Glottometrics"},{"key":"CIT0216","first-page":"110","article-title":"Thematic concentration in texts","volume":"2","author":"Popescu","year":"2011","journal-title":"Issues in quantitative linguistics"},{"key":"CIT0217","volume-title":"The lambda-structure of texts","author":"Popescu","year":"2011"},{"issue":"8","key":"CIT0218","first-page":"9","article-title":"Language models are unsupervised multitask learners","volume":"1","author":"Radford","year":"2019","journal-title":"OpenAI Blog"},{"key":"CIT0219","doi-asserted-by":"publisher","DOI":"10.1017\/S1351324997001502"},{"key":"CIT0220","doi-asserted-by":"publisher","DOI":"10.1016\/S0010-4655(02)00280-1"},{"key":"CIT0221","doi-asserted-by":"publisher","DOI":"10.1016\/j.cpc.2009.09.018"},{"key":"CIT0222","doi-asserted-by":"publisher","DOI":"10.1023\/B:STCO.0000035301.49549.88"},{"key":"CIT0223","doi-asserted-by":"publisher","DOI":"10.1016\/S0378-4754(00)00270-6"},{"key":"CIT0224","doi-asserted-by":"publisher","DOI":"10.1214\/009053607000000505"},{"key":"CIT0225","first-page":"5998","article-title":"Attention is all you need","author":"Vaswani","year":"2017","journal-title":"Advances in neural information processing systems"},{"key":"CIT0226","first-page":"361","article-title":"The type-token-relation","volume-title":"Quantitative Linguistik: Ein internationales Handbuch","author":"Wimmer","year":"2005"},{"key":"CIT0227","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.11"}],"container-title":["Current Issues in Linguistic Theory","Language and Text"],"original-title":[],"language":"en","deposited":{"date-parts":[[2025,7,2]],"date-time":"2025-07-02T18:09:09Z","timestamp":1751479749000},"score":1,"resource":{"primary":{"URL":"https:\/\/benjamins.com\/catalog\/cilt.356.10kon"},"secondary":[{"URL":"https:\/\/www.degruyter.com\/document\/doi\/10.1075\/cilt.356.10kon\/html","label":"DeGruyter"}]},"subtitle":["A quantitative text model for detecting random texts"],"short-title":[],"issued":{"date-parts":[[2021,11,8]]},"ISBN":["9789027210104","9789027258380"],"references-count":43,"URL":"https:\/\/doi.org\/10.1075\/cilt.356.10kon","archive":["Portico"],"relation":{},"ISSN":["0304-0763"],"issn-type":[{"value":"0304-0763","type":"print"}],"subject":[],"published":{"date-parts":[[2021,11,8]]}}}