{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,30]],"date-time":"2025-09-30T10:58:41Z","timestamp":1759229921858,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":44,"publisher":"ACM","license":[{"start":{"date-parts":[[2017,8,7]],"date-time":"2017-08-07T00:00:00Z","timestamp":1502064000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"ERC","award":["DMAP 680153"],"award-info":[{"award-number":["DMAP 680153"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2017,8,7]]},"DOI":"10.1145\/3077136.3080821","type":"proceedings-article","created":{"date-parts":[[2017,7,28]],"date-time":"2017-07-28T19:35:01Z","timestamp":1501270501000},"page":"385-394","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["On the Power Laws of Language"],"prefix":"10.1145","author":[{"given":"Flavio","family":"Chierichetti","sequence":"first","affiliation":[{"name":"Sapienza University, Rome, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ravi","family":"Kumar","sequence":"additional","affiliation":[{"name":"Google, Mountain View, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bo","family":"Pang","sequence":"additional","affiliation":[{"name":"Google, Mountain View, CA, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2017,8,7]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"crossref","unstructured":"IJsbrand Jan Aalbersberg. 1991. Posting compression in dynamic retrieval environments SIGIR. 72--81. IJsbrand Jan Aalbersberg. 1991. Posting compression in dynamic retrieval environments SIGIR. 72--81.","DOI":"10.1145\/122860.122868"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"crossref","unstructured":"IJsbrand Jan Aalbersberg. 1994. A document retrieval model based on term frequency ranks SIGIR. 163--172. IJsbrand Jan Aalbersberg. 1994. A document retrieval model based on term frequency ranks SIGIR. 163--172.","DOI":"10.1007\/978-1-4471-2099-5_17"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"crossref","unstructured":"Leif Azzopardi. 2009. Query Side Evaluation: An empirical analysis of effectiveness and effort SIGIR. 556--563. Leif Azzopardi. 2009. Query Side Evaluation: An empirical analysis of effectiveness and effort SIGIR. 556--563.","DOI":"10.1145\/1571941.1572037"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"crossref","unstructured":"Harald Baayen. 1991. A stochastic process for word frequency distributions ACL. 271--278. Harald Baayen. 1991. A stochastic process for word frequency distributions ACL. 271--278.","DOI":"10.3115\/981344.981379"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF00136980"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"crossref","unstructured":"L. Douglas Baker and Andrew Kachites McCallum. 1998. Distributional clustering of words for text classification SIGIR. 96--103. L. Douglas Baker and Andrew Kachites McCallum. 1998. Distributional clustering of words for text classification SIGIR. 96--103.","DOI":"10.1145\/290941.290970"},{"key":"e_1_3_2_1_7_1","volume-title":"Henzinger","author":"Bharat Krishna","year":"1998","unstructured":"Krishna Bharat and Monika R . Henzinger . 1998 . Improved algorithms for topic distillation in a hyperlinked environment SIGIR. 104--111. Krishna Bharat and Monika R. Henzinger. 1998. Improved algorithms for topic distillation in a hyperlinked environment SIGIR. 104--111."},{"volume-title":"Language and Representation in Information Retrieval","author":"Blair David C","key":"e_1_3_2_1_8_1","unstructured":"David C Blair . 1990. Language and Representation in Information Retrieval . Elsevier Science Publishers . David C Blair. 1990. Language and Representation in Information Retrieval. Elsevier Science Publishers."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0306-4573(01)00024-3"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0019-9958(67)90201-X"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1390334.1390360"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10791-006-9001-9"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"crossref","unstructured":"Soumen Chakrabarti Mukul Joshi and Vivek Tawde. 2001. Enhanced topic distillation using text markup tags and hyperlinks SIGIR. 208--216. Soumen Chakrabarti Mukul Joshi and Vivek Tawde. 2001. Enhanced topic distillation using text markup tags and hyperlinks SIGIR. 208--216.","DOI":"10.1145\/383952.383990"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"crossref","unstructured":"Flavio Chierichetti Ravi Kumar and Prabhakar Raghavan. 2009. Compressed web indexes. In WWW. 451--460. Flavio Chierichetti Ravi Kumar and Prabhakar Raghavan. 2009. Compressed web indexes. In WWW. 451--460.","DOI":"10.1145\/1526709.1526770"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.2004.830752"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511581274"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1005634925734"},{"key":"e_1_3_2_1_18_1","unstructured":"Jean-Baptiste Estoup. 1916. Gammes St\u00e9nographiques. Institut Stenographique de France. Jean-Baptiste Estoup. 1916. Gammes St\u00e9nographiques. Institut Stenographique de France."},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1098\/rspb.2004.2957"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.0335980100"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"crossref","unstructured":"Le Q. Ha P. Hanna D. W. Stewart and F. J. Smith. 2006. Reduced n-gram models for English and Chinese corpora COLING-ACL. 309--315. Le Q. Ha P. Hanna D. W. Stewart and F. J. Smith. 2006. Reduced n -gram models for English and Chinese corpora COLING-ACL. 309--315.","DOI":"10.3115\/1273073.1273113"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"crossref","unstructured":"Le Quan Ha E. I. Sicilia-Garcia Ji Ming and F. J. Smith. 2002. Extension of Zipf's law to words and phrases. In COLING. 1--6. Le Quan Ha E. I. Sicilia-Garcia Ji Ming and F. J. Smith. 2002. Extension of Zipf's law to words and phrases. In COLING. 1--6.","DOI":"10.3115\/1072228.1072345"},{"key":"e_1_3_2_1_24_1","unstructured":"Thomas Hofmann. 1999. Probabilistic Latent Semantic Analysis. In UAI. 289--296. Thomas Hofmann. 1999. Probabilistic Latent Semantic Analysis. In UAI. 289--296."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/312624.312649"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"crossref","unstructured":"Thorsten Joachims. 2001. A statistical learning model of text classification for support vector machines SIGIR. 128--136. Thorsten Joachims. 2001. A statistical learning model of text classification for support vector machines SIGIR. 128--136.","DOI":"10.1145\/383952.383974"},{"key":"e_1_3_2_1_27_1","volume-title":"Europarl: A parallel corpus for statistical machine translation. MT summit.","author":"Koehn Philipp","year":"2005","unstructured":"Philipp Koehn . 2005 . Europarl: A parallel corpus for statistical machine translation. MT summit. (2005). Philipp Koehn. 2005. Europarl: A parallel corpus for statistical machine translation. MT summit. (2005)."},{"key":"e_1_3_2_1_28_1","unstructured":"Andreas Krause and Andreas Zollmann 2002. Not so randomly typing monkeys - Rank-frequency behavior of natural and artificial languages. Algorithms for Information Networks - Project Report. (2002). Andreas Krause and Andreas Zollmann 2002. Not so randomly typing monkeys - Rank-frequency behavior of natural and artificial languages. Algorithms for Information Networks - Project Report. (2002)."},{"volume-title":"An informational theory of the statistical structure of language","author":"Mandelbrot Benoit","key":"e_1_3_2_1_29_1","unstructured":"Benoit Mandelbrot . 1953. An informational theory of the statistical structure of language . Communication Theory, W. Jackson (Ed.). Butterworths , London , 486--502. Benoit Mandelbrot. 1953. An informational theory of the statistical structure of language. Communication Theory, W. Jackson (Ed.). Butterworths, London, 486--502."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511809071"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.2307\/1419346"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1080\/15427951.2004.10129088"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1080\/00107510500052444"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1098\/rsif.2012.0491"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1016\/0306-4573(88)90021-0"},{"key":"e_1_3_2_1_37_1","volume-title":"Proceedings of the 4th Workshop on Very Large Corpora. 70--78","author":"Samuelsson Christer","year":"1996","unstructured":"Christer Samuelsson . 1996 . Relating Turing's Formula and Zipf's Law . In Proceedings of the 4th Workshop on Very Large Corpora. 70--78 . Christer Samuelsson. 1996. Relating Turing's Formula and Zipf's Law. In Proceedings of the 4th Workshop on Very Large Corpora. 70--78."},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/1835804.1835890"},{"key":"e_1_3_2_1_39_1","first-page":"83","article-title":"Note on a paper by L. G. Sathe. J. Indian","volume":"18","author":"Selberg Atle","year":"1954","unstructured":"Atle Selberg . 1954 . Note on a paper by L. G. Sathe. J. Indian Math. Soc., N. Ser. Vol. 18 (1954), 83 -- 87 . Atle Selberg. 1954. Note on a paper by L. G. Sathe. J. Indian Math. Soc., N. Ser. Vol. 18 (1954), 83--87.","journal-title":"Math. Soc., N. Ser."},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1975.10482469"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.2307\/2333389"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2004.03.006"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1098\/rstb.1925.0002"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.4159\/harvard.9780674434929"},{"volume-title":"The Psycho-Biology of Language: An Introduction to Dynamic Philology","author":"Zipf George K.","key":"e_1_3_2_1_45_1","unstructured":"George K. Zipf . 1935. The Psycho-Biology of Language: An Introduction to Dynamic Philology . Houghton Mifflin Company . George K. Zipf. 1935. The Psycho-Biology of Language: An Introduction to Dynamic Philology. Houghton Mifflin Company."},{"volume-title":"Human Behavior and the Principle of Least Effort","author":"Zipf George K.","key":"e_1_3_2_1_46_1","unstructured":"George K. Zipf . 1949. Human Behavior and the Principle of Least Effort . Addison-Wesley Press . George K. Zipf. 1949. Human Behavior and the Principle of Least Effort. Addison-Wesley Press."}],"event":{"name":"SIGIR '17: The 40th International ACM SIGIR conference on research and development in Information Retrieval","sponsor":["SIGIR ACM Special Interest Group on Information Retrieval"],"location":"Shinjuku Tokyo Japan","acronym":"SIGIR '17"},"container-title":["Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3077136.3080821","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3077136.3080821","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,24]],"date-time":"2025-06-24T18:42:04Z","timestamp":1750790524000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3077136.3080821"}},"subtitle":["Word Frequency Distributions"],"short-title":[],"issued":{"date-parts":[[2017,8,7]]},"references-count":44,"alternative-id":["10.1145\/3077136.3080821","10.1145\/3077136"],"URL":"https:\/\/doi.org\/10.1145\/3077136.3080821","relation":{},"subject":[],"published":{"date-parts":[[2017,8,7]]},"assertion":[{"value":"2017-08-07","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}