{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:33:32Z","timestamp":1750221212789,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":14,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,8,28]],"date-time":"2018-08-28T00:00:00Z","timestamp":1535414400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,8,28]]},"DOI":"10.1145\/3209280.3229103","type":"proceedings-article","created":{"date-parts":[[2018,10,2]],"date-time":"2018-10-02T12:09:29Z","timestamp":1538482169000},"page":"1-4","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Helmholtz Principle on word embeddings for automatic document segmentation"],"prefix":"10.1145","author":[{"given":"Dominik","family":"Krzemi\u0144ski","sequence":"first","affiliation":[{"name":"CUBRIC, Cardiff University, United Kingdom"}]},{"given":"Helen","family":"Balinsky","sequence":"additional","affiliation":[{"name":"Hewlett-Packard Laboratories, United Kingdom"}]},{"given":"Alexander","family":"Balinsky","sequence":"additional","affiliation":[{"name":"Cardiff School of Mathematics, Cardiff University, United Kingdom"}]}],"member":"320","published-online":{"date-parts":[[2018,8,28]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/1860559.1860624"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P14-1023"},{"key":"e_1_3_2_1_3_1","volume-title":"A Neural Probabilistic Language Model. J. Mach. Learn. Res. 3 (March","author":"Bengio Yoshua","year":"2003","unstructured":"Yoshua Bengio , R\u00e9jean Ducharme , Pascal Vincent , and Christian Janvin . 2003. 
A Neural Probabilistic Language Model. J. Mach. Learn. Res. 3 (March 2003), 1137--1155."},{"key":"e_1_3_2_1_4_1","volume-title":"Natural Language Processing with Python","author":"Bird Steven","unstructured":"Steven Bird, Ewan Klein, and Edward Loper. 2009. Natural Language Processing with Python (1st ed.). O'Reilly Media, Inc.","edition":"1"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.5555\/974305.974309"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/2644866.2644874"},{"key":"e_1_3_2_1_7_1","volume-title":"From Gestalt Theory to Image Analysis: A Probabilistic Approach","author":"Desolneux Agn\u00e8s","unstructured":"Agn\u00e8s Desolneux, Lionel Moisan, and Jean-Michel Morel. 2007. From Gestalt Theory to Image Analysis: A Probabilistic Approach (1st ed.). Springer Publishing Company, Incorporated.","edition":"1"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.5555\/1613715.1613760"},{"key":"e_1_3_2_1_9_1","unstructured":"J. R. Firth. 1957. A synopsis of linguistic theory 1930-55. Studies in Linguistic Analysis Special volume of the Philological Society (1957)."},{"key":"e_1_3_2_1_10_1","volume-title":"TextTiling: Segmenting text into multi-paragraph subtopic passages. Computational Linguistics","author":"Hearst Marti A.","year":"1997","unstructured":"Marti A. Hearst. 1997. 
TextTiling: Segmenting text into multi-paragraph subtopic passages. Computational Linguistics (1997), 33--64."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/1244002.1244140"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICTAI.2007.142"},{"key":"e_1_3_2_1_13_1","volume-title":"Efficient Estimation of Word Representations in Vector Space. CoRR abs\/1301.3781","author":"Mikolov Tomas","year":"2013","unstructured":"Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient Estimation of Word Representations in Vector Space. CoRR abs\/1301.3781 (2013). arXiv:1301.3781"},{"key":"e_1_3_2_1_14_1","volume-title":"Manning","author":"Pennington Jeffrey","year":"2014","unstructured":"Jeffrey Pennington, Richard Socher, and Christopher D. Manning. 2014. GloVe: Global Vectors for Word Representation. In Empirical Methods in Natural Language Processing (EMNLP). 
1532--1543."}],"event":{"name":"DocEng '18: ACM Symposium on Document Engineering 2018","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web","SIGDOC ACM Special Interest Group on Systems Documentation"],"location":"Halifax NS Canada","acronym":"DocEng '18"},"container-title":["Proceedings of the ACM Symposium on Document Engineering 2018"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3209280.3229103","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3209280.3229103","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T01:39:45Z","timestamp":1750210785000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3209280.3229103"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,8,28]]},"references-count":14,"alternative-id":["10.1145\/3209280.3229103","10.1145\/3209280"],"URL":"https:\/\/doi.org\/10.1145\/3209280.3229103","relation":{},"subject":[],"published":{"date-parts":[[2018,8,28]]},"assertion":[{"value":"2018-08-28","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}