{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:14:27Z","timestamp":1750306467066,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":25,"publisher":"ACM","license":[{"start":{"date-parts":[[2015,12,8]],"date-time":"2015-12-08T00:00:00Z","timestamp":1449532800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2015,12,8]]},"DOI":"10.1145\/2838931.2838940","type":"proceedings-article","created":{"date-parts":[[2015,11,23]],"date-time":"2015-11-23T13:24:48Z","timestamp":1448285088000},"page":"1-4","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Text segmentation and Chinese site search"],"prefix":"10.1145","author":[{"given":"Liyuan","family":"Zhou","sequence":"first","affiliation":[{"name":"NICTA &amp; ANU"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David","family":"Hawking","sequence":"additional","affiliation":[{"name":"Microsoft &amp; ANU"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Paul","family":"Thomas","sequence":"additional","affiliation":[{"name":"CSIRO &amp; ANU"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2015,12,8]]},"reference":[{"doi-asserted-by":"publisher","key":"e_1_3_2_1_1_1","DOI":"10.1145\/792550.792552"},{"key":"e_1_3_2_1_2_1","first-page":"78","article-title":"\u4e2d\u6587\u5206\u8bcd\u5bf9\u4e2d\u6587\u4fe1\u606f\u68c0\u7d22\u7cfb\u7edf\u6027\u80fd\u7684\u5f71\u54cd (Impact of Chinese Segmentation to Chinese Information Retrieval)","volume":"19","author":"Cao G.","year":"2003","unstructured":"G. Cao , P. He , G. Wu , and S. Nie . \u4e2d\u6587\u5206\u8bcd\u5bf9\u4e2d\u6587\u4fe1\u606f\u68c0\u7d22\u7cfb\u7edf\u6027\u80fd\u7684\u5f71\u54cd (Impact of Chinese Segmentation to Chinese Information Retrieval) . Computer Engineering and Applications , 19 : 78 -- 79 , 2003 . G. Cao, P. He, G. Wu, and S. Nie. \u4e2d\u6587\u5206\u8bcd\u5bf9\u4e2d\u6587\u4fe1\u606f\u68c0\u7d22\u7cfb\u7edf\u6027\u80fd\u7684\u5f71\u54cd (Impact of Chinese Segmentation to Chinese Information Retrieval). Computer Engineering and Applications, 19:78--79, 2003.","journal-title":"Computer Engineering and Applications"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_3_1","DOI":"10.1016\/S0306-4573(02)00079-1"},{"key":"e_1_3_2_1_4_1","first-page":"200704","volume-title":"China Sciencepaper Online","author":"Application Q. Fu.","year":"2007","unstructured":"Q. Fu. \u57fa\u4e8e\u641c\u7d22\u7edf\u8ba1\u6280\u672f\u4e2d\u6587\u5206\u8bcd\u7b97\u6cd5\u7684\u5e94\u7528\u7814\u7a76 ( Application of statistical techniques to Chinese word segmentation algorithm) . China Sciencepaper Online , 2007 . http:\/\/www.paper.edu.cn\/releasepaper\/content\/ 200704 - 200749 . Q. Fu. \u57fa\u4e8e\u641c\u7d22\u7edf\u8ba1\u6280\u672f\u4e2d\u6587\u5206\u8bcd\u7b97\u6cd5\u7684\u5e94\u7528\u7814\u7a76 (Application of statistical techniques to Chinese word segmentation algorithm). China Sciencepaper Online, 2007. http:\/\/www.paper.edu.cn\/releasepaper\/content\/200704-749."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_5_1","DOI":"10.3115\/1118824.1118828"},{"issue":"1","key":"e_1_3_2_1_6_1","first-page":"21","article-title":"\u6c49\u8bed\u5206\u8bcd\u5bf9\u4e2d\u6587\u641c\u7d22\u5f15\u64ce\u68c0\u7d22\u6027\u80fd\u7684\u5f71\u54cd (Influence of Chinese word segmentation on web information retrieval)","volume":"25","author":"Jin P.","year":"2006","unstructured":"P. Jin , Y. Liu , and S. Wang . \u6c49\u8bed\u5206\u8bcd\u5bf9\u4e2d\u6587\u641c\u7d22\u5f15\u64ce\u68c0\u7d22\u6027\u80fd\u7684\u5f71\u54cd (Influence of Chinese word segmentation on web information retrieval) . Journal of the China Society for Scientific and Technical Information , 25 ( 1 ): 21 -- 24 , 2006 . P. Jin, Y. Liu, and S. Wang. \u6c49\u8bed\u5206\u8bcd\u5bf9\u4e2d\u6587\u641c\u7d22\u5f15\u64ce\u68c0\u7d22\u6027\u80fd\u7684\u5f71\u54cd (Influence of Chinese word segmentation on web information retrieval). Journal of the China Society for Scientific and Technical Information, 25(1):21--24, 2006.","journal-title":"Journal of the China Society for Scientific and Technical Information"},{"key":"e_1_3_2_1_7_1","volume-title":"Proc. NTCIR","author":"Kang I.-S.","year":"2004","unstructured":"I.-S. Kang , S.-H. Na , and J.-H. Lee . Combination approaches in information retrieval: words vs. n-grams, and query translation vs. document translation . In Proc. NTCIR , 2004 . I.-S. Kang, S.-H. Na, and J.-H. Lee. Combination approaches in information retrieval: words vs. n-grams, and query translation vs. document translation. In Proc. NTCIR, 2004."},{"key":"e_1_3_2_1_8_1","first-page":"177","volume-title":"Proc. Int. Conf. Internet Information Retrieval","author":"Kim D.","year":"2005","unstructured":"D. Kim and S. Ming . Effectiveness of segmentation granularity and indexing units for worst case evaluation in Chinese information retrieval . Proc. Int. Conf. Internet Information Retrieval , pages 177 -- 180 , 2005 . D. Kim and S. Ming. Effectiveness of segmentation granularity and indexing units for worst case evaluation in Chinese information retrieval. Proc. Int. Conf. Internet Information Retrieval, pages 177--180, 2005."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_9_1","DOI":"10.1145\/278459.258531"},{"key":"e_1_3_2_1_10_1","first-page":"141","volume-title":"Proc. Empirical Methods in NLP","author":"Kwok K. L.","year":"1997","unstructured":"K. L. Kwok . Lexicon effects on Chinese information retrieval . In Proc. Empirical Methods in NLP , pages 141 -- 148 , 1997 . K. L. Kwok. Lexicon effects on Chinese information retrieval. In Proc. Empirical Methods in NLP, pages 141--8, 1997."},{"key":"e_1_3_2_1_11_1","first-page":"551","volume-title":"Proc. TREC-6","author":"Leong M.-K.","year":"1997","unstructured":"M.-K. Leong and H. Zhou . Preliminary qualitative analysis of segmented vs bigram indexing in Chinese . In Proc. TREC-6 , pages 551 -- 557 , 1997 . M.-K. Leong and H. Zhou. Preliminary qualitative analysis of segmented vs bigram indexing in Chinese. In Proc. TREC-6, pages 551--557, 1997."},{"issue":"3","key":"e_1_3_2_1_12_1","first-page":"80","article-title":"\u5f00\u6e90\u4e2d\u6587\u5206\u8bcd\u5668\u5728 web \u641c\u7d22\u5f15\u64ce\u4e2d\u7684\u5e94\u7528 (The application of open source Chinese tokenizer in web search engine)","volume":"34","author":"Liu X.","year":"2013","unstructured":"X. Liu , Y. Hu , and X. Ai . \u5f00\u6e90\u4e2d\u6587\u5206\u8bcd\u5668\u5728 web \u641c\u7d22\u5f15\u64ce\u4e2d\u7684\u5e94\u7528 (The application of open source Chinese tokenizer in web search engine) . Computer Engineering & Software , 34 ( 3 ): 80 -- 83 , 2013 . X. Liu, Y. Hu, and X. Ai. \u5f00\u6e90\u4e2d\u6587\u5206\u8bcd\u5668\u5728 web \u641c\u7d22\u5f15\u64ce\u4e2d\u7684\u5e94\u7528 (The application of open source Chinese tokenizer in web search engine). Computer Engineering & Software, 34(3):80--83, 2013.","journal-title":"Computer Engineering & Software"},{"issue":"10","key":"e_1_3_2_1_13_1","first-page":"2605","volume":"5","author":"Long S.","year":"2009","unstructured":"S. Long , Z. Zhao , and H. Tang . Overview on Chinese Segmentation Algorithm. Computer Knowledge and Technology , 5 ( 10 ): 2605 -- 2607 , 2009 . S. Long, Z. Zhao, and H. Tang. Overview on Chinese Segmentation Algorithm. Computer Knowledge and Technology, 5(10):2605--2607, 2009.","journal-title":"Overview on Chinese Segmentation Algorithm. Computer Knowledge and Technology"},{"key":"e_1_3_2_1_14_1","first-page":"130","volume-title":"Proc. NTCIR","author":"Luk R. W.","year":"2001","unstructured":"R. W. Luk , K.-F. Wong , and K.-L. Kwok . Hybrid term indexing: an evaluation . In Proc. NTCIR , pages 130 -- 136 , 2001 . R. W. Luk, K.-F. Wong, and K.-L. Kwok. Hybrid term indexing: an evaluation. In Proc. NTCIR, pages 130--136, 2001."},{"key":"e_1_3_2_1_15_1","volume-title":"Jun","author":"National Taiwan University.","year":"2000","unstructured":"National Taiwan University. Chinese Information Retrieval Benchmark version 1.0 (CIRB010). Web site , Jun 2000 . http:\/\/lips.lis.ntu.edu.tw\/cirb\/releases\/CIRB010.htm. National Taiwan University. Chinese Information Retrieval Benchmark version 1.0 (CIRB010). Web site, Jun 2000. http:\/\/lips.lis.ntu.edu.tw\/cirb\/releases\/CIRB010.htm."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_16_1","DOI":"10.1145\/243199.243270"},{"key":"e_1_3_2_1_17_1","first-page":"697","volume-title":"Proc. TREC-6","author":"Nie J. Y.","year":"1998","unstructured":"J. Y. Nie , J. P. Chevallet , and M. F. Bruandet . Between terms and words for European language IR and between words and bigrams for Chinese IR . In Proc. TREC-6 , pages 697 -- 710 , 1998 . J. Y. Nie, J. P. Chevallet, and M. F. Bruandet. Between terms and words for European language IR and between words and bigrams for Chinese IR. In Proc. TREC-6, pages 697--710, 1998."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_18_1","DOI":"10.1145\/355214.355235"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_19_1","DOI":"10.5555\/519452.830790"},{"key":"e_1_3_2_1_20_1","first-page":"175","volume-title":"Proc. AAAI Spring Symposium","author":"Palmer D.","year":"1997","unstructured":"D. Palmer and J. Burger . Chinese word segmentation and information retrieval . In Proc. AAAI Spring Symposium , pages 175 -- 178 , 1997 . D. Palmer and J. Burger. Chinese word segmentation and information retrieval. In Proc. AAAI Spring Symposium, pages 175--178, 1997."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_21_1","DOI":"10.3115\/1072228.1072376"},{"issue":"1","key":"e_1_3_2_1_22_1","first-page":"22","article-title":"\u6c49\u8bed\u81ea\u52a8\u5206\u8bcd\u7814\u7a76\u8bc4\u8ff0 (Chinese Automatic Segmentation Research Review)","volume":"3","author":"Sun M.","year":"2001","unstructured":"M. Sun , J. Zou , \u6c49\u8bed\u81ea\u52a8\u5206\u8bcd\u7814\u7a76\u8bc4\u8ff0 (Chinese Automatic Segmentation Research Review) . Contemporary Linguistics , 3 ( 1 ): 22 -- 32 , 2001 . M. Sun, J. Zou, et al. \u6c49\u8bed\u81ea\u52a8\u5206\u8bcd\u7814\u7a76\u8bc4\u8ff0 (Chinese Automatic Segmentation Research Review). Contemporary Linguistics, 3(1): 22--32, 2001.","journal-title":"Contemporary Linguistics"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_23_1","DOI":"10.1145\/1183614.1183632"},{"key":"e_1_3_2_1_24_1","first-page":"335","volume-title":"Proc. TREC-5","author":"Tong X.","year":"1997","unstructured":"X. Tong , C. Zhai , N. Millic Frayling , and D. A. Evans . Experiments on Chinese text indexing: CLARIT TREC-5 Chinese track report . In Proc. TREC-5 , pages 335 -- 339 , 1997 . X. Tong, C. Zhai, N. Millic Frayling, and D. A. Evans. Experiments on Chinese text indexing: CLARIT TREC-5 Chinese track report. In Proc. TREC-5, pages 335--339, 1997."},{"key":"e_1_3_2_1_26_1","volume-title":"Investigating indexing units for Chinese web information retrieval: Chinese word segmentation versus N-grams. Master's thesis","author":"Zhou L.","year":"2013","unstructured":"L. Zhou . Investigating indexing units for Chinese web information retrieval: Chinese word segmentation versus N-grams. Master's thesis , Australian National University , 2013 . L. Zhou. Investigating indexing units for Chinese web information retrieval: Chinese word segmentation versus N-grams. Master's thesis, Australian National University, 2013."}],"event":{"sponsor":["ACM Association for Computing Machinery","Univ. of Western Sydney University of Western Sydney","SIGIR ACM Special Interest Group on Information Retrieval"],"acronym":"ADCS '15","name":"ADCS '15: The 20th Australasian Document Computing Symposium","location":"Parramatta NSW Australia"},"container-title":["Proceedings of the 20th Australasian Document Computing Symposium"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2838931.2838940","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2838931.2838940","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T05:43:44Z","timestamp":1750225424000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2838931.2838940"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,12,8]]},"references-count":25,"alternative-id":["10.1145\/2838931.2838940","10.1145\/2838931"],"URL":"https:\/\/doi.org\/10.1145\/2838931.2838940","relation":{},"subject":[],"published":{"date-parts":[[2015,12,8]]},"assertion":[{"value":"2015-12-08","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}