{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,25]],"date-time":"2026-02-25T18:32:52Z","timestamp":1772044372772,"version":"3.50.1"},"reference-count":41,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2016,9,21]],"date-time":"2016-09-21T00:00:00Z","timestamp":1474416000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Hong Kong RGC Project","award":["N_HKUST637\/13"],"award-info":[{"award-number":["N_HKUST637\/13"]}]},{"name":"Microsoft Research Asia Fellowship","award":["2012"],"award-info":[{"award-number":["2012"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61502021, 61328202, and 61532004"],"award-info":[{"award-number":["61502021, 61328202, and 61532004"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"National Grand Fundamental Research 973 Program of China","award":["2014CB340304"],"award-info":[{"award-number":["2014CB340304"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Inf. Syst."],"published-print":{"date-parts":[[2017,4,30]]},"abstract":"<jats:p>\n            Today, major commercial search engines are operating in a multinational fashion to provide web search services for millions of users who compose search queries by different languages. Hence, the search engine query log, which serves as the backbone of many search engine applications, records millions of users\u2019 search history in a wide spectrum of human languages and demonstrates a strong multilingual phenomenon. However, with its salience, the multilingual nature of a search engine query log is usually ignored by existing works, which usually consider query log entries of different languages as being orthogonal and independent. This kind of oversimplified assumption heavily distorts the underlying structure of web search data. In this article, we pioneer in recognition of the multilingual nature of a query log and make the first attempt to cross the language barrier in query logs. We propose a novel model named\n            <jats:italic>Cross-Lingual Query Log Topic Model<\/jats:italic>\n            (CL-QLTM) to analyze query logs from a cross-lingual perspective and derive the latent topics of web search data. The CL-QLTM comprehensively integrates web search data in different languages by collectively utilizing cross-lingual dictionaries, as well as the co-occurrence relations in the query log. In order to relieve the efficiency bottleneck of applying the CL-QLTM on voluminous query logs, we propose an efficient parameter inference algorithm based on the MapReduce computing paradigm. Both qualitative and quantitative experimental results show that the CL-QLTM is able to effectively derive cross-lingual topics from multilingual query logs and spawn a wide spectrum of new search engine applications.\n          <\/jats:p>","DOI":"10.1145\/2956235","type":"journal-article","created":{"date-parts":[[2016,9,21]],"date-time":"2016-09-21T12:42:46Z","timestamp":1474461766000},"page":"1-28","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":14,"title":["Cross-Lingual Topic Discovery From Multilingual Search Engine Query Log"],"prefix":"10.1145","volume":"35","author":[{"given":"Di","family":"Jiang","sequence":"first","affiliation":[{"name":"SKLSDE Lab, Beihang University, China and Baidu, Beijing, China"}]},{"given":"Yongxin","family":"Tong","sequence":"additional","affiliation":[{"name":"SKLSDE Lab, Beihang University, China"}]},{"given":"Yuanfeng","family":"Song","sequence":"additional","affiliation":[{"name":"Baidu, Beijing, China"}]}],"member":"320","published-online":{"date-parts":[[2016,9,21]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Vamshi Ambati and U. Rohini. 2006. Using monolingual clickthrough data to build cross-lingual search systems. New Directions in Multilingual Information Access (2006) 28.  Vamshi Ambati and U. Rohini. 2006. Using monolingual clickthrough data to build cross-lingual search systems. New Directions in Multilingual Information Access (2006) 28."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1143844.1143859"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.5555\/944919.944937"},{"key":"e_1_2_1_4_1","volume-title":"Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence. AUAI Press, 75--82","author":"Boyd-Graber Jordan","year":"2009","unstructured":"Jordan Boyd-Graber and David M. Blei . 2009. Multilingual topic models for unaligned text . In Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence. AUAI Press, 75--82 , 2009 . Jordan Boyd-Graber and David M. Blei. 2009. Multilingual topic models for unaligned text. In Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence. AUAI Press, 75--82, 2009."},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/1871437.1871745"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1327452.1327492"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.5555\/2390524.2390599"},{"key":"e_1_2_1_8_1","volume-title":"Xing","author":"Fukumasu Kosuke","year":"2012","unstructured":"Kosuke Fukumasu , Koji Eguchi , and Eric P . Xing . 2012 . Symmetric correspondence topic models for multilingual text analysis. In Advances in Neural Information Processing Systems . 1286--1294, 2012. Kosuke Fukumasu, Koji Eguchi, and Eric P. Xing. 2012. Symmetric correspondence topic models for multilingual text analysis. In Advances in Neural Information Processing Systems. 1286--1294, 2012."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/1277741.1277821"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.0307752101"},{"key":"e_1_2_1_11_1","volume-title":"Proceedings of the Workshop on Query Log Analysis at the 16th International Conference on World Wide Web. ACM","author":"Grimes Carrie","year":"2007","unstructured":"Carrie Grimes , Diane Tang , and Daniel M. Russell . 2007. Query logs alone are not enough . In Proceedings of the Workshop on Query Log Analysis at the 16th International Conference on World Wide Web. ACM , 2007 . Carrie Grimes, Diane Tang, and Daniel M. Russell. 2007. Query logs alone are not enough. In Proceedings of the Workshop on Query Log Analysis at the 16th International Conference on World Wide Web. ACM, 2007."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1109\/42.24868"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.5555\/599609.599631"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1645953.1645966"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-12275-0_39"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/2566486.2567965"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11280-015-0336-2"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2015.07.014"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/2396761.2398414"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2452376.2452420"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2014.6816668"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2015.03.020"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/564376.564408"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2010.5447911"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.5555\/1699571.1699627"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.5555\/1577069.1755845"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/1526709.1526904"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/2766462.2767713"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2484028.2484052"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2600428.2609595"},{"key":"e_1_2_1_31_1","volume-title":"Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence. AUAI Press, 487--494","author":"Rosen-Zvi M.","year":"2004","unstructured":"M. Rosen-Zvi , T. Griffiths , M. Steyvers , and P. Smyth . 2004. The author-topic model for authors and documents . In Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence. AUAI Press, 487--494 , 2004 . M. Rosen-Zvi, T. Griffiths, M. Steyvers, and P. Smyth. 2004. The author-topic model for authors and documents. In Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence. AUAI Press, 487--494, 2004."},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.5555\/2002736.2002832"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2014.08.003"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-36973-5_9"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1040"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/1460027.1460046"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/1150402.1150450"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/1963405.1963443"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/2187836.2187955"},{"key":"e_1_2_1_40_1","volume-title":"Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 1128--1137","author":"Zhang Duo","year":"2010","unstructured":"Duo Zhang , Qiaozhu Mei , and ChengXiang Zhai . 2010 . Cross-lingual latent topic extraction . In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 1128--1137 , 2010. Duo Zhang, Qiaozhu Mei, and ChengXiang Zhai. 2010. Cross-lingual latent topic extraction. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 1128--1137, 2010."},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.3115\/1119250.1119280"}],"container-title":["ACM Transactions on Information Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2956235","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2956235","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T03:39:43Z","timestamp":1750217983000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2956235"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,9,21]]},"references-count":41,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2017,4,30]]}},"alternative-id":["10.1145\/2956235"],"URL":"https:\/\/doi.org\/10.1145\/2956235","relation":{},"ISSN":["1046-8188","1558-2868"],"issn-type":[{"value":"1046-8188","type":"print"},{"value":"1558-2868","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,9,21]]},"assertion":[{"value":"2015-10-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2016-06-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2016-09-21","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}