{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,10,19]],"date-time":"2022-10-19T07:06:05Z","timestamp":1666163165017},"reference-count":18,"publisher":"World Scientific Pub Co Pte Lt","issue":"02","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Patt. Recogn. Artif. Intell."],"published-print":{"date-parts":[[2015,3]]},"abstract":"<jats:p> Cross-lingual document clustering is the task of automatically organizing a large collection of multi-lingual documents into a few clusters, depending on their content or topic. It is well known that language barrier and translation ambiguity are two challenging issues for cross-lingual document representation. To this end, we propose to represent cross-lingual documents through statistical word senses, which are automatically discovered from a parallel corpus through a novel cross-lingual word sense induction model and a sense clustering method. In particular, the former consists in a sense-based vector space model and the latter leverages on a sense-based latent Dirichlet allocation. Evaluation on the benchmarking datasets shows that the proposed models outperform two state-of-the-art methods for cross-lingual document clustering. <\/jats:p>","DOI":"10.1142\/s021800141559003x","type":"journal-article","created":{"date-parts":[[2014,10,24]],"date-time":"2014-10-24T05:27:16Z","timestamp":1414128436000},"page":"1559003","source":"Crossref","is-referenced-by-count":6,"title":["Document Representation with Statistical Word Senses in Cross-Lingual Document Clustering"],"prefix":"10.1142","volume":"29","author":[{"given":"Guoyu","family":"Tang","sequence":"first","affiliation":[{"name":"Department of Computer Science and Technology, TNList, Tsinghua University, Beijing 100084, P. R. China"}]},{"given":"Yunqing","family":"Xia","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Technology, TNList, Tsinghua University, Beijing 100084, P. R. China"}]},{"given":"Erik","family":"Cambria","sequence":"additional","affiliation":[{"name":"School of Computer Engineering, Nanyang Technological University, 50 Nanyang Avenue, Singapore 639798, Singapore"}]},{"given":"Peng","family":"Jin","sequence":"additional","affiliation":[{"name":"School of Computer Science, Leshan Normal University, Leshan 614000, P. R. China"}]},{"given":"Thomas Fang","family":"Zheng","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Technology, TNList, Tsinghua University, Beijing 100084, P. R. China"}]}],"member":"219","published-online":{"date-parts":[[2015,2,27]]},"reference":[{"key":"rf3","first-page":"993","volume":"3","author":"Blei D. M.","year":"2003","journal-title":"J. Mach. Lear. Res."},{"key":"rf6","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2014.01.064"},{"key":"rf7","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-04391-8_33"},{"key":"rf11","doi-asserted-by":"publisher","DOI":"10.1142\/S0218213007003369"},{"key":"rf13","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-010-0367-z"},{"key":"rf15","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.0307752101"},{"key":"rf17","doi-asserted-by":"publisher","DOI":"10.1109\/TFUZZ.2010.2065811"},{"key":"rf18","doi-asserted-by":"publisher","DOI":"10.1177\/0165551511404867"},{"key":"rf20","doi-asserted-by":"publisher","DOI":"10.1037\/0033-295X.104.2.211"},{"key":"rf23","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2006.11.005"},{"key":"rf24","doi-asserted-by":"publisher","DOI":"10.1007\/s10791-010-9141-9"},{"key":"rf26","doi-asserted-by":"publisher","DOI":"10.1142\/S0218213001000398"},{"key":"rf30","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2009.09.007"},{"key":"rf32","doi-asserted-by":"publisher","DOI":"10.1145\/361219.361220"},{"key":"rf40","doi-asserted-by":"publisher","DOI":"10.1198\/016214506000000302"},{"key":"rf41","doi-asserted-by":"publisher","DOI":"10.1016\/j.dss.2007.07.008"},{"key":"rf43","doi-asserted-by":"publisher","DOI":"10.1109\/MIS.2013.27"},{"key":"rf46","doi-asserted-by":"publisher","DOI":"10.1142\/S0218001411008890"}],"container-title":["International Journal of Pattern Recognition and Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S021800141559003X","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,8,6]],"date-time":"2019-08-06T14:22:51Z","timestamp":1565101371000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S021800141559003X"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,2,27]]},"references-count":18,"journal-issue":{"issue":"02","published-online":{"date-parts":[[2015,2,27]]},"published-print":{"date-parts":[[2015,3]]}},"alternative-id":["10.1142\/S021800141559003X"],"URL":"https:\/\/doi.org\/10.1142\/s021800141559003x","relation":{},"ISSN":["0218-0014","1793-6381"],"issn-type":[{"value":"0218-0014","type":"print"},{"value":"1793-6381","type":"electronic"}],"subject":[],"published":{"date-parts":[[2015,2,27]]}}}