{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:33:48Z","timestamp":1750221228956,"version":"3.41.0"},"reference-count":38,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2018,7,21]],"date-time":"2018-07-21T00:00:00Z","timestamp":1532131200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["61572154 and 91520204"],"award-info":[{"award-number":["61572154 and 91520204"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Asian Low-Resour. Lang. Inf. Process."],"published-print":{"date-parts":[[2018,12,31]]},"abstract":"<jats:p>Cross-lingual word embeddings are representations for vocabularies of two or more languages in one common continuous vector space and are widely used in various natural language processing tasks. A state-of-the-art way to generate cross-lingual word embeddings is to learn a linear mapping, with an assumption that the vector representations of similar words in different languages are related by a linear relationship. However, this assumption does not always hold true, especially for substantially different languages. We therefore propose to use kernel canonical correlation analysis to capture a non-linear relationship between word embeddings of two languages. 
By extensively evaluating the learned word embeddings on three tasks (word similarity, cross-lingual dictionary induction, and cross-lingual document classification) across five language pairs, we demonstrate that our proposed approach achieves essentially better performances than previous linear methods on all of the three tasks, especially for language pairs with substantial typological difference.<\/jats:p>","DOI":"10.1145\/3197566","type":"journal-article","created":{"date-parts":[[2018,7,23]],"date-time":"2018-07-23T13:02:15Z","timestamp":1532350935000},"page":"1-16","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":3,"title":["Improving Vector Space Word Representations Via Kernel Canonical Correlation Analysis"],"prefix":"10.1145","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7044-0683","authenticated-orcid":false,"given":"Xuefeng","family":"Bai","sequence":"first","affiliation":[{"name":"Harbin Institute of Technology, China"}]},{"given":"Hailong","family":"Cao","sequence":"additional","affiliation":[{"name":"Harbin Institute of Technology, China"}]},{"given":"Tiejun","family":"Zhao","sequence":"additional","affiliation":[{"name":"Harbin Institute of Technology, China"}]}],"member":"320","published-online":{"date-parts":[[2018,7,21]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Shotaro Akaho. 2006. A kernel method for canonical correlation analysis. CoRR abs\/cs\/0609071."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D16-1250"},{"volume-title":"Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 
Association for Computational Linguistics, 1352--1362","year":"2013","author":"Bond Francis","key":"e_1_2_1_3_1"},{"volume-title":"Proceedings of the 16 Conference of the European Association for Machine Translation (EAMT\u201912)","year":"2012","author":"Cettolo Mauro","key":"e_1_2_1_4_1"},{"volume-title":"Austin Matthews, and Noah A. Smith","year":"2015","author":"Dyer Chris","key":"e_1_2_1_5_1"},{"volume-title":"Proceedings of the ACL 2010 System Demonstrations. Association for Computational Linguistics, 7--12","year":"2010","author":"Dyer Chris","key":"e_1_2_1_6_1"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/E14-1049"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/371920.372094"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007662407062"},{"volume-title":"Statistical consistency of kernel canonical correlation analysis. J. Mach. Learn. Res. 8 (May","year":"2007","author":"Fukumizu Kenji","key":"e_1_2_1_10_1"},{"volume-title":"Proceedings of the 32nd International Conference on Machine Learning (ICML\u201915), David Blei and Francis Bach (Eds.). JMLR Workshop and Conference Proceedings, 748--756","year":"2015","author":"Gouws Stephan","key":"e_1_2_1_11_1"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/N15-1157"},{"volume-title":"Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)","author":"Guo Jiang","key":"e_1_2_1_13_1"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P14-1006"},{"volume-title":"SimLex-999: Evaluating semantic models with (genuine) similarity estimation. 
CoRR abs\/1408.3456","year":"2014","author":"Hill Felix","key":"e_1_2_1_15_1"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1093\/biomet\/28.3-4.321"},{"volume-title":"Proceedings of the International Conference on Computational Linguistics (COLING\u201912)","year":"2012","author":"Klementiev Alexandre","key":"e_1_2_1_17_1"},{"volume-title":"RCV1: A new benchmark collection for text categorization research. J. Mach. Learn. Res. 5 (Dec","year":"2004","author":"Lewis David D.","key":"e_1_2_1_18_1"},{"volume-title":"Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing. 151--159","author":"Luong Thang","key":"e_1_2_1_19_1"},{"volume-title":"Visualizing data using t-SNE. J. Mach. Learn. Res. 9 (Nov","year":"2008","author":"van der Maaten Laurens","key":"e_1_2_1_20_1"},{"volume-title":"Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Association for Computational Linguistics, 92--97","year":"2013","author":"McDonald Ryan","key":"e_1_2_1_21_1"},{"key":"e_1_2_1_22_1","unstructured":"Tomas Mikolov Kai Chen Greg Corrado and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. CoRR abs\/1301.3781."},{"volume-title":"Exploiting similarities among languages for machine translation. 
CoRR abs\/1309.4168","year":"2013","author":"Mikolov Tomas","key":"e_1_2_1_23_1"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N16-1083"},{"volume-title":"Myers and Arnold Well","year":"1995","author":"Jerome","key":"e_1_2_1_25_1"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1016\/0306-4573(88)90021-0"},{"volume-title":"Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)","author":"Shi Tianze","key":"e_1_2_1_27_1"},{"volume-title":"Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 455--465","author":"Socher Richard","key":"e_1_2_1_28_1"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1037\/0033-2909.87.2.245"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1243"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P16-1157"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.5555\/1873781.1873905"},{"volume-title":"Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 
Association for Computational Linguistics, 106--116","year":"2013","author":"Vuli\u0107 Ivan","key":"e_1_2_1_33_1"},{"volume-title":"Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)","author":"Vuli\u0107 Ivan","key":"e_1_2_1_34_1"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.5555\/1687878.1687913"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/N15-1104"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P14-1011"},{"volume-title":"Proceedings of the 30th AAAI Conference on Artificial Intelligence (AAAI\u201916)","year":"2016","author":"Zhang Meng","key":"e_1_2_1_38_1"}],"container-title":["ACM Transactions on Asian and Low-Resource Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3197566","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3197566","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T02:07:00Z","timestamp":1750212420000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3197566"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,7,21]]},"references-count":38,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2018,12,31]]}},"alternative-id":["10.1145\/3197566"],"URL":"https:\/\/doi.org\/10.1145\/3197566","relation":{},"ISSN":["2375-4699","2375-4702"],"issn-type":[{"type":"print","value":"2375-4699"},{"type":"electronic","value":"2375-4702"}],"subject":[],"published":{"date-parts":[[2018,7,21]]},"assertion":[{"value":"2017-06-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication 
History"}},{"value":"2018-03-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-07-21","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}