{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,20]],"date-time":"2025-11-20T12:36:42Z","timestamp":1763642202691,"version":"3.37.3"},"reference-count":66,"publisher":"Springer Science and Business Media LLC","issue":"7","license":[{"start":{"date-parts":[[2017,4,3]],"date-time":"2017-04-03T00:00:00Z","timestamp":1491177600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2017,4,3]],"date-time":"2017-04-03T00:00:00Z","timestamp":1491177600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100003382","name":"Core Research for Evolutional Science and Technology","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100003382","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Mach Learn"],"published-print":{"date-parts":[[2017,7]]},"DOI":"10.1007\/s10994-017-5634-8","type":"journal-article","created":{"date-parts":[[2017,4,4]],"date-time":"2017-04-04T07:53:50Z","timestamp":1491292430000},"page":"1083-1130","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["The mechanism of additive composition"],"prefix":"10.1007","volume":"106","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9146-2486","authenticated-orcid":false,"given":"Ran","family":"Tian","sequence":"first","affiliation":[]},{"given":"Naoaki","family":"Okazaki","sequence":"additional","affiliation":[]},{"given":"Kentaro","family":"Inui","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2017,4,3]]},"reference":[{"key":"5634_CR1","doi-asserted-by":"crossref","first-page":"385","DOI":"10.1162\/tacl_a_00106","volume":"4","author":"S Arora","year":"2016","unstructured":"Arora, S., Li, Y., Liang, Y., & Ma, T. (2016). A latent variable model approach to pmi-based word embeddings. Transactions of the Association for Computational Linguistics, 4, 385\u2013399.","journal-title":"Transactions of the Association for Computational Linguistics"},{"key":"5634_CR2","unstructured":"Banea, C., Chen, D., Mihalcea, R., Cardie, C., & Wiebe, J. (2014). Simcompass: Using deep learning word embeddings to assess cross-level similarity. In: Proceedings of SemEval."},{"key":"5634_CR3","unstructured":"Baroni, M., & Zamparelli, R. (2010). Nouns are vectors, adjectives are matrices: Representing adjective-noun constructions in semantic space. In: Proceedings of EMNLP."},{"key":"5634_CR4","unstructured":"Blacoe, W., & Lapata, M. (2012). A comparison of vector-based representations for semantic composition. In: Proceedings of EMNLP."},{"issue":"4","key":"5634_CR5","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1145\/2133806.2133826","volume":"55","author":"DM Blei","year":"2012","unstructured":"Blei, D. M. (2012). Probabilistic topic models. Communications of the ACM, 55(4), 77\u201384.","journal-title":"Communications of the ACM"},{"key":"5634_CR6","unstructured":"Boleda, G., Baroni, M., Pham, T.N., & McNally, L. (2013). Intensionality was only alleged: On adjective-noun composition in distributional semantics. In: Proceedings of IWCS."},{"key":"5634_CR7","volume-title":"Neural Networks: Tricks of the Trade","author":"L Bottou","year":"2012","unstructured":"Bottou, L. (2012). Stochastic gradient descent tricks. In G. Montavon, G. B. Orr, & K. R. M\u00fcller (Eds.), Neural Networks: Tricks of the Trade. Berlin: Springer."},{"issue":"2","key":"5634_CR8","doi-asserted-by":"publisher","first-page":"235","DOI":"10.1006\/jath.2001.3613","volume":"112","author":"M Burger","year":"2001","unstructured":"Burger, M., & Neubauer, A. (2001). Error bounds for approximation with neural networks. Journal of Approximation Theory, 112(2), 235\u2013250.","journal-title":"Journal of Approximation Theory"},{"issue":"1","key":"5634_CR9","first-page":"22","volume":"16","author":"KW Church","year":"1990","unstructured":"Church, K. W., & Hanks, P. (1990). Word association norms, mutual information, and lexicography. Computational Linguistics, 16(1), 22\u201329.","journal-title":"Computational Linguistics"},{"issue":"1","key":"5634_CR10","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1162\/COLI_a_00084","volume":"38","author":"D Clarke","year":"2012","unstructured":"Clarke, D. (2012). A context-theoretic framework for compositionality in distributional semantics. Computational Linguistics, 38(1), 41\u201347.","journal-title":"Computational Linguistics"},{"issue":"4","key":"5634_CR11","doi-asserted-by":"publisher","first-page":"661","DOI":"10.1137\/070710111","volume":"51","author":"A Clauset","year":"2009","unstructured":"Clauset, A., Shalizi, C. R., & Newman, M. E. J. (2009). Power-law distributions in empirical data. SIAM Review, 51(4), 661\u2013703.","journal-title":"SIAM Review"},{"issue":"1","key":"5634_CR12","first-page":"345","volume":"36","author":"B Coecke","year":"2010","unstructured":"Coecke, B., Sadrzadeh, M., & Clark, S. (2010). Mathematical foundations for a compositional distributional model of meaning. Linguistic Analysis, 36(1), 345\u2013384.","journal-title":"Linguistic Analysis"},{"key":"5634_CR13","first-page":"2493","volume":"12","author":"R Collobert","year":"2011","unstructured":"Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., & Kuksa, P. (2011). Natural language processing (almost) from scratch. Journal of Machine Learning Research, 12, 2493\u20132537.","journal-title":"Journal of Machine Learning Research"},{"issue":"7","key":"5634_CR14","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1371\/journal.pone.0129031","volume":"10","author":"A Corral","year":"2015","unstructured":"Corral, A., Boleda, G., & i Cancho, R. E. (2015). Zipf\u2019s law for word frequencies: Word forms versus lemmas in long texts. PLoS One, 10(7), 1\u201323.","journal-title":"PLoS One"},{"key":"5634_CR15","unstructured":"Dagan, I., Pereira, F., & Lee, L. (1994). Similarity-based estimation of word cooccurrence probabilities. In: Proceedings of ACL."},{"key":"5634_CR16","unstructured":"Dinu, G., Pham, N.T., & Baroni, M. (2013). General estimation and evaluation of compositional distributional semantic models. In: Proceedings of the Workshop on Continuous Vector Space Models and their Compositionality."},{"key":"5634_CR17","first-page":"2121","volume":"12","author":"J Duchi","year":"2011","unstructured":"Duchi, J., Hazan, E., & Singer, Y. (2011). Adaptive subgradient methods for online learning and stochastic optimization. Journal of Machine Learning Research, 12, 2121\u20132159.","journal-title":"Journal of Machine Learning Research"},{"key":"5634_CR18","doi-asserted-by":"publisher","first-page":"285","DOI":"10.1080\/01638539809545029","volume":"15","author":"PW Foltz","year":"1998","unstructured":"Foltz, P. W., Kintsch, W., & Landauer, T. K. (1998). The measurement of textual coherence with latent semantic analysis. Discourse Process, 15, 285\u2013307.","journal-title":"Discourse Process"},{"issue":"1","key":"5634_CR19","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1162\/neco.1992.4.1.1","volume":"4","author":"S Geman","year":"1992","unstructured":"Geman, S., Bienenstock, E., & Doursat, R. (1992). Neural networks and the bias\/variance dilemma. Neural Computation, 4(1), 1\u201358.","journal-title":"Neural Computation"},{"issue":"4","key":"5634_CR20","first-page":"153","volume":"2","author":"G Gnecco","year":"2008","unstructured":"Gnecco, G., & Sanguineti, M. (2008). Approximation error bounds via rademachers complexity. Applied Mathematical Sciences, 2(4), 153\u2013176.","journal-title":"Applied Mathematical Sciences"},{"key":"5634_CR21","unstructured":"Grefenstette, E., & Sadrzadeh, M. (2011). Experimental support for a categorical compositional distributional model of meaning. In: Proceedings of EMNLP."},{"key":"5634_CR22","unstructured":"Guevara, E. (2010). A regression model of adjective-noun compositionality in distributional semantics. In: Proceedings of the Workshop on GEometrical Models of Natural Language Semantics."},{"issue":"1","key":"5634_CR23","first-page":"207","volume":"13","author":"MU Gutmann","year":"2012","unstructured":"Gutmann, M. U., & Hyv\u00e4rinen, A. (2012). Noise-contrastive estimation of unnormalized statistical models, with applications to natural image statistics. Journal of Machine Learning Research, 13(1), 207\u2013361.","journal-title":"Journal of Machine Learning Research"},{"key":"5634_CR24","unstructured":"Ha LQ, Sicilia-Garcia, E.I., Ming, J., & Smith, F.J. (2002). Extension of zipf\u2019s law to words and phrases. In: Proceedings of Coling."},{"issue":"2","key":"5634_CR25","doi-asserted-by":"publisher","first-page":"217","DOI":"10.1137\/090771806","volume":"53","author":"N Halko","year":"2011","unstructured":"Halko, N., Martinsson, P. G., & Tropp, J. A. (2011). Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix decompositions. SIAM Review, 53(2), 217\u2013288.","journal-title":"SIAM Review"},{"key":"5634_CR26","doi-asserted-by":"publisher","first-page":"146","DOI":"10.1080\/00437956.1954.11659520","volume":"10","author":"ZS Harris","year":"1954","unstructured":"Harris, Z. S. (1954). Distributional structure. Word, 10, 146\u2013162.","journal-title":"Word"},{"key":"5634_CR27","unstructured":"Hashimoto, K., Stenetorp, P., Miwa, M., & Tsuruoka, Y. (2014). Jointly learning word representations and composition functions using predicate-argument structures. In: Proceedings of EMNLP."},{"key":"5634_CR28","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1162\/tacl_a_00098","volume":"4","author":"T Hashimoto","year":"2016","unstructured":"Hashimoto, T., Alvarez-Melis, D., & Jaakkola, T. (2016). Word embeddings as metric recovery in semantic spaces. Transactions of the Association for Computational Linguistics, 4, 273\u2013286.","journal-title":"Transactions of the Association for Computational Linguistics"},{"key":"5634_CR29","unstructured":"Iyyer, M., Manjunatha, V., Boyd-Graber, J., & III, H.D. (2015). Deep unordered composition rivals syntactic methods for text classification. In: Proceedings of ACL."},{"key":"5634_CR30","doi-asserted-by":"crossref","unstructured":"Kobayashi, H. (2014), Perplexity on reduced corpora. In: Proceedings of ACL.","DOI":"10.3115\/v1\/P14-1075"},{"key":"5634_CR31","volume-title":"The Psychology of Learning and Motivation","author":"TK Landauer","year":"2002","unstructured":"Landauer, T. K. (2002). On the computational basis of learning and cognition: Arguments from LSA. In N. Ross (Ed.), The Psychology of Learning and Motivation (Vol. 41). Cambridge: Academic Press."},{"issue":"2","key":"5634_CR32","doi-asserted-by":"publisher","first-page":"211","DOI":"10.1037\/0033-295X.104.2.211","volume":"104","author":"TK Landauer","year":"1997","unstructured":"Landauer, T. K., & Dumais, S. T. (1997). A solution to platos problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychological Review, 104(2), 211.","journal-title":"Psychological Review"},{"key":"5634_CR33","unstructured":"Landauer, T.K., Laham, D., Rehder, B., & Schreiner, M.E. (1997). How well can passage meaning be derived without using word order? a comparison of latent semantic analysis and humans. In: Proceedings of Annual Conference of the Cognitive Science Society."},{"key":"5634_CR34","unstructured":"Lebret, R., & Collobert, R. (2014). Word embeddings through Hellinger PCA. In: Proceedings of EACL."},{"key":"5634_CR35","doi-asserted-by":"crossref","unstructured":"Levy, O., & Goldberg, Y. (2014a). Linguistic regularities in sparse and explicit word representations. In: Proceedings of CoNLL.","DOI":"10.3115\/v1\/W14-1618"},{"key":"5634_CR36","unstructured":"Levy, O., & Goldberg, Y. (2014b). Neural word embedding as implicit matrix factorization. In: Advances in Neural Information Processing Systems (NIPS) 27, 2177\u20132185."},{"key":"5634_CR37","doi-asserted-by":"crossref","unstructured":"Levy, O., Goldberg, Y., & Dagan, I. (2015). Improving distributional similarity with lessons learned from word embeddings. Transactions of the Association for Computational Linguistics, 3, 211\u2013225.","DOI":"10.1162\/tacl_a_00134"},{"key":"5634_CR38","unstructured":"Melamud, O., Goldberger, J., & Dagan, I. (2016). context2vec: Learning generic context embedding with bidirectional lstm. In: Proceedings of CoNLL."},{"key":"5634_CR39","unstructured":"Mikolov, T., Ilya, S., Chen, K., Corrado, G., & Dean, J. (2013a). Distributed representations of words and phrases and their compositionality. In NIPS\u201913 Proceedings of the 26th International Conference on Neural Information Processing Systems (pp. 3111\u20133119)."},{"key":"5634_CR40","unstructured":"Mikolov, T., Yih, Wen-tau, & Zweig, G. (2013b). Linguistic regularities in continuous space word representations. In: Proceedings of NAACL-HLT."},{"issue":"1","key":"5634_CR41","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1080\/01690969108406936","volume":"6","author":"GA Miller","year":"1991","unstructured":"Miller, G. A., & Charles, W. G. (1991). Contextual correlates of semantic similarity. Language and Cognitive Processes, 6(1), 1\u201328.","journal-title":"Language and Cognitive Processes"},{"key":"5634_CR42","unstructured":"Mitchell, J., & Lapata, M. (2008). Vector-based models of semantic composition. In: Proceedings of ACL-HLT."},{"issue":"8","key":"5634_CR43","doi-asserted-by":"publisher","first-page":"1388","DOI":"10.1111\/j.1551-6709.2010.01106.x","volume":"34","author":"J Mitchell","year":"2010","unstructured":"Mitchell, J., & Lapata, M. (2010). Composition in distributional models of semantics. Cognitive Science, 34(8), 1388\u20131429.","journal-title":"Cognitive Science"},{"issue":"3","key":"5634_CR44","doi-asserted-by":"publisher","first-page":"567","DOI":"10.1016\/S0378-4371(01)00355-7","volume":"300","author":"MA Montemurro","year":"2001","unstructured":"Montemurro, M. A. (2001). Beyond the Zipf\u2013Mandelbrot law in quantitative linguistics. Physica A: Statistical Mechanics and its Applications, 300(3), 567\u2013578.","journal-title":"Physica A: Statistical Mechanics and its Applications"},{"key":"5634_CR45","unstructured":"Muraoka, M., Shimaoka, S., Yamamoto, K., Watanabe, Y., Okazaki, N., & Inui, K. (2014). Finding the best model among representative compositional models. In: Proceedings of PACLIC."},{"key":"5634_CR46","doi-asserted-by":"publisher","first-page":"51","DOI":"10.1023\/A:1018966213079","volume":"10","author":"P Niyogi","year":"1999","unstructured":"Niyogi, P., & Girosi, F. (1999). Generalization bounds for function approximation from scattered noisy data. Advances in Computational Mathematics, 10, 51\u201380.","journal-title":"Advances in Computational Mathematics"},{"key":"5634_CR47","unstructured":"Paperno, D., Pham, N.T., & Baroni, M. (2014). A practical and linguistically-motivated approach to compositional distributional semantics. In: Proceedings of ACL."},{"key":"5634_CR48","unstructured":"Pennington, J., Socher, R., & Manning, C. (2014). Glove: Global vectors for word representation. In: Proceedings of EMNLP."},{"key":"5634_CR49","unstructured":"Pham, N.T., Kruszewski, G., Lazaridou, A., & Baroni, M. (2015). Jointly optimizing word representations for lexical and sentential tasks with the c-phrase model. In: Proceedings of ACL."},{"key":"5634_CR50","volume-title":"Combinatorial Stochastic Processes","author":"J Pitman","year":"2006","unstructured":"Pitman, J. (2006). Combinatorial Stochastic Processes. Berlin: Springer-Verlag."},{"key":"5634_CR51","doi-asserted-by":"publisher","first-page":"855","DOI":"10.1214\/aop\/1024404422","volume":"25","author":"J Pitman","year":"1997","unstructured":"Pitman, J., & Yor, M. (1997). The two-parameter Pisson-Dirichlet distribution derived from a stable subordinator. Annals of Probability, 25, 855\u2013900.","journal-title":"Annals of Probability"},{"key":"5634_CR52","unstructured":"Rothe, S., & Sch\u00fctze, H. (2015). Autoextend: Extending word embeddings to embeddings for synsets and lexemes. In: Proceedings of ACL-IJCNLP."},{"key":"5634_CR53","first-page":"801","volume":"24","author":"R Socher","year":"2011","unstructured":"Socher, R., Huang, E. H., Pennin, J., & Manning, C. D. (2011). Dynamic pooling and unfolding recursive autoencoders for paraphrase detection. Advances in NIPS, 24, 801\u2013809.","journal-title":"Advances in NIPS"},{"key":"5634_CR54","unstructured":"Socher, R., Huval, B., Manning, C.D., & Ng, A.Y. (2012). Semantic compositionality through recursive matrix-vector spaces. In: Proceedings of EMNLP."},{"key":"5634_CR55","unstructured":"Stratos, K., Collins, M., & Hsu, D. (2015). Model-based word embeddings from decompositions of count matrices. In: Proceedings of ACL-IJCNLP."},{"key":"5634_CR56","unstructured":"Takase, S., Okazaki, N., & Inui, K. (2016). Composing distributed representations of relational patterns. In: Proceedings of ACL."},{"key":"5634_CR57","unstructured":"Teh, Y.W. (2006). A hierarchical bayesian language model based on Pitman-Yor processes. In: Proceedings of ACL."},{"key":"5634_CR58","unstructured":"The BNC Consortium (2007) The british national corpus, version 3 (bnc xml edition). Distributed by Oxford University Computing Services, \n                    http:\/\/www.natcorp.ox.ac.uk\/"},{"key":"5634_CR59","unstructured":"Tian, R., Miyao, Y., & Matsuzaki, T. (2014). Logical inference on dependency-based compositional semantics. In: Proceedings of ACL."},{"key":"5634_CR60","unstructured":"Tian, R., Okazaki, N., & Inui, K. (2016). Learning semantically and additively compositional distributional representations. In: Proceedings of ACL."},{"key":"5634_CR61","unstructured":"Turian, J., Ratinov, L.A., & Bengio, Y. (2010). Word representations: A simple and general method for semi-supervised learning. In: Proceedings of ACL."},{"key":"5634_CR62","unstructured":"Turney, P.D. (2001). Mining the web for synonyms: PMI-IR versus LSA on TOEFL. In: Proceedings of EMCL."},{"issue":"1","key":"5634_CR63","doi-asserted-by":"crossref","first-page":"141","DOI":"10.1613\/jair.2934","volume":"37","author":"PD Turney","year":"2010","unstructured":"Turney, P. D., & Pantel, P. (2010). From frequency to meaning: Vector space models of semantics. Journal of Artificial Intelligence Research, 37(1), 141\u2013188.","journal-title":"Journal of Artificial Intelligence Research"},{"key":"5634_CR64","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4757-2440-0","volume-title":"The Nature of Statistical Learning Theory","author":"VN Vapnik","year":"1995","unstructured":"Vapnik, V. N. (1995). The Nature of Statistical Learning Theory. Berlin: Springer-Verlag."},{"key":"5634_CR65","unstructured":"Zanzotto, F.M., Korkontzelos, I., Fallucchi, F., & Manandhar, S. (2010). Estimating linear models for compositional distributional semantics. In: Proceedings of Coling."},{"key":"5634_CR66","volume-title":"The Psychobiology of Language: An Introduction to Dynamic Philology","author":"GK Zipf","year":"1935","unstructured":"Zipf, G. K. (1935). The Psychobiology of Language: An Introduction to Dynamic Philology. Cambridge: M.I.T. Press."}],"container-title":["Machine Learning"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s10994-017-5634-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10994-017-5634-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10994-017-5634-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,5,17]],"date-time":"2020-05-17T08:06:27Z","timestamp":1589702787000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10994-017-5634-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,4,3]]},"references-count":66,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2017,7]]}},"alternative-id":["5634"],"URL":"https:\/\/doi.org\/10.1007\/s10994-017-5634-8","relation":{},"ISSN":["0885-6125","1573-0565"],"issn-type":[{"type":"print","value":"0885-6125"},{"type":"electronic","value":"1573-0565"}],"subject":[],"published":{"date-parts":[[2017,4,3]]},"assertion":[{"value":"16 September 2016","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 March 2017","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 April 2017","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}