{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T03:13:06Z","timestamp":1772507586282,"version":"3.50.1"},"reference-count":79,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2024,5,22]],"date-time":"2024-05-22T00:00:00Z","timestamp":1716336000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,5,22]],"date-time":"2024-05-22T00:00:00Z","timestamp":1716336000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["No. 62007014"],"award-info":[{"award-number":["No. 62007014"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100017630","name":"Humanities and Social Sciences Youth Foundation, Ministry of Education of the People\u2019s Republic of China","doi-asserted-by":"publisher","award":["No. 20YJC880024"],"award-info":[{"award-number":["No. 20YJC880024"]}],"id":[{"id":"10.13039\/501100017630","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Complex Intell. Syst."],"published-print":{"date-parts":[[2024,8]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>In recent years, there has been a significant increase in the design of neural network models for solving math word problems (MWPs). These neural solvers have been designed with various architectures and evaluated on diverse datasets, posing challenges in fair and effective performance evaluation. This paper presents a comparative study of representative neural solvers, aiming to elucidate their technical features and performance variations in solving different types of MWPs. Firstly, an in-depth technical analysis is conducted from the initial deep neural solver DNS to the state-of-the-art GPT-4. To enhance the technical analysis, a unified framework is introduced, which comprises highly reusable modules decoupled from existing MWP solvers. Subsequently, a testbed is established to conveniently reproduce existing solvers and develop new solvers by combing these reusable modules, and finely regrouped datasets are provided to facilitate the comparative evaluation of the designed solvers. Then, comprehensive testing is conducted and detailed results for eight representative MWP solvers on five finely regrouped datasets are reported. The comparative analysis yields several key findings: (1) Pre-trained language model-based solvers demonstrate significant accuracy advantages across nearly all datasets, although they suffer from limitations in math equation calculation. (2) Models integrated with tree decoders exhibit strong performance in generating complex math equations. (3) Identifying and appropriately representing implicit knowledge hidden in problem texts is crucial for improving the accuracy of math equation generation. Finally, the paper also discusses the major technical challenges and potential research directions in this field. The insights gained from this analysis offer valuable guidance for future research, model development, and performance optimization in the field of math word problem solving.<\/jats:p>","DOI":"10.1007\/s40747-024-01454-8","type":"journal-article","created":{"date-parts":[[2024,5,22]],"date-time":"2024-05-22T03:38:20Z","timestamp":1716349100000},"page":"5805-5830","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Comparative study of typical neural solvers in solving math word problems"],"prefix":"10.1007","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2088-8193","authenticated-orcid":false,"given":"Bin","family":"He","sequence":"first","affiliation":[]},{"given":"Xinguo","family":"Yu","sequence":"additional","affiliation":[]},{"given":"Litian","family":"Huang","sequence":"additional","affiliation":[]},{"given":"Hao","family":"Meng","sequence":"additional","affiliation":[]},{"given":"Guanghua","family":"Liang","sequence":"additional","affiliation":[]},{"given":"Shengnan","family":"Chen","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,5,22]]},"reference":[{"issue":"9","key":"1454_CR1","doi-asserted-by":"publisher","first-page":"2287","DOI":"10.1109\/TPAMI.2019.2914054","volume":"42","author":"D Zhang","year":"2019","unstructured":"Zhang D, Wang L, Zhang L, Dai BT, Shen HT (2019) The gap of semantic parsing: a survey on automatic math word problem solvers. IEEE Trans Pattern Anal Mach Intell 42(9):2287\u20132305","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"1454_CR2","doi-asserted-by":"crossref","unstructured":"Wang Y, Liu X, Shi S (2017) Deep neural solver for math word problems. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 845\u2013854. Association for Computational Linguistics, Copenhagen, Denmark","DOI":"10.18653\/v1\/D17-1088"},{"issue":"10","key":"1454_CR3","doi-asserted-by":"publisher","first-page":"2222","DOI":"10.1109\/TNNLS.2016.2582924","volume":"28","author":"K Greff","year":"2017","unstructured":"Greff K, Srivastava RK, Koutn\u00edk J, Steunebrink BR, Schmidhuber J (2017) LSTM: a search space odyssey. IEEE Trans Neural Netw Learn Syst 28(10):2222\u20132232","journal-title":"IEEE Trans Neural Netw Learn Syst"},{"key":"1454_CR4","doi-asserted-by":"crossref","unstructured":"Wang L, Wang Y, Cai D, Zhang D, Liu X (2018) Translating a math word problem to a expression tree. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 1064\u20131069. Association for Computational Linguistics, Brussels, Belgium","DOI":"10.18653\/v1\/D18-1132"},{"key":"1454_CR5","doi-asserted-by":"crossref","unstructured":"Xie Z, Sun S (2019) A goal-driven tree-structured neural model for math word problems. In: Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, pp. 5299\u20135305. International Joint Conferences on Artificial Intelligence Organization, Macao, China","DOI":"10.24963\/ijcai.2019\/736"},{"key":"1454_CR6","doi-asserted-by":"crossref","unstructured":"Zhang J, Wang L, Lee RKW, Bin Y, Wang Y, Shao J, Lim EP (2020) Graph-to-tree learning for solving math word problems. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 3928\u20133937. Association for Computational Linguistics, Seattle, USA","DOI":"10.18653\/v1\/2020.acl-main.362"},{"key":"1454_CR7","doi-asserted-by":"crossref","unstructured":"Wu Q, Zhang Q, Wei Z (2021) An edge-enhanced hierarchical graph-to-tree network for math word problem solving. In: Findings of the Association for Computational Linguistics: EMNLP 2021, pp. 1473\u20131482. Association for Computational Linguistics, Punta Cana, Dominican Republic","DOI":"10.18653\/v1\/2021.findings-emnlp.127"},{"key":"1454_CR8","doi-asserted-by":"crossref","unstructured":"Yang Z, Qin J, Chen J, Lin L, Liang X (2022) LogicSolver: Towards interpretable math word problem solving with logical prompt-enhanced learning. In: Findings of the Association for Computational Linguistics: EMNLP 2022, pp. 1\u201313. Association for Computational Linguistics, Abu Dhabi, United Arab Emirates","DOI":"10.18653\/v1\/2022.findings-emnlp.1"},{"key":"1454_CR9","doi-asserted-by":"crossref","unstructured":"Jie Z, Li J, Lu W (2022) Learning to reason deductively: Math word problem solving as complex relation extraction. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, pp. 5944\u20135955. Association for Computational Linguistics, Dublin, Ireland","DOI":"10.18653\/v1\/2022.acl-long.410"},{"key":"1454_CR10","doi-asserted-by":"crossref","unstructured":"Lan Y, Wang L, Zhang Q, Lan Y, Dai BT, Wang Y, Zhang D, Lim EP (2022) Mwptoolkit: An open-source framework for deep learning-based math word problem solvers. Proceedings of the AAAI Conference on Artificial Intelligence 36:13188\u201313190","DOI":"10.1609\/aaai.v36i11.21723"},{"key":"1454_CR11","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS\u201917, pp. 6000\u20136010. Curran Associates Inc., Red Hook, NY, USA"},{"key":"1454_CR12","doi-asserted-by":"publisher","DOI":"10.1016\/j.knosys.2020.106548","volume":"212","author":"JCW Lin","year":"2021","unstructured":"Lin JCW, Shao Y, Djenouri Y, Yun U (2021) ASRNN: a recurrent neural network with an attention model for sequence labeling. Knowl-Based Syst 212:106548","journal-title":"Knowl-Based Syst"},{"key":"1454_CR13","doi-asserted-by":"crossref","unstructured":"Chiang TR, Chen YN (2019) Semantically-aligned equation generation for solving and reasoning math word problems. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologie, pp. 2656\u20132668. Association for Computational Linguistics, Minneapolis, Minnesota","DOI":"10.18653\/v1\/N19-1272"},{"key":"1454_CR14","doi-asserted-by":"crossref","unstructured":"Hong Y, Li Q, Ciao D, Haung S, Zhu SC (2021) Learning by fixing: Solving math word problems with weak supervision. In: Proceedings of the AAAI Conference on Artificial Intelligence, 35:4959\u20134967","DOI":"10.1609\/aaai.v35i6.16629"},{"key":"1454_CR15","doi-asserted-by":"crossref","unstructured":"Hong Y, Li Q, Gong R, Ciao D, Huang S, Zhu SC (2021) SMART: a situation model for algebra story problems via attributed grammar. In: Proceedings of the 2021 AAAI Conference on Artificial Intelligence, pp. 13009\u201313017. Vancouver, Canada","DOI":"10.1609\/aaai.v35i14.17538"},{"key":"1454_CR16","doi-asserted-by":"crossref","unstructured":"Roy S, Roth D (2017) Unit dependency graph and its application to arithmetic word problem solving. In: Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, pp. 3082\u20133088. San Francisco, USA","DOI":"10.1609\/aaai.v31i1.10959"},{"key":"1454_CR17","doi-asserted-by":"crossref","unstructured":"Li S, Wu L, Feng S, Xu F, Xu F, Zhong S (2020) Graph-to-tree neural networks for learning structured input-output translation with applications to semantic parsing and math word problem. In: Findings of the Association for Computational Linguistics: EMNLP 2020, pp. 2841\u20132852. Association for Computational Linguistics, Punta Cana, Dominican Republic","DOI":"10.18653\/v1\/2020.findings-emnlp.255"},{"issue":"8","key":"1454_CR18","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S Hochreiter","year":"1997","unstructured":"Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735\u20131780","journal-title":"Neural Comput"},{"key":"1454_CR19","unstructured":"Bahdanau D, Cho KH, Bengio Y (2015) Neural machine translation by jointly learning to align and translate. In: 3rd International Conference on Learning Representations. San Diego, California"},{"key":"1454_CR20","doi-asserted-by":"crossref","unstructured":"Cai D, Lam W (2020) Graph transformer for graph-to-sequence learning. Proceedings of the AAAI Conference on Artificial Intelligence 34:7464\u20137471","DOI":"10.1609\/aaai.v34i05.6243"},{"key":"1454_CR21","unstructured":"Hamilton WL, Ying R, Leskovec J (2017) Inductive representation learning on large graphs. In: Proceedings of the 31st International Conference on Neural Information Processing Systems, pp. 1025\u20131035. Curran Associates Inc., Red Hook, NY, USA"},{"issue":"2","key":"1454_CR22","doi-asserted-by":"publisher","first-page":"93","DOI":"10.1007\/s10462-009-9110-0","volume":"29","author":"A Mukherjee","year":"2008","unstructured":"Mukherjee A, Garain U (2008) A review of methods for automatic understanding of natural language mathematical problems. Artificial Intell Rev 29(2):93\u2013122","journal-title":"Artificial Intell Rev"},{"key":"1454_CR23","unstructured":"Meadows J, Freitas A (2022) A survey in mathematical language processing. arXiv:2205.15231 [cs]"},{"key":"1454_CR24","doi-asserted-by":"crossref","unstructured":"Lu P, Qiu L, Yu W, Welleck S, Chang KW (2023) A survey of deep learning for mathematical reasoning. arXiv:2212.10535 [cs]","DOI":"10.18653\/v1\/2023.acl-long.817"},{"key":"1454_CR25","doi-asserted-by":"crossref","unstructured":"Liu Q, Guan W, Li S, Kawahara D (2019) Tree-structured decoding for solving math word problems. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 2370\u20132379. Association for Computational Linguistics, Hong Kong, China","DOI":"10.18653\/v1\/D19-1241"},{"key":"1454_CR26","doi-asserted-by":"crossref","unstructured":"Liang Z, Zhang J, Wang L, Qin W, Lan Y, Shao J, Zhang X (2022) MWP-BERT: Numeracy-augmented pre-training for math word problem solving. In: Findings of the Association for Computational Linguistics: NAACL 2022, pp. 997\u20131009. Association for Computational Linguistics, Seattle, United States","DOI":"10.18653\/v1\/2022.findings-naacl.74"},{"key":"1454_CR27","doi-asserted-by":"crossref","unstructured":"Zhang W, Shen Y, Ma Y, Cheng X, Tan Z, Nong Q, Lu W (2022) Multi-view reasoning: Consistent contrastive learning for math word problem. In: Findings of the Association for Computational Linguistics: EMNLP 2022, pp. 1103\u20131116. Association for Computational Linguistics, Abu Dhabi, United Arab Emirates","DOI":"10.18653\/v1\/2022.findings-emnlp.79"},{"key":"1454_CR28","unstructured":"Brown T, Mann B, Ryder N, Subbiah M, Kaplan JD, Dhariwal P, Neelakantan A, Shyam P, Sastry G, Askell A, Agarwal S, Herbert-Voss A, Krueger G, Henighan T, Child R, Ramesh A, Ziegler D, Wu J, Winter C, Hesse C, Chen M, Sigler E, Litwin M, Gray S, Chess B, Clark J, Berner C, McCandlish S, Radford A, Sutskever I, Amodei D (2020) Language models are few-shot learners. In: Advances in Neural Information Processing Systems, 33:1877\u20131901. Curran Associates, Inc"},{"key":"1454_CR29","unstructured":"Radford A, Wu J, Child R, Luan D, Amodei D, Sutskever I (2019) Language models are unsupervised multitask learners. Tech. rep., OpenAI. OpenAI blog"},{"key":"1454_CR30","doi-asserted-by":"publisher","unstructured":"Zhou A, Wang K, Lu Z, Shi W, Luo S, Qin Z, Lu S, Jia A, Song L, Zhan M, Li H (2023) Solving challenging math word problems using GPT-4 code interpreter with code-based self-verification. https:\/\/doi.org\/10.48550\/arXiv.2308.07921. arXiv:2308.07921 [cs]","DOI":"10.48550\/arXiv.2308.07921"},{"issue":"5","key":"1454_CR31","doi-asserted-by":"publisher","first-page":"565","DOI":"10.3758\/BF03207654","volume":"17","author":"CR Fletcher","year":"1985","unstructured":"Fletcher CR (1985) Understanding and solving arithmetic word problems: a computer simulation. Behav Res Methods Instruments Comput 17(5):565\u2013571","journal-title":"Behav Res Methods Instruments Comput"},{"key":"1454_CR32","doi-asserted-by":"crossref","unstructured":"Kushman N, Artzi Y, Zettlemoyer L, Barzilay R (2014) Learning to automatically solve algebra word problems. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, pp. 271\u2013281. Association for Computational Linguistics, Baltimore, Maryland","DOI":"10.3115\/v1\/P14-1026"},{"key":"1454_CR33","doi-asserted-by":"crossref","unstructured":"Shen Y, Jin C (2020) Solving math word problems with multi-encoders and multi-decoders. In: Proceedings of the 28th International Conference on Computational Linguistics, pp. 2924\u20132934. International Committee on Computational Linguistics, Barcelona, Spain","DOI":"10.18653\/v1\/2020.coling-main.262"},{"key":"1454_CR34","doi-asserted-by":"crossref","unstructured":"Liang CC, Hsu KY, Huang CT, Li CM, Miao SY, Su KY (2016) A tag-based statistical english math word problem solver with understanding, reasoning and explanation. In: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, pp. 4254\u20134255. San Diego, USA","DOI":"10.24963\/ijcai.2017\/775"},{"key":"1454_CR35","unstructured":"Devlin J, Chang MW, Lee K, Toutanova K (2019) BERT: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 4171\u20134186. Association for Computational Linguistics, Minneapolis, Minnesota"},{"key":"1454_CR36","doi-asserted-by":"crossref","unstructured":"Wang L, Zhang D, Zhang J, Xu X, Gao L, Dai BT, Shen HT (2019) Template-based math word problem solvers with recursive neural networks. In: Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence and Thirty-First Innovative Applications of Artificial Intelligence Conference and Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, pp. 7144\u20137151. AAAI Press, Hawaii, USA","DOI":"10.1609\/aaai.v33i01.33017144"},{"key":"1454_CR37","doi-asserted-by":"crossref","unstructured":"Li J, Wang L, Zhang J, Wang Y, Dai BT, Zhang D (2019) Modeling intra-relation in math word problems with different functional multi-head attentions. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 6162\u20136167. Association for Computational Linguistics","DOI":"10.18653\/v1\/P19-1619"},{"key":"1454_CR38","doi-asserted-by":"crossref","unstructured":"Qin J, Lin L, Liang X, Zhang R, Lin L (2020) Semantically-aligned universal tree-structured solver for math word problems. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 3780\u20133789. Association for Computational Linguistics, Punta Cana, Dominican Republic","DOI":"10.18653\/v1\/2020.emnlp-main.309"},{"key":"1454_CR39","doi-asserted-by":"crossref","unstructured":"Wu Q, Zhang Q, Wei Z, Huang X (2021) Math word problem solving with explicit numerical values. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, pp. 5859\u20135869. Association for Computational Linguistics, Bangkok, Thailand","DOI":"10.18653\/v1\/2021.acl-long.455"},{"key":"1454_CR40","doi-asserted-by":"crossref","unstructured":"Yu W, Wen Y, Zheng F, Xiao N (2021) Improving math word problems with pre-trained knowledge and hierarchical reasoning. In: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pp. 3384\u20133394. Association for Computational Linguistics, Online and Punta Cana, Dominican Republic","DOI":"10.18653\/v1\/2021.emnlp-main.272"},{"key":"1454_CR41","doi-asserted-by":"crossref","unstructured":"Li Z, Zhang W, Yan C, Zhou Q, Li C, Liu H, Cao Y (2022) Seeking patterns, not just memorizing procedures: Contrastive learning for solving math word problems. In: Findings of the Association for Computational Linguistics: ACL 2022, pp. 2486\u20132496. Association for Computational Linguistics, Dublin, Ireland","DOI":"10.18653\/v1\/2022.findings-acl.195"},{"key":"1454_CR42","doi-asserted-by":"crossref","unstructured":"Shen J, Yin Y, Li L, Shang L, Jiang X, Zhang M, Liu Q (2021) Generate & rank: A multi-task framework for math word problems. In: Findings of the Association for Computational Linguistics: EMNLP 2021, pp. 2269\u20132279. Association for Computational Linguistics, Punta Cana, Dominican Republic","DOI":"10.18653\/v1\/2021.findings-emnlp.195"},{"key":"1454_CR43","doi-asserted-by":"crossref","unstructured":"Shen Y, Liu Q, Mao Z, Cheng F, Kurohashi S (2022) Textual enhanced contrastive learning for solving math word problems. In: Findings of the Association for Computational Linguistics: EMNLP 2022, pp. 4297\u20134307. Association for Computational Linguistics, Abu Dhabi, United Arab Emirates","DOI":"10.18653\/v1\/2022.findings-emnlp.316"},{"key":"1454_CR44","unstructured":"Chowdhery A, Narang S, Devlin J, Bosma M, Mishra G, Roberts A, Barham P, Chung HW, Sutton C, Gehrmann S, Schuh P, Shi K, Tsvyashchenko S, Maynez J, Rao A, Barnes P, Tay Y, Shazeer N, Prabhakaran V, Reif E, Du N, Hutchinson B, Pope R, Bradbury J, Austin J, Isard M, Gur-Ari G, Yin P, Duke T, Levskaya A, Ghemawat S, Dev S, Michalewski H, Garcia X, Misra V, Robinson K, Fedus L, Zhou D, Ippolito D, Luan D, Lim H, Zoph B, Spiridonov A, Sepassi R, Dohan D, Agrawal S, Omernick M, Dai AM, Pillai TS, Pellat M, Lewkowycz A, Moreira E, Child R, Polozov O, Lee K, Zhou Z, Wang X, Saeta B, Diaz M, Firat O, Catasta M, Wei J, Meier-Hellstern K, Eck D, Dean J, Petrov S, Fiedel N (2022) PaLM: Scaling language modeling with pathways . arXiv:2204.02311 [cs]"},{"key":"1454_CR45","unstructured":"Lewkowycz A, Andreassen A, Dohan D, Dyer E, Michalewski H, Ramasesh V, Slone A, Anil C, Schlag I, Gutman-Solo T, Wu Y, Neyshabur B, Gur-Ari G, Misra V (2022) Solving Quantitative Reasoning Problems with Language Models. In: Advances in Neural Information Processing Systems, vol.\u00a035, pp. 3843\u20133857. Curran Associates, Inc"},{"key":"1454_CR46","doi-asserted-by":"publisher","unstructured":"Touvron H, Lavril T, Izacard G, Martinet X, Lachaux MA, Lacroix T, Rozi\u00e8re B, Goyal N, Hambro E, Azhar F, Rodriguez A, Joulin A, Grave E, Lample G (2023). LLaMA: Open and efficient foundation language models https:\/\/doi.org\/10.48550\/arXiv.2302.13971. arXiv:2302.13971 [cs]","DOI":"10.48550\/arXiv.2302.13971"},{"key":"1454_CR47","unstructured":"Chung J, Gulcehre C, Cho K, Bengio Y (2014) Empirical evaluation of gated recurrent neural networks on sequence modeling. In: NIPS 2014 Workshop on Deep Learning, December 2014. Montreal, Canada"},{"key":"1454_CR48","doi-asserted-by":"publisher","first-page":"115167","DOI":"10.1109\/ACCESS.2020.3002766","volume":"8","author":"A Ghazvini","year":"2020","unstructured":"Ghazvini A, Abdullah SNHS, Kamru Hasan M, Bin Kasim DZA (2020) Crime spatiotemporal prediction with fused objective function in time delay neural network. IEEE Access 8:115167\u2013115183","journal-title":"IEEE Access"},{"issue":"4","key":"1454_CR49","doi-asserted-by":"publisher","first-page":"2947","DOI":"10.1109\/TII.2020.3001493","volume":"17","author":"Y Djenouri","year":"2021","unstructured":"Djenouri Y, Srivastava G, Lin JCW (2021) Fast and accurate convolution neural network for detecting manufacturing data. IEEE Trans Ind Inform 17(4):2947\u20132955","journal-title":"IEEE Trans Ind Inform"},{"key":"1454_CR50","doi-asserted-by":"crossref","unstructured":"Wu Q, Zhang Q, Fu J, Huang X (2020) A knowledge-aware sequence-to-tree network for math word problem solving. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 7137\u20137146. Association for Computational Linguistics, Punta Cana, Dominican Republic","DOI":"10.18653\/v1\/2020.emnlp-main.579"},{"key":"1454_CR51","doi-asserted-by":"crossref","unstructured":"Gupta A, Kumar S, Kumar P S (2023) Solving age-word problems using domain ontology and bert. In: Proceedings of the 6th Joint International Conference on Data Science & Management of Data, pp. 95\u2013103. ACM, New York, NY, USA","DOI":"10.1145\/3570991.3571058"},{"key":"1454_CR52","doi-asserted-by":"crossref","unstructured":"Petroni F, Rockt\u00e4schel T, Riedel S, Lewis P, Bakhtin A, Wu Y, Miller A (2019) Language models as knowledge bases? In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 2463\u20132473. Association for Computational Linguistics, Hong Kong, China","DOI":"10.18653\/v1\/D19-1250"},{"key":"1454_CR53","doi-asserted-by":"publisher","first-page":"423","DOI":"10.1162\/tacl_a_00324","volume":"8","author":"Z Jiang","year":"2020","unstructured":"Jiang Z, Xu FF, Araki J, Neubig G (2020) How can we know what language models know? Trans Assoc Comput Linguistics 8:423\u2013438 (Place: Cambridge, MA Publisher: MIT Press)","journal-title":"Trans Assoc Comput Linguistics"},{"key":"1454_CR54","doi-asserted-by":"crossref","unstructured":"Liu Z, Lin W, Shi Y, Zhao J (2021) A robustly optimized bert pre-training approach with post-training. In: Proceedings of the 20th Chinese National Conference on Computational Linguistics, pp. 1218\u20131227. Chinese Information Processing Society of China, Huhhot, China","DOI":"10.1007\/978-3-030-84186-7_31"},{"key":"1454_CR55","doi-asserted-by":"publisher","first-page":"3504","DOI":"10.1109\/TASLP.2021.3124365","volume":"29","author":"Y Cui","year":"2019","unstructured":"Cui Y, Che W, Liu T, Qin B, Yang Z, Wang S, Hu G (2019) Pre-training with whole word masking for chinese bert. IEEE\/ACM Trans Audio Speech Language Process 29:3504\u20133514","journal-title":"IEEE\/ACM Trans Audio Speech Language Process"},{"key":"1454_CR56","doi-asserted-by":"publisher","unstructured":"Chen J, Pan X, Yu D, Song K, Wang X, Yu D, Chen J (2023) Skills-in-context prompting: Unlocking compositionality in large language models. https:\/\/doi.org\/10.48550\/arXiv.2308.00304. arXiv:2308.00304 [cs]","DOI":"10.48550\/arXiv.2308.00304"},{"key":"1454_CR57","unstructured":"Wei J, Wang X, Schuurmans D, Bosma M, ichter b, Xia F, Chi E, Le QV, Zhou D (2022) Chain-of-thought prompting elicits reasoning in large language models. In: Advances in Neural Information Processing Systems, vol.\u00a035, pp. 24824\u201324837. Curran Associates, Inc"},{"key":"1454_CR58","doi-asserted-by":"crossref","unstructured":"Huang X, Ruan W, Huang W, Jin G, Dong Y, Wu C, Bensalem S, Mu R, Qi Y, Zhao X, Cai K, Zhang Y, Wu S, Xu P, Wu D, Freitas A, Mustafa MA (2023) A survey of safety and trustworthiness of large language models through the lens of verification and validation. http:\/\/arxiv.org\/abs\/2305.11391. arXiv:2305.11391 [cs]","DOI":"10.1007\/s10462-024-10824-0"},{"key":"1454_CR59","doi-asserted-by":"crossref","unstructured":"Dong L, Lapata M (2016) Language to logical form with neural attention. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 33\u201343. Association for Computational Linguistics, Berlin, Germany","DOI":"10.18653\/v1\/P16-1004"},{"key":"1454_CR60","doi-asserted-by":"crossref","unstructured":"Zhang J, Lee RKW, Lim EP, Qin W, Wang L, Shao J, Sun Q (2020) Teacher-student networks with multiple decoders for solving math word problem. In: Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, pp. 4011\u20134017. International Joint Conferences on Artificial Intelligence Organization, Yokohama, Japan","DOI":"10.24963\/ijcai.2020\/555"},{"key":"1454_CR61","unstructured":"Bobrow DG (1964) Natural language input for a computer problem solving system. Tech. rep., Massachusetts Institute of Technology, USA"},{"key":"1454_CR62","unstructured":"Bakman Y (2007) Robust understanding of word problems with extraneous information. arXiv General Mathematics. https:\/\/api.semanticscholar.org\/CorpusID:117981901"},{"key":"1454_CR63","doi-asserted-by":"publisher","first-page":"585","DOI":"10.1162\/tacl_a_00160","volume":"3","author":"R Koncel-Kedziorski","year":"2015","unstructured":"Koncel-Kedziorski R, Hajishirzi H, Sabharwal A, Etzioni O, Ang SD (2015) Parsing algebraic word problems into equations. Trans Assoc Comput Linguistics 3:585\u2013597 (Place: Cambridge, MA)","journal-title":"Trans Assoc Comput Linguistics"},{"key":"1454_CR64","doi-asserted-by":"crossref","unstructured":"Roy S, Upadhyay S, Roth D (2016) Equation parsing: Mapping sentences to grounded equations. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 1088\u20131097. Association for Computational Linguistics, Austin, Texas","DOI":"10.18653\/v1\/D16-1117"},{"key":"1454_CR65","doi-asserted-by":"crossref","unstructured":"Wang L, Zhang D, Gao L, Song J, Guo L, Shen HT (2018) MathDQN: Solving arithmetic word problems via deep reinforcement learning. In: Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, pp. 5545\u20135552. AAAI Press, New Orleans, USA","DOI":"10.1609\/aaai.v32i1.11981"},{"key":"1454_CR66","doi-asserted-by":"crossref","unstructured":"Hosseini MJ, Hajishirzi H, Etzioni O, Kushman N (2014) Learning to solve arithmetic word problems with verb categorization. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 523\u2013533. Association for Computational Linguistics, Doha, Qatar","DOI":"10.3115\/v1\/D14-1058"},{"key":"1454_CR67","doi-asserted-by":"crossref","unstructured":"Shi S, Wang Y, Lin CY, Liu X, Rui Y (2015) Automatically solving number word problems by semantic parsing and reasoning. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 1132\u20131142. Lisbon, Portugal","DOI":"10.18653\/v1\/D15-1135"},{"key":"1454_CR68","doi-asserted-by":"crossref","unstructured":"Liang CC, Hsu KY, Huang CT, Li CM, Miao SY, Su KY (2016) A tag-based English math word problem solver with understanding, reasoning and explanation. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations, pp. 67\u201371. Association for Computational Linguistics, San Diego, California","DOI":"10.18653\/v1\/N16-3014"},{"key":"1454_CR69","doi-asserted-by":"crossref","unstructured":"Upadhyay S, Chang MW, Chang KW, Yih Wt (2016) Learning from explicit and implicit supervision jointly for algebra word problems. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 297\u2013306. Association for Computational Linguistics, Austin, Texas","DOI":"10.18653\/v1\/D16-1029"},{"key":"1454_CR70","doi-asserted-by":"crossref","unstructured":"Chen S, Zhou M, He B, Wang P, Wang Z (2022) A comparative analysis of math word problem solving on characterized datasets. In: In Proceedings of the 2022 International Conference on Intelligent Education and Intelligent Research. IEEE, Wuhan, China","DOI":"10.1109\/IEIR56323.2022.10050058"},{"key":"1454_CR71","doi-asserted-by":"crossref","unstructured":"He B, Chen S, Miao Z, Liang G, Pan K, Huang L (2022) Comparative analysis of problem representation learning in math word problem solving. In: In Proceedings of the 2022 International Conference on Intelligent Education and Intelligent Research. IEEE, Wuhan, China","DOI":"10.1109\/IEIR56323.2022.10050067"},{"key":"1454_CR72","doi-asserted-by":"crossref","unstructured":"Roy S, Roth D (2015) Solving general arithmetic word problems. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 1743\u20131752. Association for Computational Linguistics, Lisbon, Portugal","DOI":"10.18653\/v1\/D15-1202"},{"key":"1454_CR73","doi-asserted-by":"crossref","unstructured":"Yenduri G, M R, G CS, Y S, Srivastava G, Maddikunta PKR, G DR, Jhaveri RH, B P, Wang W, Vasilakos AV, Gadekallu TR (2023) Generative pre-trained transformer: A comprehensive review on enabling technologies, potential applications, emerging challenges, and future directions. arXiv:2305.10435","DOI":"10.1109\/ACCESS.2024.3389497"},{"issue":"4","key":"1454_CR74","doi-asserted-by":"publisher","first-page":"2645","DOI":"10.1007\/s11063-020-10404-7","volume":"54","author":"H Zhang","year":"2021","unstructured":"Zhang H, Lu G, Zhan M, Zhang B (2021) Semi-supervised classification of graph convolutional networks with laplacian rank constraints. Neural Process Lett 54(4):2645\u20132656","journal-title":"Neural Process Lett"},{"key":"1454_CR75","doi-asserted-by":"crossref","unstructured":"Zhou L, Dai S, Chen L (2015) Learn to solve algebra word problems using quadratic programming. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 817\u2013822. Association for Computational Linguistics, Lisbon, Portugal","DOI":"10.18653\/v1\/D15-1096"},{"key":"1454_CR76","doi-asserted-by":"crossref","unstructured":"Koncel-Kedziorski R, Roy S, Amini A, Kushman N, Hajishirzi H (2016) MAWPS: A math word problem repository. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 1152\u20131157. Association for Computational Linguistics, San Diego, California","DOI":"10.18653\/v1\/N16-1136"},{"key":"1454_CR77","doi-asserted-by":"crossref","unstructured":"Huang D, Shi S, Lin CY, Yin J, Ma WY (2016) How well do computers solve math word problems? Large-scale dataset construction and evaluation. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 887\u2013896. Association for Computational Linguistics, Berlin, Germany","DOI":"10.18653\/v1\/P16-1084"},{"key":"1454_CR78","doi-asserted-by":"crossref","unstructured":"Upadhyay S, Chang MW (2017) Annotating derivations: A new evaluation strategy and dataset for algebra word problems. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, pp. 494\u2013504. Association for Computational Linguistics, Valencia, Spain","DOI":"10.18653\/v1\/E17-1047"},{"key":"1454_CR79","doi-asserted-by":"crossref","unstructured":"Patel A, Bhattamishra S, Goyal N (2021) Are NLP models really able to solve simple math word problems? In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 2080\u20132094. Association for Computational Linguistics, Bangkok, Thailand","DOI":"10.18653\/v1\/2021.naacl-main.168"}],"container-title":["Complex &amp; Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-024-01454-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40747-024-01454-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-024-01454-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,17]],"date-time":"2024-07-17T17:30:32Z","timestamp":1721237432000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40747-024-01454-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,5,22]]},"references-count":79,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2024,8]]}},"alternative-id":["1454"],"URL":"https:\/\/doi.org\/10.1007\/s40747-024-01454-8","relation":{},"ISSN":["2199-4536","2198-6053"],"issn-type":[{"value":"2199-4536","type":"print"},{"value":"2198-6053","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,5,22]]},"assertion":[{"value":"27 March 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 April 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 May 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no Conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}