{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T16:21:49Z","timestamp":1759940509346,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":27,"publisher":"ACM","license":[{"start":{"date-parts":[[2023,8,4]],"date-time":"2023-08-04T00:00:00Z","timestamp":1691107200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2023,8,6]]},"DOI":"10.1145\/3580305.3599816","type":"proceedings-article","created":{"date-parts":[[2023,8,4]],"date-time":"2023-08-04T18:13:58Z","timestamp":1691172838000},"page":"4733-4742","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Entity-aware Multi-task Learning for Query Understanding at Walmart"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9870-4422","authenticated-orcid":false,"given":"Zhiyuan","family":"Peng","sequence":"first","affiliation":[{"name":"Santa Clara University, Santa Clara, CA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-1963-8415","authenticated-orcid":false,"given":"Vachik","family":"Dave","sequence":"additional","affiliation":[{"name":"Walmart Global Tech, Sunnyvale, CA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-7951-2720","authenticated-orcid":false,"given":"Nicole","family":"McNabb","sequence":"additional","affiliation":[{"name":"Walmart Global Tech, Sunnyvale, CA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-9080-5287","authenticated-orcid":false,"given":"Rahul","family":"Sharnagat","sequence":"additional","affiliation":[{"name":"Walmart Global Tech, Sunnyvale, CA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0345-9629","authenticated-orcid":false,"given":"Alessandro","family":"Magnani","sequence":"additional","affiliation":[{"name":"Walmart Global Tech, Sunnyvale, CA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6788-139X","authenticated-orcid":false,"given":"Ciya","family":"Liao","sequence":"additional","affiliation":[{"name":"Walmart Global Tech, Sunnyvale, CA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6572-4315","authenticated-orcid":false,"given":"Yi","family":"Fang","sequence":"additional","affiliation":[{"name":"Santa Clara University, Santa Clara, CA, USA"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-1813-9995","authenticated-orcid":false,"given":"Sravanthi","family":"Rajanala","sequence":"additional","affiliation":[{"name":"Walmart Global Tech, Sunnyvale, CA, USA"}]}],"member":"320","published-online":{"date-parts":[[2023,8,4]]},"reference":[{"key":"e_1_3_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/1273496.1273513"},{"key":"e_1_3_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/BigData50022.2020.9378495"},{"key":"e_1_3_2_2_3_1","volume-title":"Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 1759--1762","author":"Nobari Arash Dargahi","year":"2018","unstructured":"Arash Dargahi Nobari , Arian Askari , Faegheh Hasibi , and Mahmood Neshati . 2018 . Query understanding via entity attribute identification . In Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 1759--1762 . Arash Dargahi Nobari, Arian Askari, Faegheh Hasibi, and Mahmood Neshati. 2018. Query understanding via entity attribute identification. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 1759--1762."},{"key":"e_1_3_2_2_4_1","volume-title":"Contextual BERT: Conditioning the language model using a global state. arXiv preprint arXiv:2010.15778","author":"Denk Timo I","year":"2020","unstructured":"Timo I Denk and Ana Peleteiro Ramallo . 2020. Contextual BERT: Conditioning the language model using a global state. arXiv preprint arXiv:2010.15778 ( 2020 ). Timo I Denk and Ana Peleteiro Ramallo. 2020. Contextual BERT: Conditioning the language model using a global state. arXiv preprint arXiv:2010.15778 (2020)."},{"key":"e_1_3_2_2_5_1","first-page":"1","article-title":"Switch transformers: Scaling to trillion parameter models with simple and efficient sparsity","volume":"23","author":"Fedus William","year":"2021","unstructured":"William Fedus , Barret Zoph , and Noam Shazeer . 2021 . Switch transformers: Scaling to trillion parameter models with simple and efficient sparsity . J. Mach. Learn. Res , Vol. 23 (2021), 1 -- 40 . William Fedus, Barret Zoph, and Noam Shazeer. 2021. Switch transformers: Scaling to trillion parameter models with simple and efficient sparsity. J. Mach. Learn. Res, Vol. 23 (2021), 1--40.","journal-title":"J. Mach. Learn. Res"},{"key":"e_1_3_2_2_6_1","volume-title":"Sparsely activated mixture-of-experts are robust multi-task learners. arXiv preprint arXiv:2204.07689","author":"Gupta Shashank","year":"2022","unstructured":"Shashank Gupta , Subhabrata Mukherjee , Krishan Subudhi , Eduardo Gonzalez , Damien Jose , Ahmed H Awadallah , and Jianfeng Gao . 2022. Sparsely activated mixture-of-experts are robust multi-task learners. arXiv preprint arXiv:2204.07689 ( 2022 ). Shashank Gupta, Subhabrata Mukherjee, Krishan Subudhi, Eduardo Gonzalez, Damien Jose, Ahmed H Awadallah, and Jianfeng Gao. 2022. Sparsely activated mixture-of-experts are robust multi-task learners. arXiv preprint arXiv:2204.07689 (2022)."},{"key":"e_1_3_2_2_7_1","volume-title":"Embracing change: Continual learning in deep neural networks. Trends in cognitive sciences","author":"Hadsell Raia","year":"2020","unstructured":"Raia Hadsell , Dushyant Rao , Andrei A Rusu , and Razvan Pascanu . 2020. Embracing change: Continual learning in deep neural networks. Trends in cognitive sciences , Vol. 24 , 12 ( 2020 ), 1028--1040. Raia Hadsell, Dushyant Rao, Andrei A Rusu, and Razvan Pascanu. 2020. Embracing change: Continual learning in deep neural networks. Trends in cognitive sciences, Vol. 24, 12 (2020), 1028--1040."},{"key":"e_1_3_2_2_8_1","volume-title":"Short Text Pre-training with Extended Token Classification for E-commerce Query Understanding. arXiv preprint arXiv:2210.03915","author":"Jiang Haoming","year":"2022","unstructured":"Haoming Jiang , Tianyu Cao , Zheng Li , Chen Luo , Xianfeng Tang , Qingyu Yin , Danqing Zhang , Rahul Goutam , and Bing Yin . 2022. Short Text Pre-training with Extended Token Classification for E-commerce Query Understanding. arXiv preprint arXiv:2210.03915 ( 2022 ). Haoming Jiang, Tianyu Cao, Zheng Li, Chen Luo, Xianfeng Tang, Qingyu Yin, Danqing Zhang, Rahul Goutam, and Bing Yin. 2022. Short Text Pre-training with Extended Token Classification for E-commerce Query Understanding. arXiv preprint arXiv:2210.03915 (2022)."},{"key":"e_1_3_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/TBDATA.2019.2921572"},{"key":"e_1_3_2_2_10_1","volume-title":"Proceedings of NAACL-HLT. 4171--4186","author":"Ming-Wei Chang Jacob Devlin","year":"2019","unstructured":"Jacob Devlin Ming-Wei Chang Kenton and Lee Kristina Toutanova . 2019 . BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding . In Proceedings of NAACL-HLT. 4171--4186 . Jacob Devlin Ming-Wei Chang Kenton and Lee Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of NAACL-HLT. 4171--4186."},{"key":"e_1_3_2_2_11_1","volume-title":"Graph Enhanced BERT for Query Understanding. arXiv preprint arXiv:2204.06522","author":"Li Juanhui","year":"2022","unstructured":"Juanhui Li , Yao Ma , Wei Zeng , Suqi Cheng , Jiliang Tang , Shuaiqiang Wang , and Dawei Yin . 2022. Graph Enhanced BERT for Query Understanding. arXiv preprint arXiv:2204.06522 ( 2022 ). Juanhui Li, Yao Ma, Wei Zeng, Suqi Cheng, Jiliang Tang, Shuaiqiang Wang, and Dawei Yin. 2022. Graph Enhanced BERT for Query Understanding. arXiv preprint arXiv:2204.06522 (2022)."},{"key":"e_1_3_2_2_12_1","first-page":"1","article-title":"Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing","volume":"55","author":"Liu Pengfei","year":"2023","unstructured":"Pengfei Liu , Weizhe Yuan , Jinlan Fu , Zhengbao Jiang , Hiroaki Hayashi , and Graham Neubig . 2023 . Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing . Comput. Surveys , Vol. 55 , 9 (2023), 1 -- 35 . Pengfei Liu, Weizhe Yuan, Jinlan Fu, Zhengbao Jiang, Hiroaki Hayashi, and Graham Neubig. 2023. Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing. Comput. Surveys, Vol. 55, 9 (2023), 1--35.","journal-title":"Comput. Surveys"},{"key":"e_1_3_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1441"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3534678.3539055"},{"key":"e_1_3_2_2_15_1","unstructured":"Hiroki Nakayama. 2018. seqeval: A Python framework for sequence labeling evaluation. https:\/\/github.com\/chakki-works\/seqeval Software available from https:\/\/github.com\/chakki-works\/seqeval.  Hiroki Nakayama. 2018. seqeval: A Python framework for sequence labeling evaluation. https:\/\/github.com\/chakki-works\/seqeval Software available from https:\/\/github.com\/chakki-works\/seqeval."},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3471158.3472246"},{"key":"e_1_3_2_2_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/3219819.3219870"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D19-1410"},{"key":"e_1_3_2_2_19_1","volume-title":"Representing Text Chunks. In Ninth Conference of the European Chapter of the Association for Computational Linguistics. 173--179","author":"Kim Sang Erik Tjong","year":"1999","unstructured":"Erik Tjong Kim Sang and Jorn Veenstra . 1999 . Representing Text Chunks. In Ninth Conference of the European Chapter of the Association for Computational Linguistics. 173--179 . Erik Tjong Kim Sang and Jorn Veenstra. 1999. Representing Text Chunks. In Ninth Conference of the European Chapter of the Association for Computational Linguistics. 173--179."},{"key":"e_1_3_2_2_20_1","volume-title":"Outrageously large neural networks: The sparsely-gated mixture-of-experts layer. arXiv preprint arXiv:1701.06538","author":"Shazeer Noam","year":"2017","unstructured":"Noam Shazeer , Azalia Mirhoseini , Krzysztof Maziarz , Andy Davis , Quoc Le , Geoffrey Hinton , and Jeff Dean . 2017. Outrageously large neural networks: The sparsely-gated mixture-of-experts layer. arXiv preprint arXiv:1701.06538 ( 2017 ). Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, and Jeff Dean. 2017. Outrageously large neural networks: The sparsely-gated mixture-of-experts layer. arXiv preprint arXiv:1701.06538 (2017)."},{"key":"e_1_3_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/3383313.3412236"},{"key":"e_1_3_2_2_22_1","volume-title":"Attention is all you need. Advances in neural information processing systems","author":"Vaswani Ashish","year":"2017","unstructured":"Ashish Vaswani , Noam Shazeer , Niki Parmar , Jakob Uszkoreit , Llion Jones , Aidan N Gomez , \u0141ukasz Kaiser , and Illia Polosukhin . 2017. Attention is all you need. Advances in neural information processing systems , Vol. 30 ( 2017 ). Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, \u0141ukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems, Vol. 30 (2017)."},{"key":"e_1_3_2_2_23_1","volume-title":"Improving named entity recognition by external context retrieving and cooperative learning. arXiv preprint arXiv:2105.03654","author":"Wang Xinyu","year":"2021","unstructured":"Xinyu Wang , Yong Jiang , Nguyen Bach , Tao Wang , Zhongqiang Huang , Fei Huang , and Kewei Tu. 2021. Improving named entity recognition by external context retrieving and cooperative learning. arXiv preprint arXiv:2105.03654 ( 2021 ). Xinyu Wang, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang, and Kewei Tu. 2021. Improving named entity recognition by external context retrieving and cooperative learning. arXiv preprint arXiv:2105.03654 (2021)."},{"key":"e_1_3_2_2_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/3485447.3511977"},{"key":"e_1_3_2_2_25_1","volume-title":"Unsupervised data augmentation for consistency training. Advances in neural information processing systems","author":"Xie Qizhe","year":"2020","unstructured":"Qizhe Xie , Zihang Dai , Eduard Hovy , Thang Luong , and Quoc Le. 2020. Unsupervised data augmentation for consistency training. Advances in neural information processing systems , Vol. 33 ( 2020 ), 6256--6268. Qizhe Xie, Zihang Dai, Eduard Hovy, Thang Luong, and Quoc Le. 2020. Unsupervised data augmentation for consistency training. Advances in neural information processing systems, Vol. 33 (2020), 6256--6268."},{"key":"e_1_3_2_2_26_1","volume-title":"SIGIR 2022 Workshop on eCommerce. https:\/\/www.amazon.science\/publications\/advancing-query-rewriting-in-e-commerce-via-shopping-intent-learning","author":"Zhang Mengxiao","year":"2022","unstructured":"Mengxiao Zhang , Yongning Wu , Raif Rustamov , Hongyu Zhu , Haoran Shi , Yuqi Wu , Lei Tang , Zuohua Zhang , and Chu Wang . 2022 . Advancing query rewriting in e-commerce via shopping intent learning . In SIGIR 2022 Workshop on eCommerce. https:\/\/www.amazon.science\/publications\/advancing-query-rewriting-in-e-commerce-via-shopping-intent-learning Mengxiao Zhang, Yongning Wu, Raif Rustamov, Hongyu Zhu, Haoran Shi, Yuqi Wu, Lei Tang, Zuohua Zhang, and Chu Wang. 2022. Advancing query rewriting in e-commerce via shopping intent learning. In SIGIR 2022 Workshop on eCommerce. https:\/\/www.amazon.science\/publications\/advancing-query-rewriting-in-e-commerce-via-shopping-intent-learning"},{"key":"e_1_3_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2021.3070203"}],"event":{"name":"KDD '23: The 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining","sponsor":["SIGMOD ACM Special Interest Group on Management of Data","SIGKDD ACM Special Interest Group on Knowledge Discovery in Data"],"location":"Long Beach CA USA","acronym":"KDD '23"},"container-title":["Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3580305.3599816","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3580305.3599816","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:49:23Z","timestamp":1750182563000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3580305.3599816"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,4]]},"references-count":27,"alternative-id":["10.1145\/3580305.3599816","10.1145\/3580305"],"URL":"https:\/\/doi.org\/10.1145\/3580305.3599816","relation":{},"subject":[],"published":{"date-parts":[[2023,8,4]]},"assertion":[{"value":"2023-08-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}