{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:22:02Z","timestamp":1750220522753,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":22,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,1,2]],"date-time":"2021-01-02T00:00:00Z","timestamp":1609545600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,1,2]]},"DOI":"10.1145\/3430984.3430994","type":"proceedings-article","created":{"date-parts":[[2020,12,28]],"date-time":"2020-12-28T05:34:44Z","timestamp":1609133684000},"page":"272-280","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Anticipatory Decisions in Retail E-Commerce Warehouses using Reinforcement Learning"],"prefix":"10.1145","author":[{"given":"Omkar","family":"Shelke","sequence":"first","affiliation":[{"name":"TCS Research"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Vinita","family":"Baniwal","sequence":"additional","affiliation":[{"name":"TCS Research"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Harshad","family":"Khadilkar","sequence":"additional","affiliation":[{"name":"TCS Research"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2021,1,2]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-36718-3_39"},{"key":"e_1_3_2_1_2_1","volume-title":"An Imitation Learning Approach for Computing Anticipatory Picking Decisions in Retail Distribution Centrese. In 2019 American Control Conference (ACC).","author":"Baniwal Vinita","year":"2019","unstructured":"Vinita Baniwal , Chandrai Kayal , Dheeraj Shah , Padmakumar Ma , and Harshad Khadilkar . 2019 . An Imitation Learning Approach for Computing Anticipatory Picking Decisions in Retail Distribution Centrese. In 2019 American Control Conference (ACC). Vinita Baniwal, Chandrai Kayal, Dheeraj Shah, Padmakumar Ma, and Harshad Khadilkar. 2019. An Imitation Learning Approach for Computing Anticipatory Picking Decisions in Retail Distribution Centrese. In 2019 American Control Conference (ACC)."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ejor.2018.03.026"},{"key":"e_1_3_2_1_4_1","volume-title":"Quantifying the bullwhip effect in a simple supply chain: The impact of forecasting, lead times, and information. Management science 46, 3","author":"Chen Frank","year":"2000","unstructured":"Frank Chen , Zvi Drezner , Jennifer\u00a0 K Ryan , and David Simchi-Levi . 2000. Quantifying the bullwhip effect in a simple supply chain: The impact of forecasting, lead times, and information. Management science 46, 3 ( 2000 ), 436\u2013443. Frank Chen, Zvi Drezner, Jennifer\u00a0K Ryan, and David Simchi-Levi. 2000. Quantifying the bullwhip effect in a simple supply chain: The impact of forecasting, lead times, and information. Management science 46, 3 (2000), 436\u2013443."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1287\/msom.2015.0561"},{"key":"e_1_3_2_1_6_1","unstructured":"Scott Fujimoto Herke van Hoof and David Meger. 2018. Addressing Function Approximation Error in Actor-Critic Methods. arxiv:1802.09477\u00a0[cs.AI]  Scott Fujimoto Herke van Hoof and David Meger. 2018. Addressing Function Approximation Error in Actor-Critic Methods. arxiv:1802.09477\u00a0[cs.AI]"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0925-5273(00)00156-0"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1287\/mnsc.1040.0308"},{"key":"e_1_3_2_1_9_1","unstructured":"Tuomas Haarnoja Aurick Zhou Pieter Abbeel and Sergey Levine. 2018. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. arXiv preprint arXiv:1801.01290(2018).  Tuomas Haarnoja Aurick Zhou Pieter Abbeel and Sergey Levine. 2018. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. arXiv preprint arXiv:1801.01290(2018)."},{"key":"e_1_3_2_1_10_1","unstructured":"Kaggle. Retrieved 08-2018. Instacart Market Basket Analysis Data. https:\/\/www.kaggle.com\/c\/instacart-market-basket-analysis\/data.  Kaggle. Retrieved 08-2018. Instacart Market Basket Analysis Data. https:\/\/www.kaggle.com\/c\/instacart-market-basket-analysis\/data."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00170-004-2069-8"},{"key":"e_1_3_2_1_12_1","volume-title":"Management science 33, 1","author":"Krajewski J","year":"1987","unstructured":"Lee\u00a0 J Krajewski , Barry\u00a0 E King , Larry\u00a0 P Ritzman , and Danny\u00a0 S Wong . 1987. Kanban, MRP , and shaping the manufacturing environment. Management science 33, 1 ( 1987 ), 39\u201357. Lee\u00a0J Krajewski, Barry\u00a0E King, Larry\u00a0P Ritzman, and Danny\u00a0S Wong. 1987. Kanban, MRP, and shaping the manufacturing environment. Management science 33, 1 (1987), 39\u201357."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.promfg.2015.07.316"},{"key":"e_1_3_2_1_14_1","unstructured":"Timothy\u00a0P. Lillicrap Jonathan\u00a0J. Hunt Alexander Pritzel Nicolas Heess Tom Erez Yuval Tassa David Silver and Daan Wierstra. 2015. Continuous control with deep reinforcement learning. CoRR abs\/1509.02971(2015).  Timothy\u00a0P. Lillicrap Jonathan\u00a0J. Hunt Alexander Pritzel Nicolas Heess Tom Erez Yuval Tassa David Silver and Daan Wierstra. 2015. Continuous control with deep reinforcement learning. CoRR abs\/1509.02971(2015)."},{"key":"e_1_3_2_1_15_1","volume-title":"Adaptive Learning Agents Workshop at AAMAS.","author":"Meisheri Hardik","year":"2020","unstructured":"Hardik Meisheri , Vinita Baniwal , Nazneen\u00a0 N Sultana , Harshad Khadilkar , and Balaraman Ravindran . 2020 . Using Reinforcement Learning for a Large Variable-Dimensional Inventory Management Problem . In Adaptive Learning Agents Workshop at AAMAS. Hardik Meisheri, Vinita Baniwal, Nazneen\u00a0N Sultana, Harshad Khadilkar, and Balaraman Ravindran. 2020. Using Reinforcement Learning for a Large Variable-Dimensional Inventory Management Problem. In Adaptive Learning Agents Workshop at AAMAS."},{"key":"e_1_3_2_1_16_1","volume-title":"Human-level control through deep RL. Nature 518, 7540","author":"Mnih Volodymyr","year":"2015","unstructured":"Volodymyr Mnih , Koray Kavukcuoglu , David Silver , Andrei\u00a0 A Rusu , Joel Veness , Marc\u00a0 G Bellemare , Alex Graves , Martin Riedmiller , Andreas\u00a0 K Fidjeland , Georg Ostrovski , 2015. Human-level control through deep RL. Nature 518, 7540 ( 2015 ), 529. Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei\u00a0A Rusu, Joel Veness, Marc\u00a0G Bellemare, Alex Graves, Martin Riedmiller, Andreas\u00a0K Fidjeland, Georg Ostrovski, 2015. Human-level control through deep RL. Nature 518, 7540 (2015), 529."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ejor.2005.11.018"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/11871842_74"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cor.2004.07.014"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1080\/00207549008942761"},{"key":"e_1_3_2_1_21_1","unstructured":"J Spiegel M McKenna G Lakshman and P Nordstrom. 2012. Method and system for anticipatory package shipping. US Patent 8271398.  J Spiegel M McKenna G Lakshman and P Nordstrom. 2012. Method and system for anticipatory package shipping. US Patent 8271398."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1080\/00207547708943149"}],"event":{"name":"CODS COMAD 2021: 8th ACM IKDD CODS and 26th COMAD","acronym":"CODS COMAD 2021","location":"Bangalore India"},"container-title":["Proceedings of the 3rd ACM India Joint International Conference on Data Science &amp; Management of Data (8th ACM IKDD CODS &amp; 26th COMAD)"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3430984.3430994","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3430984.3430994","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:24:43Z","timestamp":1750195483000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3430984.3430994"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,1,2]]},"references-count":22,"alternative-id":["10.1145\/3430984.3430994","10.1145\/3430984"],"URL":"https:\/\/doi.org\/10.1145\/3430984.3430994","relation":{},"subject":[],"published":{"date-parts":[[2021,1,2]]},"assertion":[{"value":"2021-01-02","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}