{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,1]],"date-time":"2026-06-01T12:44:19Z","timestamp":1780317859385,"version":"3.54.1"},"publisher-location":"New York, NY, USA","reference-count":45,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,11,3]],"date-time":"2021-11-03T00:00:00Z","timestamp":1635897600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,11,3]]},"DOI":"10.1145\/3490354.3494413","type":"proceedings-article","created":{"date-parts":[[2022,5,5]],"date-time":"2022-05-05T03:50:06Z","timestamp":1651722606000},"page":"1-9","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":21,"title":["FinRL-podracer"],"prefix":"10.1145","author":[{"given":"Zechu","family":"Li","sequence":"first","affiliation":[{"name":"Columbia University"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiao-Yang","family":"Liu","sequence":"additional","affiliation":[{"name":"Columbia University"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jiahao","family":"Zheng","sequence":"additional","affiliation":[{"name":"Shenzhen Inst. of Advanced Tech."}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Zhaoran","family":"Wang","sequence":"additional","affiliation":[{"name":"Northwestern University"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Anwar","family":"Walid","sequence":"additional","affiliation":[{"name":"Amazon &amp; Columbia University"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jian","family":"Guo","sequence":"additional","affiliation":[{"name":"IDEA Research"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2022,5,4]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"What Is MLOps? In Beginning MLOps with MLFlow","author":"Alla Sridhar","unstructured":"Sridhar Alla and Suman Kalyan Adari . 2021. What Is MLOps? In Beginning MLOps with MLFlow . Springer , 79--124. Sridhar Alla and Suman Kalyan Adari. 2021. What Is MLOps? In Beginning MLOps with MLFlow. Springer, 79--124."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1080\/14697688.2019.1571683"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICSESS47205.2019.9040728"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/3383455.3422529"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/MM.2021.3061394"},{"key":"e_1_3_2_1_6_1","volume-title":"Jan. 07","author":"Cloud Google","year":"2020","unstructured":"Google Cloud . 2020. MLOps: Continuous delivery and automation pipelines in machine learning. https:\/\/cloud.google.com\/architecture\/mlops-continuous-delivery-and-automation-pipelines-in-machine-learning#mlops_level_0_manual_process. Google Cloud , Jan. 07 , 2020 . Google Cloud. 2020. MLOps: Continuous delivery and automation pipelines in machine learning. https:\/\/cloud.google.com\/architecture\/mlops-continuous-delivery-and-automation-pipelines-in-machine-learning#mlops_level_0_manual_process. Google Cloud, Jan. 07, 2020."},{"key":"e_1_3_2_1_7_1","first-page":"2021","article-title":"It's time for businesses to chart a course for reinforcement learning. https:\/\/www.mckinsey.com\/business-functions\/mckinsey-analytics\/our-insights\/its-time-for-businesses-to-chart-a-course-for-reinforcement-learning. McKinsey Analytics","volume":"01","author":"Corbo Jacomo","year":"2021","unstructured":"Jacomo Corbo , Oliver Flemin , and Nicolas Hohn . 2021 . It's time for businesses to chart a course for reinforcement learning. https:\/\/www.mckinsey.com\/business-functions\/mckinsey-analytics\/our-insights\/its-time-for-businesses-to-chart-a-course-for-reinforcement-learning. McKinsey Analytics , Apirl. 01 , 2021 . Jacomo Corbo, Oliver Flemin, and Nicolas Hohn. 2021. It's time for businesses to chart a course for reinforcement learning. https:\/\/www.mckinsey.com\/business-functions\/mckinsey-analytics\/our-insights\/its-time-for-businesses-to-chart-a-course-for-reinforcement-learning. McKinsey Analytics, Apirl. 01, 2021.","journal-title":"Apirl."},{"key":"e_1_3_2_1_8_1","volume-title":"Reinforcement learning in stock trading. ICCSAMA","author":"Dang Quang-Vinh","year":"2019","unstructured":"Quang-Vinh Dang . 2019. Reinforcement learning in stock trading. ICCSAMA ( 2019 ). Quang-Vinh Dang. 2019. Reinforcement learning in stock trading. ICCSAMA (2019)."},{"key":"e_1_3_2_1_9_1","unstructured":"DLR-RM. 2021. Stable-baseline 3. https:\/\/github.com\/DLR-RM\/stable-baselines3. DLR-RM. 2021. Stable-baseline 3. https:\/\/github.com\/DLR-RM\/stable-baselines3."},{"key":"e_1_3_2_1_10_1","volume-title":"Proceedings of the International Conference on Learning Representations (ICLR).","author":"Espeholt Lasse","year":"2020","unstructured":"Lasse Espeholt , Rapha\u00ebl Marinier , Piotr Stanczyk , Ke Wang , and Marcin Michalski . 2020 . SEED RL: scalable and efficient deep-RL with accelerated central inference . In Proceedings of the International Conference on Learning Representations (ICLR). Lasse Espeholt, Rapha\u00ebl Marinier, Piotr Stanczyk, Ke Wang, and Marcin Michalski. 2020. SEED RL: scalable and efficient deep-RL with accelerated central inference. In Proceedings of the International Conference on Learning Representations (ICLR)."},{"key":"e_1_3_2_1_11_1","volume-title":"Proceedings of the International Conference on Machine Learning (ICML).","author":"Espeholt Lasse","year":"2018","unstructured":"Lasse Espeholt , Hubert Soyer , Remi Munos , Karen Simonyan , Volodymir Mnih , Tom Ward , Yotam Doron , Vlad Firoiu , Tim Harley , Iain Dunning , Shane Legg , and Koray Kavukcuoglu . 2018 . IMPALA: scalable distributed deep-RL with importance weighted actor-learner architectures . In Proceedings of the International Conference on Machine Learning (ICML). Lasse Espeholt, Hubert Soyer, Remi Munos, Karen Simonyan, Volodymir Mnih, Tom Ward, Yotam Doron, Vlad Firoiu, Tim Harley, Iain Dunning, Shane Legg, and Koray Kavukcuoglu. 2018. IMPALA: scalable distributed deep-RL with importance weighted actor-learner architectures. In Proceedings of the International Conference on Machine Learning (ICML)."},{"key":"e_1_3_2_1_12_1","volume-title":"Podracer architectures for scalable reinforcement learning. ArXiv abs\/2104.06272","author":"Hessel Matteo","year":"2021","unstructured":"Matteo Hessel , Manuel Kroiss , Aidan Clark , Iurii Kemaev , John Quan , Thomas Keck , Fabio Viola , and Hado van Hasselt . 2021. Podracer architectures for scalable reinforcement learning. ArXiv abs\/2104.06272 ( 2021 ). Matteo Hessel, Manuel Kroiss, Aidan Clark, Iurii Kemaev, John Quan, Thomas Keck, Fabio Viola, and Hado van Hasselt. 2021. Podracer architectures for scalable reinforcement learning. ArXiv abs\/2104.06272 (2021)."},{"key":"e_1_3_2_1_13_1","unstructured":"Ashley Hill Antonin Raffin Maximilian Ernestus Adam Gleave Anssi Kanervisto Rene Traore Prafulla Dhariwal Christopher Hesse Oleg Klimov Alex Nichol Matthias Plappert Alec Radford John Schulman Szymon Sidor and Yuhuai Wu. 2018. Stable baselines. https:\/\/github.com\/hill-a\/stable-baselines. Ashley Hill Antonin Raffin Maximilian Ernestus Adam Gleave Anssi Kanervisto Rene Traore Prafulla Dhariwal Christopher Hesse Oleg Klimov Alex Nichol Matthias Plappert Alec Radford John Schulman Szymon Sidor and Yuhuai Wu. 2018. Stable baselines. https:\/\/github.com\/hill-a\/stable-baselines."},{"key":"e_1_3_2_1_14_1","volume-title":"A deep reinforcement learning framework for the financial portfolio management problem. ArXiv abs\/1706.10059","author":"Jiang Zhengyao","year":"2017","unstructured":"Zhengyao Jiang , Dixing Xu , and Jinjun Liang . 2017. A deep reinforcement learning framework for the financial portfolio management problem. ArXiv abs\/1706.10059 ( 2017 ). Zhengyao Jiang, Dixing Xu, and Jinjun Liang. 2017. A deep reinforcement learning framework for the financial portfolio management problem. ArXiv abs\/1706.10059 (2017)."},{"key":"e_1_3_2_1_15_1","volume-title":"Out of control: The rise of neo-biological civilization","author":"Kelly Kevin","unstructured":"Kevin Kelly . 1994. Out of control: The rise of neo-biological civilization . Addison-Wesley Longman Publishing Co., Inc. Kevin Kelly. 1994. Out of control: The rise of neo-biological civilization. Addison-Wesley Longman Publishing Co., Inc."},{"key":"e_1_3_2_1_16_1","volume-title":"Morgan Securities LLC","author":"Kolanovic Marko","year":"2017","unstructured":"Marko Kolanovic and Rajesh T. Krishnamachari . 2017. Big data and AI strategies: machine learning and alternative data approach to investing. https:\/\/www.cognitivefinance.ai\/single-post\/big-data-and-ai-strategies. J.P . Morgan Securities LLC , May . 18, 2017 . Marko Kolanovic and Rajesh T. Krishnamachari. 2017. Big data and AI strategies: machine learning and alternative data approach to investing. https:\/\/www.cognitivefinance.ai\/single-post\/big-data-and-ai-strategies. J.P. Morgan Securities LLC, May. 18, 2017."},{"key":"e_1_3_2_1_17_1","volume-title":"Econometrics: Mathematical Methods & Programming e Journal","author":"Kolm Petter N.","year":"2019","unstructured":"Petter N. Kolm and G. Ritter . 2019 . Modern perspectives on reinforcement learning in finance. Econometrics: Mathematical Methods & Programming e Journal (2019). Petter N. Kolm and G. Ritter. 2019. Modern perspectives on reinforcement learning in finance. Econometrics: Mathematical Methods & Programming e Journal (2019)."},{"key":"e_1_3_2_1_18_1","volume-title":"ICML Workshop on Applications and Infrastructure for Multi-Agent Learning","author":"Li Xinyi","year":"2019","unstructured":"Xinyi Li , Yinchuan Li , Yuancheng Zhan , and Xiao-Yang Liu . 2019 . Optimistic bull or pessimistic bear: Adaptive deep reinforcement learning for stock portfolio allocation . ICML Workshop on Applications and Infrastructure for Multi-Agent Learning (2019). Xinyi Li, Yinchuan Li, Yuancheng Zhan, and Xiao-Yang Liu. 2019. Optimistic bull or pessimistic bear: Adaptive deep reinforcement learning for stock portfolio allocation. ICML Workshop on Applications and Infrastructure for Multi-Agent Learning (2019)."},{"key":"e_1_3_2_1_19_1","volume-title":"Ray RLLib: a composable and scalable reinforcement learning library. ArXiv abs\/1712.09381","author":"Liang Eric","year":"2017","unstructured":"Eric Liang , Richard Liaw , Robert Nishihara , Philipp Moritz , Roy Fox , Joseph Gonzalez , Ken Goldberg , and Ion Stoica . 2017. Ray RLLib: a composable and scalable reinforcement learning library. ArXiv abs\/1712.09381 ( 2017 ). Eric Liang, Richard Liaw, Robert Nishihara, Philipp Moritz, Roy Fox, Joseph Gonzalez, Ken Goldberg, and Ion Stoica. 2017. Ray RLLib: a composable and scalable reinforcement learning library. ArXiv abs\/1712.09381 (2017)."},{"key":"e_1_3_2_1_20_1","volume-title":"Conf. on Robot Learning (CoRL).","author":"Liang Jacky","unstructured":"Jacky Liang , Viktor Makoviychuk , A. Handa , N. Chentanez , M. Macklin , and D. Fox . 2018. GPU-accelerated robotic simulation for distributed reinforcement learning . In Conf. on Robot Learning (CoRL). Jacky Liang, Viktor Makoviychuk, A. Handa, N. Chentanez, M. Macklin, and D. Fox. 2018. GPU-accelerated robotic simulation for distributed reinforcement learning. In Conf. on Robot Learning (CoRL)."},{"key":"e_1_3_2_1_21_1","volume-title":"Continuous control with deep reinforcement learning. CoRR abs\/1509.02971","author":"Lillicrap T.","year":"2016","unstructured":"T. Lillicrap , Jonathan J. Hunt , A. Pritzel , N. Heess , T. Erez , Yuval Tassa , D. Silver , and Daan Wierstra . 2016. Continuous control with deep reinforcement learning. CoRR abs\/1509.02971 ( 2016 ). T. Lillicrap, Jonathan J. Hunt, A. Pritzel, N. Heess, T. Erez, Yuval Tassa, D. Silver, and Daan Wierstra. 2016. Continuous control with deep reinforcement learning. CoRR abs\/1509.02971 (2016)."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCC.2018.111121612"},{"key":"e_1_3_2_1_23_1","unstructured":"Xiao-Yang Liu Zechu Li Zhaoran Wang and Jiahao Zheng. 2021. ElegantRL: A Scalable and Elastic Deep Reinforcement Learning Library. https:\/\/github.com\/AI4Finance-Foundation\/ElegantRL. Xiao-Yang Liu Zechu Li Zhaoran Wang and Jiahao Zheng. 2021. ElegantRL: A Scalable and Elastic Deep Reinforcement Learning Library. https:\/\/github.com\/AI4Finance-Foundation\/ElegantRL."},{"key":"e_1_3_2_1_24_1","volume-title":"Deep Reinforcement Learning Workshop at NeurIPS","author":"Liu Xiao-Yang","year":"2021","unstructured":"Xiao-Yang Liu , Zechu Li , Zhuoran Yang , Jiahao Zheng , Zhaoran Wang , Anwar Walid , Jiang Guo , and Michael Jordan . 2021 . ElegantRL-Podracer: Scalable and elastic library for cloud-native deep reinforcement learning . Deep Reinforcement Learning Workshop at NeurIPS (2021). Xiao-Yang Liu, Zechu Li, Zhuoran Yang, Jiahao Zheng, Zhaoran Wang, Anwar Walid, Jiang Guo, and Michael Jordan. 2021. ElegantRL-Podracer: Scalable and elastic library for cloud-native deep reinforcement learning. Deep Reinforcement Learning Workshop at NeurIPS (2021)."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.2139\/ssrn.3737859"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.2139\/ssrn.3955949"},{"key":"e_1_3_2_1_27_1","volume-title":"Isaac Gym: High performance GPU-based physics simulation for robot learning. arXiv preprint arXiv:2108.10470","author":"Makoviychuk Viktor","year":"2021","unstructured":"Viktor Makoviychuk , Lukasz Wawrzyniak , Yunrong Guo , Michelle Lu , Kier Storey , Miles Macklin , David Hoeller , Nikita Rudin , Arthur Allshire , Ankur Handa , 2021 . Isaac Gym: High performance GPU-based physics simulation for robot learning. arXiv preprint arXiv:2108.10470 (2021). Viktor Makoviychuk, Lukasz Wawrzyniak, Yunrong Guo, Michelle Lu, Kier Storey, Miles Macklin, David Hoeller, Nikita Rudin, Arthur Allshire, Ankur Handa, et al. 2021. Isaac Gym: High performance GPU-based physics simulation for robot learning. arXiv preprint arXiv:2108.10470 (2021)."},{"key":"e_1_3_2_1_28_1","volume-title":"NVIDIA","author":"Merritt Rick","year":"2020","unstructured":"Rick Merritt . 2020 . What Is MLOps? https:\/\/blogs.nvidia.com\/blog\/2020\/09\/03\/what-is-mlops\/ . NVIDIA , Sep. 03, 2020. Rick Merritt. 2020. What Is MLOps? https:\/\/blogs.nvidia.com\/blog\/2020\/09\/03\/what-is-mlops\/. NVIDIA, Sep. 03, 2020."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"crossref","unstructured":"Melanie Mitchell. 1996. An introduction to genetic algorithms. Melanie Mitchell. 1996. An introduction to genetic algorithms.","DOI":"10.7551\/mitpress\/3927.001.0001"},{"key":"e_1_3_2_1_30_1","volume-title":"Human-level control through deep reinforcement learning. Nature 518(7540)","author":"Mnih V","year":"2015","unstructured":"V Mnih , K Kavukcuoglu , D Silver , Andrei A. Rusu , Joel Veness , Marc G. Bellemare , Alex Graves , Martin Riedmiller , Andreas K. Fidjeland , Georg Ostrovski , Stig Petersen , Charles Beattie , Amir Sadik , Ioannis Antonoglou , Helen King , Dharshan Kumaran , Daan Wierstra , Shane Legg , and Demis Hassabis . 2015. Human-level control through deep reinforcement learning. Nature 518(7540) ( 2015 ), 529--533. V Mnih, K Kavukcuoglu, D Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, and Demis Hassabis. 2015. Human-level control through deep reinforcement learning. Nature 518(7540) (2015), 529--533."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/MC.2011.31"},{"key":"e_1_3_2_1_32_1","volume-title":"Proximal policy optimization algorithms. ArXiv abs\/1707.06347","author":"Schulman John","year":"2017","unstructured":"John Schulman , F. Wolski , Prafulla Dhariwal , Alec Radford , and Oleg Klimov . 2017. Proximal policy optimization algorithms. ArXiv abs\/1707.06347 ( 2017 ). John Schulman, F. Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. 2017. Proximal policy optimization algorithms. ArXiv abs\/1707.06347 (2017)."},{"key":"e_1_3_2_1_33_1","unstructured":"Wharton Research Data Service. 2015. Standard & poor's compustat. Data retrieved from Wharton Research Data Service. Wharton Research Data Service. 2015. Standard & poor's compustat. Data retrieved from Wharton Research Data Service."},{"key":"e_1_3_2_1_34_1","volume-title":"Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al.","author":"Silver David","year":"2016","unstructured":"David Silver , Aja Huang , Chris J Maddison , Arthur Guez , Laurent Sifre , George Van Den Driessche , Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al. 2016 . Mastering the game of Go with deep neural networks and tree search. nature 529, 7587 (2016), 484--489. David Silver, Aja Huang, Chris J Maddison, Arthur Guez, Laurent Sifre, George Van Den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al. 2016. Mastering the game of Go with deep neural networks and tree search. nature 529, 7587 (2016), 484--489."},{"key":"e_1_3_2_1_35_1","volume-title":"Mastering the game of Go with deep neural networks and tree search. Nature 529(7587)","author":"Silver David","year":"2016","unstructured":"David Silver , Aja Huang , Chris J. Maddison , Arthur Guez , Laurent Sifre , George van den Driessche , Julian Schrittwieser , Ioannis Antonoglou , Veda Panneershelvam , Marc Lanctot , Sander Dieleman , Dominik Grewe , John Nham , Nal Kalchbrenner , Ilya Sutskever , Timothy Lillicrap , Madeleine Leach , Koray Kavukcuoglu , Thore Graepel , and Demis Hassabis . 2016. Mastering the game of Go with deep neural networks and tree search. Nature 529(7587) ( 2016 ), 484--489. David Silver, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George van den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, Sander Dieleman, Dominik Grewe, John Nham, Nal Kalchbrenner, Ilya Sutskever, Timothy Lillicrap, Madeleine Leach, Koray Kavukcuoglu, Thore Graepel, and Demis Hassabis. 2016. Mastering the game of Go with deep neural networks and tree search. Nature 529(7587) (2016), 484--489."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"crossref","unstructured":"David Silver Julian Schrittwieser Karen Simonyan Ioannis Antonoglou Aja Huang Arthur Guez Thomas Hubert Lucas Baker Matthew Lai Adrian Bolton etal 2017. Mastering the game of go without human knowledge. nature 550 7676 (2017) 354--359. David Silver Julian Schrittwieser Karen Simonyan Ioannis Antonoglou Aja Huang Arthur Guez Thomas Hubert Lucas Baker Matthew Lai Adrian Bolton et al. 2017. Mastering the game of go without human knowledge. nature 550 7676 (2017) 354--359.","DOI":"10.1038\/nature24270"},{"key":"e_1_3_2_1_37_1","volume-title":"Reinforcement learning: An introduction","author":"Sutton Richard S","unstructured":"Richard S Sutton and Andrew G Barto . 2018. Reinforcement learning: An introduction . MIT press . Richard S Sutton and Andrew G Barto. 2018. Reinforcement learning: An introduction. MIT press."},{"key":"e_1_3_2_1_38_1","volume-title":"NVIDIA DGX SuperPOD: Scalable infrastructure for AI leadership","author":"NVIDIA DGX","unstructured":"NVIDIA DGX A100 system reference architecture. 2020. NVIDIA DGX SuperPOD: Scalable infrastructure for AI leadership . NVIDIA Corporation . NVIDIA DGX A100 system reference architecture. 2020. NVIDIA DGX SuperPOD: Scalable infrastructure for AI leadership. NVIDIA Corporation."},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/SYNASC51798.2020.00015"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/2500117"},{"key":"e_1_3_2_1_41_1","unstructured":"Google Trends. 2021. What Is MLOps? https:\/\/www.google.com\/trends. Google Trends. 2021. What Is MLOps? https:\/\/www.google.com\/trends."},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/IJCNN.2019.8851831"},{"key":"e_1_3_2_1_43_1","volume-title":"NeurIPS Workshop","author":"Xiong Zhuoran","year":"2018","unstructured":"Zhuoran Xiong , Xiao-Yang Liu , Shan Zhong , Hongyang Yang , and Anwar Walid . 2018 . Practical deep reinforcement learning approach for stock trading . NeurIPS Workshop (2018). Zhuoran Xiong, Xiao-Yang Liu, Shan Zhong, Hongyang Yang, and Anwar Walid. 2018. Practical deep reinforcement learning approach for stock trading. NeurIPS Workshop (2018)."},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/3383455.3422540"},{"key":"e_1_3_2_1_45_1","volume-title":"Deep reinforcement learning for trading. The Journal of Financial Data Science 2(2)","author":"Zhang Zihao","year":"2020","unstructured":"Zihao Zhang , Stefan Zohren , and Stephen Roberts . 2020. Deep reinforcement learning for trading. The Journal of Financial Data Science 2(2) ( 2020 ), 25--40. Zihao Zhang, Stefan Zohren, and Stephen Roberts. 2020. Deep reinforcement learning for trading. The Journal of Financial Data Science 2(2) (2020), 25--40."}],"event":{"name":"ICAIF'21: 2nd ACM International Conference on AI in Finance","location":"Virtual Event","acronym":"ICAIF'21","sponsor":["ACM Association for Computing Machinery"]},"container-title":["Proceedings of the Second ACM International Conference on AI in Finance"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3490354.3494413","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3490354.3494413","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T19:30:42Z","timestamp":1750188642000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3490354.3494413"}},"subtitle":["high performance and scalable deep reinforcement learning for quantitative finance"],"short-title":[],"issued":{"date-parts":[[2021,11,3]]},"references-count":45,"alternative-id":["10.1145\/3490354.3494413","10.1145\/3490354"],"URL":"https:\/\/doi.org\/10.1145\/3490354.3494413","relation":{},"subject":[],"published":{"date-parts":[[2021,11,3]]},"assertion":[{"value":"2022-05-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}