{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,10]],"date-time":"2025-12-10T08:55:51Z","timestamp":1765356951063,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":66,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,6,22]],"date-time":"2021-06-22T00:00:00Z","timestamp":1624320000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,6,22]]},"DOI":"10.1145\/3447555.3464874","type":"proceedings-article","created":{"date-parts":[[2021,6,23]],"date-time":"2021-06-23T04:49:35Z","timestamp":1624423775000},"page":"199-210","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":24,"title":["Enforcing Policy Feasibility Constraints through Differentiable Projection for Energy Optimization"],"prefix":"10.1145","author":[{"given":"Bingqing","family":"Chen","sequence":"first","affiliation":[{"name":"Carnegie Mellon University, Pittsburgh, PA, USA"}]},{"given":"Priya L.","family":"Donti","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University, Pittsburgh, PA, USA"}]},{"given":"Kyri","family":"Baker","sequence":"additional","affiliation":[{"name":"University of Colorado, Boulder, Boulder, CO, USA"}]},{"given":"J. Zico","family":"Kolter","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University, Pittsburgh, PA, USA"}]},{"given":"Mario","family":"Berg\u00e9s","sequence":"additional","affiliation":[{"name":"Carnegie Mellon University, Pittsburgh, PA, USA"}]}],"member":"320","published-online":{"date-parts":[[2021,6,22]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.5555\/3305381.3305384"},{"key":"e_1_3_2_1_2_1","unstructured":"Akshay Agrawal Brandon Amos Shane Barratt Stephen Boyd Steven Diamond and J Zico Kolter. 2019. Differentiable Convex Optimization Layers. In Advances in Neural Information Processing Systems. 9558--9570.  Akshay Agrawal Brandon Amos Shane Barratt Stephen Boyd Steven Diamond and J Zico Kolter. 2019. Differentiable Convex Optimization Layers. In Advances in Neural Information Processing Systems. 9558--9570."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CDC.2014.7039601"},{"volume-title":"Constrained Markov Decision Processes","author":"Altman Eitan","key":"e_1_3_2_1_4_1","unstructured":"Eitan Altman . 1999. Constrained Markov Decision Processes . Vol. 7 . CRC Press . Eitan Altman. 1999. Constrained Markov Decision Processes. Vol. 7. CRC Press."},{"key":"e_1_3_2_1_5_1","volume-title":"Proceedings of the 34th International Conference on Machine Learning. 136--145","author":"Amos Brandon","year":"2017","unstructured":"Brandon Amos and J Zico Kolter . 2017 . OptNet: Differentiable Optimization as a Layer in Neural Networks . In Proceedings of the 34th International Conference on Machine Learning. 136--145 . Brandon Amos and J Zico Kolter. 2017. OptNet: Differentiable Optimization as a Layer in Neural Networks. In Proceedings of the 34th International Conference on Machine Learning. 136--145."},{"key":"e_1_3_2_1_6_1","volume-title":"Deep equilibrium models. arXiv preprint arXiv:1909.01377","author":"Bai Shaojie","year":"2019","unstructured":"Shaojie Bai , J Zico Kolter , and Vladlen Koltun . 2019. Deep equilibrium models. arXiv preprint arXiv:1909.01377 ( 2019 ). Shaojie Bai, J Zico Kolter, and Vladlen Koltun. 2019. Deep equilibrium models. arXiv preprint arXiv:1909.01377 (2019)."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPWRS.2017.2735379"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"crossref","unstructured":"J. Bank and J. Hambrick. 2013. Development of a high resolution real time distribution-level metering system and associated visualization modeling and data analysis functions. (2013). National Renewable Energy Laboratory Tech. Rep. NREL\/TP-5500-56610.  J. Bank and J. Hambrick. 2013. Development of a high resolution real time distribution-level metering system and associated visualization modeling and data analysis functions. (2013). National Renewable Energy Laboratory Tech. Rep. NREL\/TP-5500-56610.","DOI":"10.2172\/1082568"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"crossref","unstructured":"T.S. Basso. 2014. In IEEE 1547 and 2030 Standards for Distributed Energy Resources Interconnection and Interoperability with the Electricity Grid. National Renewable Energy Laboratory.  T.S. Basso. 2014. In IEEE 1547 and 2030 Standards for Distributed Energy Resources Interconnection and Interoperability with the Electricity Grid. National Renewable Energy Laboratory.","DOI":"10.2172\/1166677"},{"key":"e_1_3_2_1_10_1","unstructured":"Felix Berkenkamp Matteo Turchetta Angela P. Schoellig and Andreas Krause. 2017. Safe Model-based Reinforcement Learning with Stability Guarantees. In Advances in Neural Information Processing Systems.  Felix Berkenkamp Matteo Turchetta Angela P. Schoellig and Andreas Krause. 2017. Safe Model-based Reinforcement Learning with Stability Guarantees. In Advances in Neural Information Processing Systems."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ALLERTON.2015.7447032"},{"key":"e_1_3_2_1_12_1","unstructured":"Ya-Chien Chang Nima Roohi and Sicun Gao. 2019. Neural Lyapunov Control. In Advances in Neural Information Processing Systems. 3245--3254.  Ya-Chien Chang Nima Roohi and Sicun Gao. 2019. Neural Lyapunov Control. In Advances in Neural Information Processing Systems. 3245--3254."},{"key":"e_1_3_2_1_13_1","volume-title":"Terrence WK Mak, and Pascal Van Hentenryck","author":"Chatzos Minas","year":"2020","unstructured":"Minas Chatzos , Ferdinando Fioretto , Terrence WK Mak, and Pascal Van Hentenryck . 2020 . High-Fidelity Machine Learning Approximations of Large-Scale Optimal Power Flow . arXiv preprint arXiv:2006.16356 (2020). Minas Chatzos, Ferdinando Fioretto, Terrence WK Mak, and Pascal Van Hentenryck. 2020. High-Fidelity Machine Learning Approximations of Large-Scale Optimal Power Flow. arXiv preprint arXiv:2006.16356 (2020)."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3360322.3360849"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/3408308.3427980"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3427773.3427871"},{"key":"e_1_3_2_1_17_1","unstructured":"Ricky TQ Chen Yulia Rubanova Jesse Bettencourt and David K Duvenaud. 2018. Neural ordinary differential equations. In Advances in neural information processing systems. 6571--6583.  Ricky TQ Chen Yulia Rubanova Jesse Bettencourt and David K Duvenaud. 2018. Neural ordinary differential equations. In Advances in neural information processing systems. 6571--6583."},{"key":"e_1_3_2_1_18_1","unstructured":"Filipe de Avila Belbute-Peres Kevin Smith Kelsey Allen Josh Tenenbaum and J Zico Kolter. 2018. End-to-end differentiable physics for learning and control. In Advances in Neural Information Processing Systems. 7178--7189.  Filipe de Avila Belbute-Peres Kevin Smith Kelsey Allen Josh Tenenbaum and J Zico Kolter. 2018. End-to-end differentiable physics for learning and control. In Advances in Neural Information Processing Systems. 7178--7189."},{"key":"e_1_3_2_1_19_1","unstructured":"Josip Djolonga and Andreas Krause. 2017. Differentiable Learning of Submodular Models. In Advances in Neural Information Processing Systems. 1013--1023.  Josip Djolonga and Andreas Krause. 2017. Differentiable Learning of Submodular Models. In Advances in Neural Information Processing Systems. 1013--1023."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.epsr.2020.106615"},{"key":"e_1_3_2_1_21_1","volume-title":"International Conference on Learning Representations.","author":"Donti Priya L","year":"2021","unstructured":"Priya L Donti , Melrose Roderick , Mahyar Fazlyab , and J Zico Kolter . 2021 . Enforcing robust control guarantees within neural network policies . In International Conference on Learning Representations. Priya L Donti, Melrose Roderick, Mahyar Fazlyab, and J Zico Kolter. 2021. Enforcing robust control guarantees within neural network policies. In International Conference on Learning Representations."},{"key":"e_1_3_2_1_22_1","volume-title":"David Blum, Krzysztof Arendt, Donghun Kim, Enric Perarnau Oll\u00e9, Juraj Oravec, Michael Wetter, Draguna L Vrabie, et al.","author":"Drgo\u0148a J\u00e1n","year":"2020","unstructured":"J\u00e1n Drgo\u0148a , Javier Arroyo , Iago Cupeiro Figueroa , David Blum, Krzysztof Arendt, Donghun Kim, Enric Perarnau Oll\u00e9, Juraj Oravec, Michael Wetter, Draguna L Vrabie, et al. 2020 . All you need to know about model predictive control for buildings. Annual Reviews in Control ( 2020). J\u00e1n Drgo\u0148a, Javier Arroyo, Iago Cupeiro Figueroa, David Blum, Krzysztof Arendt, Donghun Kim, Enric Perarnau Oll\u00e9, Juraj Oravec, Michael Wetter, Draguna L Vrabie, et al. 2020. All you need to know about model predictive control for buildings. Annual Reviews in Control (2020)."},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF02238059"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i01.5403"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2019.8794107"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-17462-0_28"},{"key":"e_1_3_2_1_28_1","first-page":"1437","article-title":"A comprehensive survey on safe reinforcement learning","volume":"16","author":"Garc Javier","year":"2015","unstructured":"Javier Garc &iota;a and Fernando Fern\u00e1ndez . 2015 . A comprehensive survey on safe reinforcement learning . Journal of Machine Learning Research 16 , 1 (2015), 1437 -- 1480 . Javier Garc&iota;a and Fernando Fern\u00e1ndez. 2015. A comprehensive survey on safe reinforcement learning. Journal of Machine Learning Research 16, 1 (2015), 1437--1480.","journal-title":"Journal of Machine Learning Research"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.arcontrol.2019.09.008"},{"key":"e_1_3_2_1_30_1","volume-title":"Deep declarative networks: A new hope. arXiv preprint arXiv:1909.04866","author":"Gould Stephen","year":"2019","unstructured":"Stephen Gould , Richard Hartley , and Dylan Campbell . 2019. Deep declarative networks: A new hope. arXiv preprint arXiv:1909.04866 ( 2019 ). Stephen Gould, Richard Hartley, and Dylan Campbell. 2019. Deep declarative networks: A new hope. arXiv preprint arXiv:1909.04866 (2019)."},{"key":"e_1_3_2_1_31_1","unstructured":"Samuel Greydanus Misko Dzamba and Jason Yosinski. 2019. Hamiltonian neural networks. In Advances in Neural Information Processing Systems. 15379--15389.  Samuel Greydanus Misko Dzamba and Jason Yosinski. 2019. Hamiltonian neural networks. In Advances in Neural Information Processing Systems. 15379--15389."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/SmartGridComm47815.2020.9302970"},{"key":"e_1_3_2_1_33_1","volume-title":"H\u221e Model-free Reinforcement Learning with Robust Stability Guarantee. CoRR","author":"Han Minghao","year":"2019","unstructured":"Minghao Han , Yuan Tian , Lixian Zhang , Jun Wang , and Wei Pan . 2019. H\u221e Model-free Reinforcement Learning with Robust Stability Guarantee. CoRR ( 2019 ). Minghao Han, Yuan Tian, Lixian Zhang, Jun Wang, and Wei Pan. 2019. H\u221e Model-free Reinforcement Learning with Robust Stability Guarantee. CoRR (2019)."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-57628-8_1"},{"key":"e_1_3_2_1_35_1","volume-title":"Verifiably safe exploration for end-to-end reinforcement learning. arXiv preprint arXiv:2007.01223","author":"Hunt Nathan","year":"2020","unstructured":"Nathan Hunt , Nathan Fulton , Sara Magliacane , Nghia Hoang , Subhro Das , and Armando Solar-Lezama . 2020. Verifiably safe exploration for end-to-end reinforcement learning. arXiv preprint arXiv:2007.01223 ( 2020 ). Nathan Hunt, Nathan Fulton, Sara Magliacane, Nghia Hoang, Subhro Das, and Armando Solar-Lezama. 2020. Verifiably safe exploration for end-to-end reinforcement learning. arXiv preprint arXiv:2007.01223 (2020)."},{"key":"e_1_3_2_1_36_1","unstructured":"IEEE. [n.d.]. 37 node distribution test feeder. https:\/\/ewh.ieee.org\/soc\/pes\/dsacom\/testfeeders\/. Online.  IEEE. [n.d.]. 37 node distribution test feeder. https:\/\/ewh.ieee.org\/soc\/pes\/dsacom\/testfeeders\/. Online."},{"key":"e_1_3_2_1_37_1","first-page":"1","article-title":"IEEE Standard Conformance Test Procedures for Equipment Interconnecting Distributed Energy Resources with Electric Power Systems and Associated Interfaces","volume":"1547","author":"IEEE.","year":"2020","unstructured":"IEEE. 2020 . IEEE Standard Conformance Test Procedures for Equipment Interconnecting Distributed Energy Resources with Electric Power Systems and Associated Interfaces . IEEE Std 1547 . 1 - 2020 (2020), 1--282. https:\/\/doi.org\/10.1109\/IEEESTD.2020.9097534 IEEE. 2020. IEEE Standard Conformance Test Procedures for Equipment Interconnecting Distributed Energy Resources with Electric Power Systems and Associated Interfaces. IEEE Std 1547.1-2020 (2020), 1--282. https:\/\/doi.org\/10.1109\/IEEESTD.2020.9097534","journal-title":"IEEE Std"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSG.2019.2942850"},{"key":"e_1_3_2_1_39_1","volume-title":"Tutorial: Deep Implicit Layers - Neural ODEs, Deep Equilibirum Models, and Beyond","author":"Kolter Zico","year":"2020","unstructured":"Zico Kolter , David Duvenaud , and Matthew Johnson . 2020 . Tutorial: Deep Implicit Layers - Neural ODEs, Deep Equilibirum Models, and Beyond . http:\/\/implicit-layers-tutorial.org\/. Zico Kolter, David Duvenaud, and Matthew Johnson. 2020. Tutorial: Deep Implicit Layers - Neural ODEs, Deep Equilibirum Models, and Beyond. http:\/\/implicit-layers-tutorial.org\/."},{"volume-title":"The implicit function theorem: history, theory, and applications","author":"Krantz Steven G","key":"e_1_3_2_1_40_1","unstructured":"Steven G Krantz and Harold R Parks . 2012. The implicit function theorem: history, theory, and applications . Springer Science & Business Media . Steven G Krantz and Harold R Parks. 2012. The implicit function theorem: history, theory, and applications. Springer Science & Business Media."},{"key":"e_1_3_2_1_41_1","volume-title":"What game are we playing? end-to-end learning in normal and extensive form games. arXiv preprint arXiv:1805.02777","author":"Ling Chun Kai","year":"2018","unstructured":"Chun Kai Ling , Fei Fang , and J Zico Kolter . 2018. What game are we playing? end-to-end learning in normal and extensive form games. arXiv preprint arXiv:1805.02777 ( 2018 ). Chun Kai Ling, Fei Fang, and J Zico Kolter. 2018. What game are we playing? end-to-end learning in normal and extensive form games. arXiv preprint arXiv:1805.02777 (2018)."},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCYB.2014.2319577"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.23919\/PSCC.2018.8450880"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1162\/0899766053011528"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRA.2018.8460547"},{"key":"e_1_3_2_1_46_1","volume-title":"Proceedings of the 34th International Conference on Machine Learning. JMLR. org, 2817--2826","author":"Pinto Lerrel","year":"2017","unstructured":"Lerrel Pinto , James Davidson , Rahul Sukthankar , and Abhinav Gupta . 2017 . Robust Adversarial Reinforcement Learning . In Proceedings of the 34th International Conference on Machine Learning. JMLR. org, 2817--2826 . Lerrel Pinto, James Davidson, Rahul Sukthankar, and Abhinav Gupta. 2017. Robust Adversarial Reinforcement Learning. In Proceedings of the 34th International Conference on Machine Learning. JMLR. org, 2817--2826."},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.enbuild.2012.10.024"},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPWRS.2010.2051168"},{"key":"e_1_3_2_1_49_1","volume-title":"Nikola Milojevic-Dupont, Natasha Jaques, Anna Waldman-Brown, et al.","author":"Rolnick David","year":"2019","unstructured":"David Rolnick , Priya L Donti , Lynn H Kaack , Kelly Kochanski , Alexandre Lacoste , Kris Sankaran , Andrew Slavin Ross , Nikola Milojevic-Dupont, Natasha Jaques, Anna Waldman-Brown, et al. 2019 . Tackling Climate Change with Machine Learning . arXiv preprint arXiv:1906.05433 (2019). David Rolnick, Priya L Donti, Lynn H Kaack, Kelly Kochanski, Alexandre Lacoste, Kris Sankaran, Andrew Slavin Ross, Nikola Milojevic-Dupont, Natasha Jaques, Anna Waldman-Brown, et al. 2019. Tackling Climate Change with Machine Learning. arXiv preprint arXiv:1906.05433 (2019)."},{"key":"e_1_3_2_1_50_1","volume-title":"High-dimensional continuous control using generalized advantage estimation. arXiv preprint arXiv:1506.02438","author":"Schulman John","year":"2015","unstructured":"John Schulman , Philipp Moritz , Sergey Levine , Michael Jordan , and Pieter Abbeel . 2015. High-dimensional continuous control using generalized advantage estimation. arXiv preprint arXiv:1506.02438 ( 2015 ). John Schulman, Philipp Moritz, Sergey Levine, Michael Jordan, and Pieter Abbeel. 2015. High-dimensional continuous control using generalized advantage estimation. arXiv preprint arXiv:1506.02438 (2015)."},{"key":"e_1_3_2_1_51_1","volume-title":"Proximal Policy Optimization Algorithms. arXiv preprint arXiv:1707.06347","author":"Schulman John","year":"2017","unstructured":"John Schulman , Filip Wolski , Prafulla Dhariwal , Alec Radford , and Oleg Klimov . 2017. Proximal Policy Optimization Algorithms. arXiv preprint arXiv:1707.06347 ( 2017 ). John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. 2017. Proximal Policy Optimization Algorithms. arXiv preprint arXiv:1707.06347 (2017)."},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i02.5599"},{"key":"e_1_3_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1109\/MPE.2020.3014720"},{"key":"e_1_3_2_1_54_1","volume-title":"Dietterich","author":"Taleghan Majid Alkaee","year":"2018","unstructured":"Majid Alkaee Taleghan and Thomas G . Dietterich . 2018 . Efficient Exploration for Constrained MDPs. In 2018 AAAI Spring Symposia . Majid Alkaee Taleghan and Thomas G. Dietterich. 2018. Efficient Exploration for Constrained MDPs. In 2018 AAAI Spring Symposia."},{"key":"e_1_3_2_1_55_1","volume-title":"Divide the gradient by a running average of its recent magnitude. COURSERA: Neural networks for machine learning 4, 2","author":"Tieleman Tijmen","year":"2012","unstructured":"Tijmen Tieleman and Geoffrey Hinton . 2012. Lecture 6.5-rmsprop : Divide the gradient by a running average of its recent magnitude. COURSERA: Neural networks for machine learning 4, 2 ( 2012 ), 26--31. Tijmen Tieleman and Geoffrey Hinton. 2012. Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude. COURSERA: Neural networks for machine learning 4, 2 (2012), 26--31."},{"key":"e_1_3_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2018\/379"},{"key":"e_1_3_2_1_57_1","unstructured":"Matteo Turchetta Felix Berkenkamp and Andreas Krause. 2016. Safe Exploration in Finite Markov Decision Processes with Gaussian Processes. In Advances in Neural Information Processing Systems.  Matteo Turchetta Felix Berkenkamp and Andreas Krause. 2016. Safe Exploration in Finite Markov Decision Processes with Gaussian Processes. In Advances in Neural Information Processing Systems."},{"key":"e_1_3_2_1_58_1","volume-title":"Proceedings of the AAAI Conference on Artificial Intelligence","volume":"32","author":"Wachi Akifumi","year":"2018","unstructured":"Akifumi Wachi , Yanan Sui , Yisong Yue , and Masahiro Ono . 2018 . Safe Exploration and Optimization of Constrained MDPs Using Gaussian Processes . In Proceedings of the AAAI Conference on Artificial Intelligence , Vol. 32 . Akifumi Wachi, Yanan Sui, Yisong Yue, and Masahiro Ono. 2018. Safe Exploration and Optimization of Constrained MDPs Using Gaussian Processes. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32."},{"key":"e_1_3_2_1_59_1","volume-title":"Proceedings of the 36th International Conference on Machine Learning. 6545--6554","author":"Wang Po-Wei","year":"2019","unstructured":"Po-Wei Wang , Priya Donti , Bryan Wilder , and Zico Kolter . 2019 . SATNet: Bridging deep learning and logical reasoning using a differentiable satisfiability solver . In Proceedings of the 36th International Conference on Machine Learning. 6545--6554 . Po-Wei Wang, Priya Donti, Bryan Wilder, and Zico Kolter. 2019. SATNet: Bridging deep learning and logical reasoning using a differentiable satisfiability solver. In Proceedings of the 36th International Conference on Machine Learning. 6545--6554."},{"key":"e_1_3_2_1_60_1","doi-asserted-by":"crossref","unstructured":"Stephen Wilcox and William Marion. 2008. Users manual for TMY3 data sets. National Renewable Energy Laboratory Golden CO.  Stephen Wilcox and William Marion. 2008. Users manual for TMY3 data sets. National Renewable Energy Laboratory Golden CO.","DOI":"10.2172\/928611"},{"key":"e_1_3_2_1_61_1","volume-title":"Projection-based constrained policy optimization. arXiv preprint arXiv:2010.03152","author":"Yang Tsung-Yen","year":"2020","unstructured":"Tsung-Yen Yang , Justinian Rosca , Karthik Narasimhan , and Peter J Ramadge . 2020. Projection-based constrained policy optimization. arXiv preprint arXiv:2010.03152 ( 2020 ). Tsung-Yen Yang, Justinian Rosca, Karthik Narasimhan, and Peter J Ramadge. 2020. Projection-based constrained policy optimization. arXiv preprint arXiv:2010.03152 (2020)."},{"key":"e_1_3_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1109\/SmartGridComm47815.2020.9303008"},{"key":"e_1_3_2_1_63_1","unstructured":"Kaiqing Zhang Bin Hu and Tamer Basar. 2020. Policy Optimization for H2 Linear Control with H\u221e Robustness Guarantee: Implicit Regularization and Global Convergence. In Learning for Dynamics and Control. PMLR 179--190.  Kaiqing Zhang Bin Hu and Tamer Basar. 2020. Policy Optimization for H2 Linear Control with H\u221e Robustness Guarantee: Implicit Regularization and Global Convergence. In Learning for Dynamics and Control. PMLR 179--190."},{"key":"e_1_3_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1145\/3276774.3276775"},{"key":"e_1_3_2_1_65_1","first-page":"213","article-title":"Deep reinforcement learning for power system applications: An overview","volume":"6","author":"Zhang Zidong","year":"2019","unstructured":"Zidong Zhang , Dongxia Zhang , and Robert C Qiu . 2019 . Deep reinforcement learning for power system applications: An overview . CSEE Journal of Power and Energy Systems 6 , 1 (2019), 213 -- 225 . Zidong Zhang, Dongxia Zhang, and Robert C Qiu. 2019. Deep reinforcement learning for power system applications: An overview. CSEE Journal of Power and Energy Systems 6, 1 (2019), 213--225.","journal-title":"CSEE Journal of Power and Energy Systems"},{"key":"e_1_3_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPWRS.2017.2674699"},{"volume-title":"Essentials of Robust Control","author":"Zhou Kemin","key":"e_1_3_2_1_67_1","unstructured":"Kemin Zhou and John Comstock Doyle . 1998. Essentials of Robust Control . Vol. 104 . Prentice hall Upper Saddle River, NJ. Kemin Zhou and John Comstock Doyle. 1998. Essentials of Robust Control. Vol. 104. Prentice hall Upper Saddle River, NJ."}],"event":{"name":"e-Energy '21: The Twelfth ACM International Conference on Future Energy Systems","acronym":"e-Energy '21","location":"Virtual Event Italy"},"container-title":["Proceedings of the Twelfth ACM International Conference on Future Energy Systems"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3447555.3464874","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3447555.3464874","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:18:32Z","timestamp":1750191512000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3447555.3464874"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,6,22]]},"references-count":66,"alternative-id":["10.1145\/3447555.3464874","10.1145\/3447555"],"URL":"https:\/\/doi.org\/10.1145\/3447555.3464874","relation":{},"subject":[],"published":{"date-parts":[[2021,6,22]]},"assertion":[{"value":"2021-06-22","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}