{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,26]],"date-time":"2026-03-26T15:28:21Z","timestamp":1774538901028,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":43,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,7,26]],"date-time":"2021-07-26T00:00:00Z","timestamp":1627257600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100003725","name":"National Research Foundation of Korea","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100003725","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,7,26]]},"DOI":"10.1145\/3466772.3467043","type":"proceedings-article","created":{"date-parts":[[2021,6,29]],"date-time":"2021-06-29T10:38:19Z","timestamp":1624963099000},"page":"141-150","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":13,"title":["Neuro-DCF"],"prefix":"10.1145","author":[{"given":"Sangwoo","family":"Moon","sequence":"first","affiliation":[{"name":"Korea Advanced Institute of Science and Technology, Daejeon, South Korea"}]},{"given":"Sumyeong","family":"Ahn","sequence":"additional","affiliation":[{"name":"Korea Advanced Institute of Science and Technology, Daejeon, South Korea"}]},{"given":"Kyunghwan","family":"Son","sequence":"additional","affiliation":[{"name":"Korea Advanced Institute of Science and Technology, Daejeon, South Korea"}]},{"given":"Jinwoo","family":"Park","sequence":"additional","affiliation":[{"name":"Korea Advanced Institute of Science and Technology, Daejeon, South Korea"}]},{"given":"Yung","family":"Yi","sequence":"additional","affiliation":[{"name":"Korea Advanced Institute of Science and Technology, Daejeon, South Korea"}]}],"member":"320","published-online":{"date-parts":[[2021,7,26]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Proc. of ICML.","author":"B\u00f6hmer Wendelin","year":"2020","unstructured":"Wendelin B\u00f6hmer , Vitaly Kurin , and Shimon Whiteson . 2020 . Deep coordination graphs . In Proc. of ICML. Wendelin B\u00f6hmer, Vitaly Kurin, and Shimon Whiteson. 2020. Deep coordination graphs. In Proc. of ICML."},{"key":"e_1_3_2_1_2_1","unstructured":"Greg Brockman Vicki Cheung Ludwig Pettersson Jonas Schneider John Schulman Jie Tang and Wojciech Zaremba. 2016. OpenAI Gym. arXiv:arXiv:1606.01540  Greg Brockman Vicki Cheung Ludwig Pettersson Jonas Schneider John Schulman Jie Tang and Wojciech Zaremba. 2016. OpenAI Gym. arXiv:arXiv:1606.01540"},{"key":"e_1_3_2_1_3_1","volume-title":"Dynamic tuning of the IEEE 802.11 protocol to achieve a theoretical throughput limit","author":"Cal\u00ec Frederico","year":"2000","unstructured":"Frederico Cal\u00ec , Marco Conti , and Enrico Gregori . 2000. Dynamic tuning of the IEEE 802.11 protocol to achieve a theoretical throughput limit . IEEE\/ACM Transactions on networking 8, 6 ( 2000 ), 785--799. Frederico Cal\u00ec, Marco Conti, and Enrico Gregori. 2000. Dynamic tuning of the IEEE 802.11 protocol to achieve a theoretical throughput limit. IEEE\/ACM Transactions on networking 8, 6 (2000), 785--799."},{"key":"e_1_3_2_1_4_1","volume-title":"Proactive Resource Management for LTE in Unlicensed Spectrum: A Deep Learning Perspective","author":"Challita Ursula","year":"2018","unstructured":"Ursula Challita , Li Dong , and Walid Saad . 2018. Proactive Resource Management for LTE in Unlicensed Spectrum: A Deep Learning Perspective . IEEE transactions on wireless communications 17, 7 ( 2018 ), 4674--4689. Ursula Challita, Li Dong, and Walid Saad. 2018. Proactive Resource Management for LTE in Unlicensed Spectrum: A Deep Learning Perspective. IEEE transactions on wireless communications 17, 7 (2018), 4674--4689."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.11339"},{"key":"e_1_3_2_1_6_1","volume-title":"Proc. of NeurIPS.","author":"Duvenaud David K","year":"2015","unstructured":"David K Duvenaud , Dougal Maclaurin , Jorge Iparraguirre , Rafael Bombarell , Timothy Hirzel , Al\u00e1n Aspuru-Guzik , and Ryan P Adams . 2015 . Convolutional networks on graphs for learning molecular fingerprints . In Proc. of NeurIPS. David K Duvenaud, Dougal Maclaurin, Jorge Iparraguirre, Rafael Bombarell, Timothy Hirzel, Al\u00e1n Aspuru-Guzik, and Ryan P Adams. 2015. Convolutional networks on graphs for learning molecular fingerprints. In Proc. of NeurIPS."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v32i1.11794"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/3345768.3355908"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/1364654.1364685"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-71682-4_5"},{"key":"e_1_3_2_1_11_1","volume-title":"A survey of learning in multiagent environments: Dealing with non-stationarity. arXiv preprint arXiv:1707.09183","author":"Hernandez-Leal Pablo","year":"2017","unstructured":"Pablo Hernandez-Leal , Michael Kaisers , Tim Baarslag , and Enrique Munoz de Cote . 2017. A survey of learning in multiagent environments: Dealing with non-stationarity. arXiv preprint arXiv:1707.09183 ( 2017 ). Pablo Hernandez-Leal, Michael Kaisers, Tim Baarslag, and Enrique Munoz de Cote. 2017. A survey of learning in multiagent environments: Dealing with non-stationarity. arXiv preprint arXiv:1707.09183 (2017)."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/1080091.1080107"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.243"},{"key":"e_1_3_2_1_14_1","volume-title":"Proc. of ICML.","author":"Jay Nathan","year":"2019","unstructured":"Nathan Jay , Noga Rotman , Brighten Godfrey , Michael Schapira , and Aviv Tamar . 2019 . A Deep Reinforcement Learning Perspective on Internet Congestion Control . In Proc. of ICML. Nathan Jay, Noga Rotman, Brighten Godfrey, Michael Schapira, and Aviv Tamar. 2019. A Deep Reinforcement Learning Perspective on Internet Congestion Control. In Proc. of ICML."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNET.2009.2035046"},{"key":"e_1_3_2_1_16_1","volume-title":"Kipf and Max Welling","author":"Thomas","year":"2017","unstructured":"Thomas N. Kipf and Max Welling . 2017 . Semi-Supervised Classification with Graph Convolutional Networks. In Proc. of ICLR. Thomas N. Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In Proc. of ICLR."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/2494232.2465542"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/INFOCOM.2015.7218603"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNET.2015.2432053"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1155\/2010\/876216"},{"key":"e_1_3_2_1_21_1","volume-title":"Proc. of ICML.","author":"Liang Eric","year":"2018","unstructured":"Eric Liang , Richard Liaw , Robert Nishihara , Philipp Moritz , Roy Fox , Ken Goldberg , Joseph Gonzalez , Michael Jordan , and Ion Stoica . 2018 . RLlib: Abstractions for Distributed Reinforcement Learning . In Proc. of ICML. Eric Liang, Richard Liaw, Robert Nishihara, Philipp Moritz, Roy Fox, Ken Goldberg, Joseph Gonzalez, Michael Jordan, and Ion Stoica. 2018. RLlib: Abstractions for Distributed Reinforcement Learning. In Proc. of ICML."},{"key":"e_1_3_2_1_22_1","volume-title":"Proc. of ICLR.","author":"Lillicrap Timothy P","year":"2016","unstructured":"Timothy P Lillicrap , Jonathan J Hunt , Alexander Pritzel , Nicolas Heess , Tom Erez , Yuval Tassa , David Silver , and Daan Wierstra . 2016 . Continuous control with deep reinforcement learning . In Proc. of ICLR. Timothy P Lillicrap, Jonathan J Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. 2016. Continuous control with deep reinforcement learning. In Proc. of ICLR."},{"key":"e_1_3_2_1_23_1","volume-title":"Proc. of NeurIPS.","author":"Lowe Ryan","year":"2017","unstructured":"Ryan Lowe , Yi I Wu , Aviv Tamar , Jean Harb , Open AI Pieter Abbeel , and Igor Mordatch . 2017 . Multi-agent actor-critic for mixed cooperative-competitive environments . In Proc. of NeurIPS. Ryan Lowe, Yi I Wu, Aviv Tamar, Jean Harb, OpenAI Pieter Abbeel, and Igor Mordatch. 2017. Multi-agent actor-critic for mixed cooperative-competitive environments. In Proc. of NeurIPS."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/GLOCOM.2016.7842209"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"crossref","unstructured":"Volodymyr Mnih Koray Kavukcuoglu David Silver Andrei A Rusu Joel Veness Marc G Bellemare Alex Graves Martin Riedmiller Andreas K Fidjeland Georg Ostrovski etal 2015. Human-level control through deep reinforcement learning. nature 518 7540 (2015) 529--533.  Volodymyr Mnih Koray Kavukcuoglu David Silver Andrei A Rusu Joel Veness Marc G Bellemare Alex Graves Martin Riedmiller Andreas K Fidjeland Georg Ostrovski et al. 2015. Human-level control through deep reinforcement learning. nature 518 7540 (2015) 529--533.","DOI":"10.1038\/nature14236"},{"key":"e_1_3_2_1_27_1","volume-title":"Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning. arXiv preprint arXiv:2010.04740","author":"Naderializadeh Navid","year":"2020","unstructured":"Navid Naderializadeh , Fan H Hung , Sean Soleyman , and Deepak Khosla . 2020. Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning. arXiv preprint arXiv:2010.04740 ( 2020 ). Navid Naderializadeh, Fan H Hung, Sean Soleyman, and Deepak Khosla. 2020. Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning. arXiv preprint arXiv:2010.04740 (2020)."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/TWC.2018.2879433"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"crossref","unstructured":"Frans A Oliehoek Christopher Amato etal 2016. A concise introduction to decentralized POMDPs. Vol. 1. Springer.  Frans A Oliehoek Christopher Amato et al. 2016. A concise introduction to decentralized POMDPs. Vol. 1. Springer.","DOI":"10.1007\/978-3-319-28929-8_1"},{"key":"e_1_3_2_1_30_1","volume-title":"Proc. of CVPR.","author":"Qi Charles R","year":"2017","unstructured":"Charles R Qi , Hao Su , Kaichun Mo , and Leonidas J Guibas . 2017 . Pointnet: Deep learning on point sets for 3d classification and segmentation . In Proc. of CVPR. Charles R Qi, Hao Su, Kaichun Mo, and Leonidas J Guibas. 2017. Pointnet: Deep learning on point sets for 3d classification and segmentation. In Proc. of CVPR."},{"key":"e_1_3_2_1_31_1","volume-title":"Modeling and tools for network simulation","author":"Riley George F","unstructured":"George F Riley and Thomas R Henderson . 2010. The ns-3 Network Simulator . In Modeling and tools for network simulation . Springer , 15--34. George F Riley and Thomas R Henderson. 2010. The ns-3 Network Simulator. In Modeling and tools for network simulation. Springer, 15--34."},{"key":"e_1_3_2_1_32_1","volume-title":"Proc. of ICLR.","author":"Schulman John","year":"2016","unstructured":"John Schulman , Philipp Moritz , Sergey Levine , Michael Jordan , and Pieter Abbeel . 2016 . High-dimensional continuous control using generalized advantage estimation . In Proc. of ICLR. John Schulman, Philipp Moritz, Sergey Levine, Michael Jordan, and Pieter Abbeel. 2016. High-dimensional continuous control using generalized advantage estimation. In Proc. of ICLR."},{"key":"e_1_3_2_1_33_1","volume-title":"Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347","author":"Schulman John","year":"2017","unstructured":"John Schulman , Filip Wolski , Prafulla Dhariwal , Alec Radford , and Oleg Klimov . 2017. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347 ( 2017 ). John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. 2017. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347 (2017)."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/1160987.1160996"},{"key":"e_1_3_2_1_35_1","volume-title":"Reinforcement learning: An introduction","author":"Sutton Richard S","unstructured":"Richard S Sutton and Andrew G Barto . 2018. Reinforcement learning: An introduction . MIT press . Richard S Sutton and Andrew G Barto. 2018. Reinforcement learning: An introduction. MIT press."},{"key":"e_1_3_2_1_36_1","volume-title":"Proc. of NeurIPS.","author":"Sutton Richard S","year":"1999","unstructured":"Richard S Sutton , David McAllester , Satinder Singh , and Yishay Mansour . 1999 . Policy gradient methods for reinforcement learning with function approximation . In Proc. of NeurIPS. Richard S Sutton, David McAllester, Satinder Singh, and Yishay Mansour. 1999. Policy gradient methods for reinforcement learning with function approximation. In Proc. of NeurIPS."},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0172395"},{"key":"e_1_3_2_1_38_1","volume-title":"Parameter Sharing is Surprisingly Useful for Multi-Agent Deep Reinforcement Learning. arXiv preprint arXiv:2005.13625","author":"Terry Justin K","year":"2020","unstructured":"Justin K Terry , Nathaniel Grammel , Ananth Hari , Luis Santos , Benjamin Black , and Dinesh Manocha . 2020. Parameter Sharing is Surprisingly Useful for Multi-Agent Deep Reinforcement Learning. arXiv preprint arXiv:2005.13625 ( 2020 ). Justin K Terry, Nathaniel Grammel, Ananth Hari, Luis Santos, Benjamin Black, and Dinesh Manocha. 2020. Parameter Sharing is Surprisingly Useful for Multi-Agent Deep Reinforcement Learning. arXiv preprint arXiv:2005.13625 (2020)."},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.3390\/electronics9091363"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCCN.2018.2809722"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/INFCOM.2009.5061929"},{"key":"e_1_3_2_1_42_1","volume-title":"Proc. of ICLR.","author":"Wu Cathy","year":"2018","unstructured":"Cathy Wu , Aravind Rajeswaran , Yan Duan , Vikash Kumar , Alexandre M Bayen , Sham Kakade , Igor Mordatch , and Pieter Abbeel . 2018 . Variance reduction for policy gradient with action-dependent factorized baselines . In Proc. of ICLR. Cathy Wu, Aravind Rajeswaran, Yan Duan, Vikash Kumar, Alexandre M Bayen, Sham Kakade, Igor Mordatch, and Pieter Abbeel. 2018. Variance reduction for policy gradient with action-dependent factorized baselines. In Proc. of ICLR."},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/INFOCOM.2018.8485853"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCS.2012.6406138"}],"event":{"name":"MobiHoc '21: The Twenty-second International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing","location":"Shanghai China","acronym":"MobiHoc '21","sponsor":["SIGMOBILE ACM Special Interest Group on Mobility of Systems, Users, Data and Computing"]},"container-title":["Proceedings of the Twenty-second International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3466772.3467043","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3466772.3467043","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T20:18:57Z","timestamp":1750191537000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3466772.3467043"}},"subtitle":["Design of Wireless MAC via Multi-Agent Reinforcement Learning Approach"],"short-title":[],"issued":{"date-parts":[[2021,7,26]]},"references-count":43,"alternative-id":["10.1145\/3466772.3467043","10.1145\/3466772"],"URL":"https:\/\/doi.org\/10.1145\/3466772.3467043","relation":{},"subject":[],"published":{"date-parts":[[2021,7,26]]},"assertion":[{"value":"2021-07-26","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}