{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,9]],"date-time":"2026-06-09T15:27:55Z","timestamp":1781018875578,"version":"3.54.1"},"publisher-location":"New York, NY, USA","reference-count":43,"publisher":"ACM","license":[{"start":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T00:00:00Z","timestamp":1774224000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/legalcode"}],"funder":[{"name":"NSERC Discovery Grant","award":["RGPIN 2025-00129"],"award-info":[{"award-number":["RGPIN 2025-00129"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2026,3,23]]},"DOI":"10.1145\/3748522.3779769","type":"proceedings-article","created":{"date-parts":[[2026,6,9]],"date-time":"2026-06-09T14:17:49Z","timestamp":1781014669000},"page":"1383-1390","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Variational Quantum Rainbow Deep Q-Network for Optimizing Resource Allocation Problem"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6750-9536","authenticated-orcid":false,"given":"Hung Truong Thanh","family":"Nguyen","sequence":"first","affiliation":[{"name":"Faculty of Computer Science, University of New Brunswick, Fredericton, Canada"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0006-6248-3783","authenticated-orcid":false,"given":"Truong Thinh","family":"Nguyen","sequence":"additional","affiliation":[{"name":"University of Science and Technology of Hanoi, Hanoi, Vietnam"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0788-4377","authenticated-orcid":false,"given":"Hung","family":"Cao","sequence":"additional","affiliation":[{"name":"Faculty of Computer Science, University of New Brunswick, Fredericton, Canada"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2026,6,9]]},"reference":[{"key":"e_1_3_2_1_1_1","first-page":"16","article-title":"On the use of quantum reinforcement learning in energy-efficiency scenarios","volume":"15","author":"Eva Andr\u00e9s","year":"2022","unstructured":"Eva Andr\u00e9s et al. 2022. On the use of quantum reinforcement learning in energy-efficiency scenarios. Energies, 15, 16, 6034.","journal-title":"Energies"},{"key":"e_1_3_2_1_2_1","volume-title":"2024 2nd International Conference on Disruptive Technologies (ICDT). IEEE, 1086\u20131091","author":"Sana","unstructured":"Sana Anjum et al. 2024. Machine learning-based resource allocation algorithms for 6g networks. In 2024 2nd International Conference on Disruptive Technologies (ICDT). IEEE, 1086\u20131091."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/TWC.2023.3330868"},{"key":"e_1_3_2_1_4_1","volume-title":"2024 IEEE International Conference on Consumer Electronics-Asia (ICCE-Asia). IEEE, 1\u20134.","author":"Rahimi Seyed Alireza","unstructured":"Seyed Alireza Rahimi Azghadi et al. 2024. An energy-efficient lora iot system for water monitoring: lessons learned and use cases. In 2024 IEEE International Conference on Consumer Electronics-Asia (ICCE-Asia). IEEE, 1\u20134."},{"key":"e_1_3_2_1_5_1","first-page":"48","article-title":"Optimal allocation of human resources by using linear programming in the beverage company","volume":"3","author":"Mina Azimi","year":"2013","unstructured":"Mina Azimi et al. 2013. Optimal allocation of human resources by using linear programming in the beverage company. Universal Journal of Management and Social Sciences, 3, 5, 48\u201354.","journal-title":"Universal Journal of Management and Social Sciences"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i04.5723"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1007\/s12351-016-0247-8"},{"key":"e_1_3_2_1_8_1","volume-title":"2019 IEEE Intelligent Vehicles Symposium (IV). IEEE, 1469\u20131476","author":"Maxime","unstructured":"Maxime Bouton et al. 2019. Safe reinforcement learning with scene decomposition for navigating complex urban environments. In 2019 IEEE Intelligent Vehicles Symposium (IV). IEEE, 1469\u20131476."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"crossref","unstructured":"Samuel Yen-Chi Chen et al. 2020. Variational quantum circuits for deep reinforcement learning. IEEE access 8 141007\u2013141024.","DOI":"10.1109\/ACCESS.2020.3010470"},{"key":"e_1_3_2_1_10_1","first-page":"1","article-title":"Characterizing randomness in parameterized quantum circuits through expressibility and average entanglement","volume":"10","author":"Correr Guilherme Il\u00e1rio","year":"2024","unstructured":"Guilherme Il\u00e1rio Correr et al. 2024. Characterizing randomness in parameterized quantum circuits through expressibility and average entanglement. Quantum Science and Technology, 10, 1, 015008.","journal-title":"Quantum Science and Technology"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSMCB.2008.925743"},{"key":"e_1_3_2_1_12_1","unstructured":"Theodora-Augustina Dr\u0103gan et al. 2022. Quantum reinforcement learning for solving a stochastic frozen lake environment and the impact of quantum architecture choices. arXiv preprint arXiv:2212.07932."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIRDC62824.2023.00057"},{"key":"e_1_3_2_1_14_1","first-page":"1041","article-title":"Multicommodity network flow model of a human resource allocation problem considering time periods","volume":"32","author":"Ercsey Zsolt","year":"2024","unstructured":"Zsolt Ercsey and Zolt\u00e1n Kov\u00e1cs. 2024. Multicommodity network flow model of a human resource allocation problem considering time periods. Central European Journal of Operations Research, 32, 4, 1041\u20131059.","journal-title":"Central European Journal of Operations Research"},{"key":"e_1_3_2_1_15_1","volume-title":"Proceedings of the AAAI conference on artificial intelligence.","volume":"32","author":"Matteo","unstructured":"Matteo Hessel et al. 2018. Rainbow: combining improvements in deep reinforcement learning. In Proceedings of the AAAI conference on artificial intelligence. Vol. 32."},{"key":"e_1_3_2_1_16_1","first-page":"1","article-title":"Evaluation of parameterized quantum circuits: on the relation between classification accuracy, expressibility, and entangling capability","volume":"3","author":"Thomas Hubregtsen","year":"2021","unstructured":"Thomas Hubregtsen et al. 2021. Evaluation of parameterized quantum circuits: on the relation between classification accuracy, expressibility, and entangling capability. Quantum Machine Intelligence, 3, 1, 9.","journal-title":"Quantum Machine Intelligence"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.amc.2007.04.096"},{"key":"e_1_3_2_1_18_1","first-page":"28362","article-title":"Parametrized quantum policies for reinforcement learning","volume":"34","author":"Sofiene Jerbi","year":"2021","unstructured":"Sofiene Jerbi et al. 2021. Parametrized quantum policies for reinforcement learning. Advances in Neural Information Processing Systems, 34, 28362\u201328375.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2023.09.001"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.3390\/robotics2030122"},{"key":"e_1_3_2_1_21_1","unstructured":"Timothy P Lillicrap et al. 2015. Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"crossref","unstructured":"Neda Manavizadeh et al. 2013. A simulated annealing algorithm for a mixed model assembly u-line balancing type-i problem considering human efficiency and just-in-time approach. Computers & industrial engineering 64 2 669\u2013685.","DOI":"10.1016\/j.cie.2012.11.010"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1063\/1.1497700"},{"key":"e_1_3_2_1_24_1","unstructured":"Nico Meyer et al. 2022. A survey on quantum reinforcement learning. arXiv preprint arXiv:2211.03464."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"crossref","unstructured":"Volodymyr Mnih et al. 2015. Human-level control through deep reinforcement learning. Nature 518 (Feb. 2015) 529\u2013533.","DOI":"10.1038\/nature14236"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cor.2012.07.010"},{"key":"e_1_3_2_1_27_1","volume-title":"Bridging the Gap Between AI Planning and Reinforcement Learning (PRL) Workshop, ICAPS","author":"Phong","year":"2021","unstructured":"Phong Nguyen et al. 2021. Can reinforcement learning solve a human allocation problem? Bridging the Gap Between AI Planning and Reinforcement Learning (PRL) Workshop, ICAPS 2021."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","unstructured":"Truong Thanh Hung Nguyen et al. 2024. Temporal point processes for business process monitoring. Quy Nhon University Journal of Science. 10.52111\/qnjs.2024.18501","DOI":"10.52111\/qnjs.2024.18501"},{"key":"e_1_3_2_1_29_1","volume-title":"2021 36th IEEE\/ACM International Conference on Automated Software Engineering Workshops (ASEW). IEEE, 96\u2013101","author":"Ciprian","unstructured":"Ciprian Paduraru et al. 2021. Task distribution and human resource management using reinforcement learning. In 2021 36th IEEE\/ACM International Conference on Automated Software Engineering Workshops (ASEW). IEEE, 96\u2013101."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3072959.3073602","article-title":"Deeploco: dynamic locomotion skills using hierarchical deep reinforcement learning","volume":"36","author":"Peng Xue Bin","year":"2017","unstructured":"Xue Bin Peng et al. 2017. Deeploco: dynamic locomotion skills using hierarchical deep reinforcement learning. ACM Transactions on Graphics (TOG), 36, 4, 1\u201313.","journal-title":"ACM Transactions on Graphics (TOG)"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"crossref","unstructured":"Andrew J Scott. 2004. Multipartite entanglement quantum-error-correcting codes and entangling power of quantum evolutions. Physical Review A\u2014Atomic Molecular and Optical Physics 69 5 052330.","DOI":"10.1103\/PhysRevA.69.052330"},{"key":"e_1_3_2_1_32_1","unstructured":"David Silver et al. 2017. Mastering chess and shogi by self-play with a general reinforcement learning algorithm. arXiv preprint arXiv:1712.01815."},{"key":"e_1_3_2_1_33_1","first-page":"12","article-title":"Expressibility and entangling capability of parameterized quantum circuits for hybrid quantum-classical algorithms","volume":"2","author":"Sukin Sim","year":"2019","unstructured":"Sukin Sim et al. 2019. Expressibility and entangling capability of parameterized quantum circuits for hybrid quantum-classical algorithms. Advanced Quantum Technologies, 2, 12, 1900070.","journal-title":"Advanced Quantum Technologies"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.22331\/q-2022-05-24-720"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCE.2023.3249402"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.51316\/jst.171.ssad.2024.34.1.2"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"crossref","first-page":"e0286433","DOI":"10.1371\/journal.pone.0286433","article-title":"Optimization and inventory management under stochastic demand using metaheuristic algorithm","volume":"19","author":"Tan Nguyen Duy","year":"2024","unstructured":"Nguyen Duy Tan et al. 2024. Optimization and inventory management under stochastic demand using metaheuristic algorithm. Plos one, 19, 1, e0286433.","journal-title":"Plos one"},{"key":"e_1_3_2_1_38_1","volume-title":"Proceedings of the AAAI conference on artificial intelligence.","volume":"30","author":"Van Hado","unstructured":"Hado Van Hasselt et al. 2016. Deep reinforcement learning with double q-learning. In Proceedings of the AAAI conference on artificial intelligence. Vol. 30."},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cor.2013.10.016"},{"key":"e_1_3_2_1_40_1","volume-title":"International conference on machine learning. PMLR","author":"Ziyu","unstructured":"Ziyu Wang et al. 2016. Dueling network architectures for deep reinforcement learning. In International conference on machine learning. PMLR, 1995\u20132003."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"crossref","unstructured":"Christopher JCH Watkins and Peter Dayan. 1992. Q-learning. Machine learning 8 279\u2013292.","DOI":"10.1023\/A:1022676722315"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.apenergy.2025.125279"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/JIOT.2021.3097493"}],"event":{"name":"SAC '26: 41st ACM\/SIGAPP Symposium on Applied Computing","location":"Grand Hotel Palace Thessaloniki Greece","acronym":"SAC '26","sponsor":["SIGAPP ACM Special Interest Group on Applied Computing"]},"container-title":["Proceedings of the 41st ACM\/SIGAPP Symposium on Applied Computing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3748522.3779769","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,6,9]],"date-time":"2026-06-09T14:40:04Z","timestamp":1781016004000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3748522.3779769"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,3,23]]},"references-count":43,"alternative-id":["10.1145\/3748522.3779769","10.1145\/3748522"],"URL":"https:\/\/doi.org\/10.1145\/3748522.3779769","relation":{},"subject":[],"published":{"date-parts":[[2026,3,23]]},"assertion":[{"value":"2026-06-09","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}