{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,9]],"date-time":"2026-05-09T08:58:32Z","timestamp":1778317112778,"version":"3.51.4"},"reference-count":31,"publisher":"MDPI AG","issue":"5","license":[{"start":{"date-parts":[[2026,2,28]],"date-time":"2026-02-28T00:00:00Z","timestamp":1772236800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Applied Sciences"],"abstract":"<jats:p>Material handling is an important process in open-pit mining, where trucks transport material extracted by shovels to different destinations within the mine. The decision regarding the next destination of a truck strongly influences operational efficiency. In current mining operations, this decision is typically handled by centralized dispatching systems based on predefined criteria. However, such approaches often struggle to adapt to dynamic operating conditions and rely on a central control unit, which may reduce flexibility and robustness. This paper proposes a decentralized multi-agent system for truck dispatching with reinforcement learning (MAS-TDRL). In the proposed approach, autonomous agents representing trucks, shovels, and unloading points cooperate through a negotiation mechanism based on an enhanced Contract Net Protocol to generate operational schedules. Reinforcement learning is integrated into the decision-making process of truck agents, allowing them to learn from previous negotiations and improve their participation over time. The proposed system is evaluated through simulation using scenarios based on real data from an open-pit copper mine in Chile. The results show that incorporating reinforcement learning increases the material transported per hour by approximately 18\u201329% compared to a multi-agent system without learning, while maintaining computation times below 10 min even in the largest scenario, which remains compatible with operational decision-making in open-pit mining contexts.<\/jats:p>","DOI":"10.3390\/app16052343","type":"journal-article","created":{"date-parts":[[2026,3,2]],"date-time":"2026-03-02T12:39:56Z","timestamp":1772455196000},"page":"2343","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Reinforcement Learning-Driven Negotiation in a Multi-Agent System for Truck Dispatching in Open-Pit Mining"],"prefix":"10.3390","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-4781-2551","authenticated-orcid":false,"given":"Otthein","family":"Herzog","sequence":"first","affiliation":[{"name":"Department of Mathematics\/Informatics, University of Bremen, 28359 Bremen, Germany"},{"name":"College of Architecture and Urban Planning, Tongji University, Shanghai 200092, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1997-0053","authenticated-orcid":false,"given":"Gabriel","family":"Icarte-Ahumada","sequence":"additional","affiliation":[{"name":"Faculty of Engineering and Architecture, Arturo Prat University, Iquique 1110939, Chile"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Daniel","family":"Arratia","sequence":"additional","affiliation":[{"name":"Faculty of Engineering and Architecture, Arturo Prat University, Iquique 1110939, Chile"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Cristian","family":"Lucero","sequence":"additional","affiliation":[{"name":"Faculty of Engineering and Architecture, Arturo Prat University, Iquique 1110939, Chile"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2026,2,28]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1076\/ijsm.16.1.59.3408","article-title":"Overview of Solution Strategies Used in Truck Dispatching Systems for Open Pit Mines","volume":"16","author":"Alarie","year":"2002","journal-title":"Int. J. Surf. Min. Reclam. Environ."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"222","DOI":"10.1287\/inte.1090.0492","article-title":"A review of operations research in mine planning","volume":"40","author":"Newman","year":"2010","journal-title":"Interfaces"},{"key":"ref_3","unstructured":"Adams, K.K., and Bansah, K.K. (2016). Review of Operational Delays in Shovel-Truck System of Surface Mining Operations. 4th UMaT Biennial International Mining and Mineral Conference, University of Mines and Technology."},{"key":"ref_4","first-page":"215","article-title":"The impact of mixed fleet hauling on mining operations at Venetia mine","volume":"107","author":"Krzyzanowska","year":"2007","journal-title":"J. South. Afr. Inst. Min. Metall."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1016\/j.simpat.2019.04.006","article-title":"Simulation-based optimization of truck-shovel material handling systems in multi-pit surface mines","volume":"95","author":"Ozdemir","year":"2019","journal-title":"Simul. Model. Pract. Theory"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"103026","DOI":"10.1016\/j.simpat.2024.103026","article-title":"A Stochastic Energy-Efficient Robust Simulation-Based Truck Dispatching Optimization for Simultaneous GHG Mitigation and Operational Excellence in Open-Pit Mines","volume":"138","author":"Ashtiani","year":"2025","journal-title":"Simul. Model. Pract. Theory"},{"key":"ref_7","first-page":"153","article-title":"Sustainable open pit fleet management system: Integrating economic and environmental objectives into truck allocation","volume":"132","author":"Anaraki","year":"2023","journal-title":"Min. Technol. Trans. Inst. Min. Metall."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1080\/25726668.2021.1916170","article-title":"Optimization of truck-shovel allocation in open-pit mines under uncertainty: A chance-constrained goal programming approach","volume":"130","author":"Mohtasham","year":"2021","journal-title":"Min. Technol."},{"key":"ref_9","unstructured":"Cox, W., French, T., Reynolds, M., and While, L. (2017). A Genetic Algorithm for Truck Dispatching in Mining, EasyChair."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"160","DOI":"10.5220\/0010391101600170","article-title":"Application of multiagent system and tabu search for truck dispatching in open-pit mines","volume":"Volume 1","author":"Ahumada","year":"2021","journal-title":"ICAART 2021\u2014Proceedings of the 13th International Conference on Agents and Artificial Intelligence"},{"key":"ref_11","first-page":"745378","article-title":"Modelling and optimizing an open-pit truck scheduling problem","volume":"2015","author":"Chang","year":"2015","journal-title":"Discret. Dyn. Nat. Soc."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"759","DOI":"10.1016\/j.ejor.2017.03.081","article-title":"Energy efficient scheduling of open-pit coal mine trucks","volume":"262","author":"Patterson","year":"2017","journal-title":"Eur. J. Oper. Res."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"102727","DOI":"10.1016\/j.resourpol.2022.102727","article-title":"A systematic review of artificial intelligence and data-driven approaches in strategic open-pit mine planning","volume":"77","author":"Noriega","year":"2022","journal-title":"Resour. Policy"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Cohen, M.W., and Coelho, V.N. (2021). Open-pit mining operational planning using Multi Agent Systems. Procedia Computer Science, Elsevier B.V.","DOI":"10.1016\/j.procs.2021.08.172"},{"key":"ref_15","unstructured":"Rocha, A., Steels, L., and van den Herik, J. (2020). An Agent-based System for Truck Dispatching in Open-pit Mines. Proceedings of the 12th International Conference on Agents and Artificial Intelligence\u2014Volume 1: ICAART, SciTePress."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Freitag, M., Haasis, H., Kotzab, H., and Pannek, J. (2020). A Multiagent System for Truck Dispatching in Open-pit Mines. Dynamics in Logistics. LDIC 2020, Springer. Lecture Notes in Logistics.","DOI":"10.1007\/978-3-030-44783-0"},{"key":"ref_17","unstructured":"Sutton, R.S., and Barto, A.G. (2018). Reinforcement Learning: An Introduction, MIT Press. [2nd ed.]."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"23179","DOI":"10.1007\/s10489-023-04774-3","article-title":"Integrating short-term stochastic production planning updating with mining fleet management in industrial mining complexes: An actor-critic reinforcement learning approach","volume":"53","author":"Dimitrakopoulos","year":"2023","journal-title":"Appl. Intell."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"106349","DOI":"10.1016\/j.cor.2023.106349","article-title":"Learning to schedule heuristics for the simultaneous stochastic optimization of mining complexes","volume":"159","author":"Yaakoubi","year":"2023","journal-title":"Comput. Oper. Res."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"106664","DOI":"10.1016\/j.resconrec.2022.106664","article-title":"Reinforcement Learning-Based Fleet Dispatching for Greenhouse Gas Emission Reduction in Open-Pit Mining Operations","volume":"188","author":"Huo","year":"2023","journal-title":"Resour. Conserv. Recycl."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"120495","DOI":"10.1016\/j.eswa.2023.120495","article-title":"Reinforcement learning algorithms: A brief survey","volume":"30","author":"Shakya","year":"2023","journal-title":"Expert Syst. Appl."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"1104","DOI":"10.1109\/TC.1980.1675516","article-title":"The Contract Net Protocol: High-level communication and control in a distributed problem solver","volume":"C\u201329","author":"Smith","year":"1980","journal-title":"IEEE Trans. Comput."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"132","DOI":"10.1007\/978-3-030-71158-0_6","article-title":"A Dynamic Scheduling Multiagent System for Truck Dispatching in Open-Pit Mines","volume":"Volume 12613","author":"Ahumada","year":"2021","journal-title":"Lecture Notes in Computer Science"},{"key":"ref_24","first-page":"279","article-title":"Q-Learning","volume":"8","author":"Watkins","year":"1992","journal-title":"Mach. Learn."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Bellifemine, F., Caire, G., and Greenwood, D. (2007). Developing Multi-Agent Systems with JADE, John Wiley & Sons.","DOI":"10.1002\/9780470058411"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3381449","article-title":"JGraphT-A Java Library for Graph Data Structures and Algorithms","volume":"46","author":"Michail","year":"2020","journal-title":"ACM Trans. Math. Softw."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Modrak, V., Sudhakarapandian, R., Balamurugan, A., and Soltysova, Z. (2024). A Review on Reinforcement Learning in Production Scheduling: An Inferential Perspective. Algorithms, 17.","DOI":"10.20944\/preprints202405.0335.v1"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"102412","DOI":"10.1016\/j.rcim.2022.102412","article-title":"Dynamic job shop scheduling based on deep reinforcement learning for multi-agent manufacturing systems","volume":"78","author":"Zhang","year":"2022","journal-title":"Robot. Comput. Integr. Manuf."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"257","DOI":"10.23919\/CSMS.2021.0027","article-title":"A Review of Reinforcement Learning Based Intelligent Optimization for Manufacturing Scheduling","volume":"1","author":"Wang","year":"2021","journal-title":"Complex Syst. Model. Simul."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"96","DOI":"10.1016\/j.jmsy.2025.12.017","article-title":"A literature review on deep reinforcement learning for machine scheduling problems","volume":"85","author":"Ercan","year":"2026","journal-title":"J. Manuf. Syst."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.procir.2021.09.089","article-title":"A Deep Reinforcement Learning Based Scheduling Policy for Reconfigurable Manufacturing Systems","volume":"103","author":"Tang","year":"2021","journal-title":"Procedia CIRP"}],"container-title":["Applied Sciences"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2076-3417\/16\/5\/2343\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,2]],"date-time":"2026-03-02T13:14:18Z","timestamp":1772457258000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2076-3417\/16\/5\/2343"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,2,28]]},"references-count":31,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2026,3]]}},"alternative-id":["app16052343"],"URL":"https:\/\/doi.org\/10.3390\/app16052343","relation":{},"ISSN":["2076-3417"],"issn-type":[{"value":"2076-3417","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,2,28]]}}}