{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,10]],"date-time":"2026-03-10T20:03:54Z","timestamp":1773173034690,"version":"3.50.1"},"reference-count":61,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2021,6,7]],"date-time":"2021-06-07T00:00:00Z","timestamp":1623024000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,6,7]],"date-time":"2021-06-07T00:00:00Z","timestamp":1623024000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Auton Agent Multi-Agent Syst"],"published-print":{"date-parts":[[2021,10]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>We present a novel negotiation model that allows an agent to learn how to negotiate during concurrent bilateral negotiations in unknown and dynamic e-markets. The agent uses an actor-critic architecture with model-free reinforcement learning to learn a strategy expressed as a deep neural network. We pre-train the strategy by supervision from synthetic market data, thereby decreasing the exploration time required for learning during negotiation. As a result, we can build automated agents for concurrent negotiations that can adapt to different e-market settings without the need to be pre-programmed. Our experimental evaluation shows that our deep reinforcement learning based agents outperform two existing well-known negotiation strategies in one-to-many concurrent bilateral negotiations for a range of e-market settings.<\/jats:p>","DOI":"10.1007\/s10458-021-09513-x","type":"journal-article","created":{"date-parts":[[2021,6,7]],"date-time":"2021-06-07T07:04:57Z","timestamp":1623049497000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":23,"title":["ANEGMA: an automated negotiation model for e-markets"],"prefix":"10.1007","volume":"35","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5226-1948","authenticated-orcid":false,"given":"Pallavi","family":"Bagga","sequence":"first","affiliation":[]},{"given":"Nicola","family":"Paoletti","sequence":"additional","affiliation":[]},{"given":"Bedour","family":"Alrayes","sequence":"additional","affiliation":[]},{"given":"Kostas","family":"Stathis","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,6,7]]},"reference":[{"key":"9513_CR1","doi-asserted-by":"crossref","unstructured":"Alrayes, B., Kafal\u0131, \u00d6., & Stathis, K. (2016). Recon: A robust multi-agent environment for simulating concurrent negotiations. In Recent advances in agent-based complex automated negotiation (pp. 157\u2013174). Springer.","DOI":"10.1007\/978-3-319-30307-9_10"},{"issue":"2","key":"9513_CR2","doi-asserted-by":"publisher","first-page":"463","DOI":"10.1007\/s10115-017-1125-2","volume":"56","author":"B Alrayes","year":"2018","unstructured":"Alrayes, B., Kafal\u0131, \u00d6., & Stathis, K. (2018). Concurrent bilateral negotiation for open e-markets: The Conan strategy. Knowledge and Information Systems, 56(2), 463\u2013501.","journal-title":"Knowledge and Information Systems"},{"issue":"6","key":"9513_CR3","doi-asserted-by":"publisher","first-page":"1261","DOI":"10.1109\/TSMCB.2006.874686","volume":"36","author":"Bo An","year":"2006","unstructured":"An, B., Sim, K. M., Tang, L. G., Li, S. Q., & Cheng, D. J. (2006). Continuous-time negotiation mechanism for software agents. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 36(6), 1261\u20131272.","journal-title":"IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)"},{"key":"9513_CR4","doi-asserted-by":"publisher","first-page":"61","DOI":"10.1007\/978-4-431-54758-7_4","volume-title":"Novel insights in agent-based complex automated negotiation","author":"Tim Baarslag","year":"2014","unstructured":"Baarslag, T., Hindriks, K., Hendrikx, M., Dirkzwager, A., & Jonker, C. (2014). Decoupling negotiating agents to explore the space of negotiation strategies. In Novel insights in agent-based complex automated negotiation (pp. 61\u201383). Springer."},{"key":"9513_CR5","doi-asserted-by":"crossref","unstructured":"Bagga, P., Paoletti, N., & Alrayes Bedour\u00a0Stathis, K. (2020). A deep reinforcement learning approach to concurrent bilateral negotiation. In Proceedings of the twenty-ninth international joint conference on artificial intelligence (pp. 297\u2013303).","DOI":"10.24963\/ijcai.2020\/42"},{"key":"9513_CR6","unstructured":"Bagga, P., Paoletti, N., & Stathis, K. (2020). Learnable strategies for bilateral agent negotiation over multiple issues. arXiv preprint arXiv:2009.08302."},{"key":"9513_CR7","unstructured":"Bakker, J., Hammond, A., Bloembergen, D., & Baarslag, T. (2019). Rlboa: A modular reinforcement learning framework for autonomous negotiating agents. In Proceedings of the 18th international conference on autonomous agents and multiagent systems (pp. 260\u2013268). International Foundation for Autonomous Agents and Multiagent Systems."},{"key":"9513_CR8","doi-asserted-by":"crossref","unstructured":"Bala, M. I., Vij, S., & Mukhopadhyay, D. (2013). Intelligent agent for prediction in e-negotiation: An approach. In 2013 International conference on cloud & ubiquitous computing & emerging technologies (pp. 183\u2013187). IEEE.","DOI":"10.1109\/CUBE.2013.41"},{"issue":"3","key":"9513_CR9","doi-asserted-by":"publisher","first-page":"274","DOI":"10.1016\/j.elerap.2006.06.008","volume":"6","author":"S Buffett","year":"2007","unstructured":"Buffett, S., & Spencer, B. (2007). A Bayesian classifier for learning opponents\u2019 preferences in multi-object automated negotiation. Electronic Commerce Research and Applications, 6(3), 274\u2013284.","journal-title":"Electronic Commerce Research and Applications"},{"key":"9513_CR10","doi-asserted-by":"crossref","unstructured":"Cardoso, H. L., & Oliveira, E. (2000). Using and evaluating adaptive agents for electronic commerce negotiation. In Advances in artificial intelligence (pp. 96\u2013105). Springer.","DOI":"10.1007\/3-540-44399-1_11"},{"key":"9513_CR11","doi-asserted-by":"crossref","unstructured":"Chang, H. C. H. (2020). Multi-issue negotiation with deep reinforcement learning. Knowledge-Based Systems, 106544.","DOI":"10.1016\/j.knosys.2020.106544"},{"issue":"16","key":"9513_CR12","doi-asserted-by":"publisher","first-page":"7630","DOI":"10.1016\/j.eswa.2014.06.003","volume":"41","author":"L Chen","year":"2014","unstructured":"Chen, L., Dong, H., & Zhou, Y. (2014). A reinforcement learning optimized negotiation method based on mediator agent. Expert Systems with Applications, 41(16), 7630\u20137640.","journal-title":"Expert Systems with Applications"},{"issue":"2","key":"9513_CR13","doi-asserted-by":"publisher","first-page":"195","DOI":"10.1016\/S1389-1286(01)00215-8","volume":"37","author":"SP Choi","year":"2001","unstructured":"Choi, S. P., Liu, J., & Chan, S. P. (2001). A genetic agent-based negotiation system. Computer Networks, 37(2), 195\u2013204.","journal-title":"Computer Networks"},{"issue":"12","key":"9513_CR14","doi-asserted-by":"publisher","first-page":"16221","DOI":"10.1007\/s11042-018-6984-3","volume":"78","author":"Nirmal Choudhary","year":"2018","unstructured":"Choudhary, N., & Bharadwaj, K. (2018). Evolutionary learning approach to multi-agent negotiation for group recommender systems. Multimedia Tools and Applications, 1\u201323.","journal-title":"Multimedia Tools and Applications"},{"issue":"3\u20134","key":"9513_CR15","doi-asserted-by":"publisher","first-page":"159","DOI":"10.1016\/S0921-8890(98)00029-3","volume":"24","author":"P Faratin","year":"1998","unstructured":"Faratin, P., Sierra, C., & Jennings, N. R. (1998). Negotiation decision functions for autonomous agents. Robotics and Autonomous Systems, 24(3\u20134), 159\u2013182.","journal-title":"Robotics and Autonomous Systems"},{"key":"9513_CR16","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511751691","volume-title":"Principles of automated negotiation","author":"S Fatima","year":"2014","unstructured":"Fatima, S., Kraus, S., & Wooldridge, M. (2014). Principles of automated negotiation. Cambridge: Cambridge University Press."},{"issue":"1","key":"9513_CR17","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/S0004-3702(03)00115-2","volume":"152","author":"SS Fatima","year":"2004","unstructured":"Fatima, S. S., Wooldridge, M., & Jennings, N. R. (2004). An agenda-based framework for multi-issue negotiation. Artificial Intelligence, 152(1), 1\u201345.","journal-title":"Artificial Intelligence"},{"key":"9513_CR18","volume-title":"Deep Learning","author":"Ian Goodfellow","year":"2016","unstructured":"Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning. MIT press."},{"key":"9513_CR19","unstructured":"Hendrikx, M. (2012). Evaluating the quality of opponent models in automated bilateral negotiations."},{"key":"9513_CR20","unstructured":"Hindriks, K., & Tykhonov, D. (2008). Opponent modelling in automated multi-issue negotiation using bayesian learning. In Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems (Vol. 1, pp. 331\u2013338)."},{"issue":"2","key":"9513_CR21","doi-asserted-by":"publisher","first-page":"239","DOI":"10.1007\/s12525-018-0307-4","volume":"29","author":"J Hopkins","year":"2019","unstructured":"Hopkins, J., Kafali, O., Alrayes, B., & Stathis, K. (2019). PIRASA: Strategic protocol selection for e-commerce agents. Electronic Markets, 29(2), 239\u2013252.","journal-title":"Electronic Markets"},{"key":"9513_CR22","doi-asserted-by":"publisher","first-page":"106390","DOI":"10.1016\/j.epsr.2020.106390","volume":"186","author":"K Imran","year":"2020","unstructured":"Imran, K., Zhang, J., Pal, A., Khattak, A., Ullah, K., & Baig, S. M. (2020). Bilateral negotiations for electricity market by adaptive agent-tracking strategy. Electric Power Systems Research, 186, 106390.","journal-title":"Electric Power Systems Research"},{"issue":"2","key":"9513_CR23","doi-asserted-by":"publisher","first-page":"199","DOI":"10.1023\/A:1008746126376","volume":"10","author":"NR Jennings","year":"2001","unstructured":"Jennings, N. R., Faratin, P., Lomuscio, A. R., Parsons, S., Wooldridge, M. J., & Sierra, C. (2001). Automated negotiation: Prospects, methods and challenges. Group Decision and Negotiation, 10(2), 199\u2013215.","journal-title":"Group Decision and Negotiation"},{"key":"9513_CR24","doi-asserted-by":"crossref","unstructured":"Jian, L. (2008). An agent bilateral multi-issue alternate bidding negotiation protocol based on reinforcement learning and its application in e-commerce. In 2008 International symposium on electronic commerce and security (pp. 217\u2013220). IEEE.","DOI":"10.1109\/ISECS.2008.102"},{"issue":"6","key":"9513_CR25","doi-asserted-by":"publisher","first-page":"1239","DOI":"10.1007\/s10726-020-09704-z","volume":"29","author":"Usha Kiruthika","year":"2020","unstructured":"Kiruthika, U., Somasundaram, T. S., & Raja, S. K. S. (2020). Lifecycle model of a negotiation agent: A survey of automated negotiation techniques. Group Decision and Negotiation, 1\u201324.","journal-title":"Group Decision and Negotiation"},{"key":"9513_CR26","doi-asserted-by":"crossref","unstructured":"Kraus, S. (2001). Automated negotiation and decision making in multiagent environments. In ECCAI advanced course on artificial intelligence (pp. 150\u2013172). Springer.","DOI":"10.1007\/3-540-47745-4_7"},{"issue":"1","key":"9513_CR27","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1002\/int.20120","volume":"21","author":"RY Lau","year":"2006","unstructured":"Lau, R. Y., Tang, M., Wong, O., Milliner, S. W., & Chen, Y. P. P. (2006). An evolutionary learning approach for adaptive negotiation agents. International Journal of Intelligent Systems, 21(1), 41\u201372.","journal-title":"International Journal of Intelligent Systems"},{"key":"9513_CR28","doi-asserted-by":"crossref","unstructured":"Lewis, M., Yarats, D., Dauphin, Y. N., Parikh, D., & Batra, D. (2017). Deal or no deal? end-to-end learning for negotiation dialogues. arXiv preprint arXiv:1706.05125.","DOI":"10.18653\/v1\/D17-1259"},{"key":"9513_CR29","unstructured":"Li, J., & Cao, Y. D. (2004). Bayesian learning in bilateral multi-issue negotiation and its application in mas-based electronic commerce. In Proceedings. IEEE\/WIC\/ACM international conference on intelligent agent technology IAT 2004) (pp. 437\u2013440). IEEE."},{"key":"9513_CR30","unstructured":"Lillicrap, T. P., Hunt, J. J., Pritzel, A., Heess, N., Erez, T., Tassa, Y., Silver, D., & Wierstra, D. (2015). Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971."},{"key":"9513_CR31","unstructured":"Lillicrap, T. P., Hunt, J. J., Pritzel, A., Heess, N. M. O., Erez, T., Tassa, Y., Silver, D., & Wierstra, D.P. (2017). Continuous control with deep reinforcement learning. US Patent App. 15\/217,758."},{"issue":"1","key":"9513_CR32","doi-asserted-by":"publisher","first-page":"78","DOI":"10.1145\/1629175.1629199","volume":"53","author":"R Lin","year":"2010","unstructured":"Lin, R., & Kraus, S. (2010). Can automated agents proficiently negotiate with humans? Communications of the ACM, 53(1), 78\u201388.","journal-title":"Communications of the ACM"},{"key":"9513_CR33","first-page":"270","volume":"141","author":"R Lin","year":"2006","unstructured":"Lin, R., Kraus, S., Wilkenfeld, J., & Barry, J. (2006). An automated agent for bilateral negotiation with bounded rational agents with incomplete information. Frontiers in Artificial Intelligence and Applications, 141, 270.","journal-title":"Frontiers in Artificial Intelligence and Applications"},{"issue":"1","key":"9513_CR34","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1023\/A:1022232410606","volume":"12","author":"AR Lomuscio","year":"2003","unstructured":"Lomuscio, A. R., Wooldridge, M., & Jennings, N. R. (2003). A classification scheme for negotiation in electronic commerce. Group Decision and Negotiation, 12(1), 31\u201356.","journal-title":"Group Decision and Negotiation"},{"issue":"10","key":"9513_CR35","doi-asserted-by":"publisher","first-page":"2261","DOI":"10.1109\/TCYB.2014.2369015","volume":"45","author":"K Mansour","year":"2014","unstructured":"Mansour, K., & Kowalczyk, R. (2014). Coordinating the bidding strategy in multi-issue multi-object negotiation with single and multiple providers. IEEE Transactions on Cybernetics, 45(10), 2261\u20132272.","journal-title":"IEEE Transactions on Cybernetics"},{"issue":"7540","key":"9513_CR36","doi-asserted-by":"publisher","first-page":"529","DOI":"10.1038\/nature14236","volume":"518","author":"Volodymyr Mnih","year":"2015","unstructured":"Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A. A., Veness, J., Bellemare, M. G., et al. (2015). Human-level control through deep reinforcement learning. Nature, 518(7540), 529.","journal-title":"nature"},{"key":"9513_CR37","unstructured":"Munim, A. (2013). GOLEMLite: A framework for the development of agent-based applications. Master\u2019s thesis, Royal Holloway, University of London."},{"key":"9513_CR38","doi-asserted-by":"crossref","unstructured":"Narayanan, V., & Jennings, N. R. (2006). Learning to negotiate optimally in non-stationary environments. In International workshop on cooperative information agents (pp. 288\u2013300). Springer.","DOI":"10.1007\/11839354_21"},{"key":"9513_CR39","unstructured":"Nguyen, T. D., & Jennings, N. R. (2004). Coordinating multiple concurrent negotiations. In Proceedings of the third international joint conference on autonomous agents and multiagent systems (Vol. 3, pp. 1064\u20131071). IEEE Computer Society."},{"issue":"3","key":"9513_CR40","doi-asserted-by":"publisher","first-page":"83","DOI":"10.1080\/07421222.1996.11518135","volume":"13","author":"JR Oliver","year":"1996","unstructured":"Oliver, J. R. (1996). A machine-learning approach to automated negotiation and prospects for electronic commerce. Journal of management information systems, 13(3), 83\u2013112.","journal-title":"Journal of management information systems"},{"key":"9513_CR41","unstructured":"Oshrat, Y., Lin, R., & Kraus, S. (2009). Facing the challenge of human-agent negotiations via effective general opponent modeling. In Proceedings of the 8th international conference on autonomous agents and multiagent systems (Vol. 1, pp. 377\u2013384). International Foundation for Autonomous Agents and Multiagent Systems."},{"key":"9513_CR42","doi-asserted-by":"crossref","unstructured":"Papangelis, A., & Georgila, K. (2015). Reinforcement learning of multi-issue negotiation dialogue policies. In Proceedings of the 16th annual meeting of the special interest group on discourse and dialogue (pp. 154\u2013158).","DOI":"10.18653\/v1\/W15-4621"},{"issue":"4","key":"9513_CR43","doi-asserted-by":"publisher","first-page":"1824","DOI":"10.3906\/elk-1907-215","volume":"28","author":"Y Razeghi","year":"2020","unstructured":"Razeghi, Y., & Yavuz, C. O. B., & Aydog\u0306an R. . (2020). Deep reinforcement learning for acceptance strategy in bilateral negotiations. Turkish Journal of Electrical Engineering & Computer Sciences, 28(4), 1824\u20131840.","journal-title":"Turkish Journal of Electrical Engineering & Computer Sciences"},{"issue":"4","key":"9513_CR44","doi-asserted-by":"publisher","first-page":"265","DOI":"10.1016\/S1474-0346(03)00015-6","volume":"16","author":"Z Ren","year":"2002","unstructured":"Ren, Z., & Anumba, C. J. (2002). Learning in multi-agent systems: A case study of construction claims negotiation. Advanced Engineering Informatics, 16(4), 265\u2013275.","journal-title":"Advanced Engineering Informatics"},{"key":"9513_CR45","doi-asserted-by":"crossref","unstructured":"Rubinstein, A. (1982). Perfect equilibrium in a bargaining model. Econometrica: Journal of the Econometric Society, 97\u2013109.","DOI":"10.2307\/1912531"},{"key":"9513_CR46","unstructured":"Silver, D., Lever, G., Heess, N., Degris, T., Wierstra, D., & Riedmiller, M. (2014). Deterministic policy gradient algorithms. In Proceedings of the 31st international conference on machine learning (pp. 387\u2013395)."},{"key":"9513_CR47","unstructured":"Sim, K. M., Guo, Y., & Shi, B. (2007). Adaptive bargaining agents that negotiate optimally and rapidly. In 2007 IEEE congress on evolutionary computation (pp. 1007\u20131014). IEEE."},{"issue":"1","key":"9513_CR48","doi-asserted-by":"crossref","first-page":"198","DOI":"10.1109\/TSMCB.2008.2004501","volume":"39","author":"Kwang Mong Sim","year":"2008","unstructured":"Sim, K. M., Guo, Y., & Shi, B. (2008). Blgan: Bayesian learning and genetic algorithm for supporting negotiation with incomplete information. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 39(1), 198\u2013211.","journal-title":"IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)"},{"key":"9513_CR49","doi-asserted-by":"crossref","unstructured":"Sridharan, M., & Tesauro, G. (2002). Multi-agent q-learning and regression trees for automated pricing decisions. In Game theory and decision theory in agent-based systems (pp. 217\u2013234). Springer.","DOI":"10.1007\/978-1-4615-1107-6_11"},{"issue":"13","key":"9513_CR50","first-page":"2773","volume":"8","author":"T Sun","year":"2011","unstructured":"Sun, T., Zhu, Q., Xia, Y., & Cao, F. (2011). A bilateral price negotiation strategy based on Bayesian classification and q-learning. Journal of Information & Computational Science, 8(13), 2773\u20132780.","journal-title":"Journal of Information & Computational Science"},{"key":"9513_CR51","first-page":"1057","volume":"99","author":"Richard S Sutton","year":"1999","unstructured":"Sutton, R. S., McAllester, D. A., Singh, S. P., & Mansour, Y. (2000). Policy gradient methods for reinforcement learning with function approximation. Advances in Neural Information Processing Systems, 1057\u20131063.","journal-title":"Advances in Neural Information Processing Systems"},{"key":"9513_CR52","unstructured":"Sycara, K., Zeng, D., et\u00a0al. (1997). Benefits of learning in negotiation. In Proceedings of the AAAI national conference on artificial intelligence (pp. 36\u201341). Menlo Park, California"},{"key":"9513_CR53","doi-asserted-by":"crossref","unstructured":"Tesauro, G. (2000). Pricing in agent economies using neural networks and multi-agent q-learning. In Sequence learning (pp. 288\u2013307). Springer.","DOI":"10.1007\/3-540-44565-X_13"},{"issue":"3","key":"9513_CR54","doi-asserted-by":"publisher","first-page":"289","DOI":"10.1023\/A:1015504423309","volume":"5","author":"G Tesauro","year":"2002","unstructured":"Tesauro, G., & Kephart, J. O. (2002). Pricing in agent economies using multi-agent q-learning. Autonomous Agents and Multi-agent Systems, 5(3), 289\u2013304.","journal-title":"Autonomous Agents and Multi-agent Systems"},{"key":"9513_CR55","unstructured":"Williams, C. R., Robu, V., Gerding, E. H., & Jennings, N. R. (2012). Negotiating concurrently with unknown opponents in complex, real-time domains. In Proceedings of the 20th European conference on artificial intelligence."},{"key":"9513_CR56","doi-asserted-by":"crossref","unstructured":"Yu, C., Ren, F., & Zhang, M. (2013). An adaptive bilateral negotiation model based on bayesian learning. In Complex automated negotiations: Theories, models, and software competitions (pp. 75\u201393). Springer.","DOI":"10.1007\/978-3-642-30737-9_5"},{"issue":"1","key":"9513_CR57","doi-asserted-by":"publisher","first-page":"125","DOI":"10.1006\/ijhc.1997.0164","volume":"48","author":"D Zeng","year":"1998","unstructured":"Zeng, D., & Sycara, K. (1998). Bayesian learning in negotiation. International Journal of Human-Computer Studies, 48(1), 125\u2013141.","journal-title":"International Journal of Human-Computer Studies"},{"key":"9513_CR58","doi-asserted-by":"publisher","first-page":"108","DOI":"10.1016\/j.knosys.2015.04.006","volume":"84","author":"J Zhang","year":"2015","unstructured":"Zhang, J., Ren, F., & Zhang, M. (2015). Bayesian-based preference prediction in bilateral multi-issue negotiation between intelligent agents. Knowledge-Based Systems, 84, 108\u2013120.","journal-title":"Knowledge-Based Systems"},{"key":"9513_CR59","doi-asserted-by":"crossref","unstructured":"Zhang, M., Tan, Z., Zhao, J., & Li, L. (2008): A bayesian learning model in the agent-based bilateral negotiation between the coal producers and electric power generators. In 2008 International symposium on intelligent information technology application workshops (pp. 859\u2013862). IEEE.","DOI":"10.1109\/IITA.Workshops.2008.144"},{"key":"9513_CR60","unstructured":"Zhang, X., & Ma, H. (2018). Pretraining deep actor-critic reinforcement learning algorithms with expert demonstrations. arXiv preprint arXiv:1801.10459."},{"issue":"7","key":"9513_CR61","doi-asserted-by":"publisher","first-page":"e102840","DOI":"10.1371\/journal.pone.0102840","volume":"9","author":"Y Zou","year":"2014","unstructured":"Zou, Y., Zhan, W., & Shao, Y. (2014). Evolution with reinforcement learning in negotiation. PLoS ONE, 9(7), e102840.","journal-title":"PLoS ONE"}],"container-title":["Autonomous Agents and Multi-Agent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10458-021-09513-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10458-021-09513-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10458-021-09513-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,24]],"date-time":"2021-09-24T13:29:15Z","timestamp":1632490155000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10458-021-09513-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,6,7]]},"references-count":61,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2021,10]]}},"alternative-id":["9513"],"URL":"https:\/\/doi.org\/10.1007\/s10458-021-09513-x","relation":{},"ISSN":["1387-2532","1573-7454"],"issn-type":[{"value":"1387-2532","type":"print"},{"value":"1573-7454","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,6,7]]},"assertion":[{"value":"28 May 2021","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 June 2021","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"27"}}