{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,16]],"date-time":"2026-01-16T07:57:56Z","timestamp":1768550276902,"version":"3.49.0"},"reference-count":52,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2005,3,1]],"date-time":"2005-03-01T00:00:00Z","timestamp":1109635200000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Auton Agent Multi-Agent Syst"],"published-print":{"date-parts":[[2005,3]]},"DOI":"10.1007\/s10458-004-6977-7","type":"journal-article","created":{"date-parts":[[2005,2,25]],"date-time":"2005-02-25T12:16:09Z","timestamp":1109333769000},"page":"103-130","source":"Crossref","is-referenced-by-count":15,"title":["Learning and Exploiting Relative Weaknesses of Opponent Agents"],"prefix":"10.1007","volume":"10","author":[{"given":"Shaul","family":"Markovitch","sequence":"first","affiliation":[]},{"given":"Ronit","family":"Reger","sequence":"additional","affiliation":[]}],"member":"297","reference":[{"key":"CR1","volume-title":"A knowledge-based approach of Connect-Four - the game is solved White wins, Master?s thesis, Department of mathematics and Computer Science","author":"V Allis","year":"1988"},{"key":"CR2","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1016\/S0019-9958(78)90683-6","volume":"39","author":"D Angluin","year":"1978","journal-title":"Information and Control"},{"key":"CR3","unstructured":"C. Atkeson, and J. Santamaria, ?A comparison of direct and model-based reinforcement learning,? 1997."},{"key":"CR4","unstructured":"D. Billings, D. Papp, J. Schaeffer, and D. Szafron, ?Opponent modeling in poker?, in Proceedings of the Fifteenth National Conference on Artificial Intelligence, Madison, Wisconsin, pp. 493?499, 1998."},{"key":"CR5","unstructured":"J. Bruce, M. Bowling, B. Browning, and M. Veloso, ?Multi-robot team response to a multi-robot opponent team?, in Proceedings of IROS-2002 workshop on Collaborative Robots, 2002."},{"key":"CR6","unstructured":"D. Carmel, and S. Markovitch, ?Learning models of the opponent?s strategy in game-playing,? in, Proceedings of The AAAI Fall Symposium on Games: Planning and Learning, North Carolina, 1993."},{"key":"CR7","unstructured":"D. Carmel, and S. Markovitch, ?Incorporating Opponent Models into Adversary Search?. in, Proceedings of the Thirteenth National Conference on Artificial Intelligence. Portland, Oregon, pp. 120?125."},{"key":"CR8","unstructured":"D. Carmel, and S. Markovitch, ?Learning and using opponent models in adversary search?, Technical Report CIS9609, Technion, 1996b."},{"key":"CR9","unstructured":"D. Carmel, and S. Markovitch, ?Learning models of intelligent agents?, in, Proceedings of the Thirteenth National Conference on Artificial Intelligence. Portland, Oregon, pp. 62?67, 1996c."},{"issue":"3","key":"CR10","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1080\/095281398146789","volume":"10","author":"D Carmel","year":"1998","journal-title":"Journal of Experimental and Theoretical Artificial Intelligence"},{"issue":"2","key":"CR11","doi-asserted-by":"crossref","first-page":"141","DOI":"10.1023\/A:1010007108196","volume":"2","author":"D Carmel","year":"1999","journal-title":"Autonomous Agents and Multi-agent Systems"},{"issue":"3-4","key":"CR12","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1016\/S0020-0255(01)00133-5","volume":"135","author":"H. Donkers","year":"2001","journal-title":"Information Sciences"},{"key":"CR13","doi-asserted-by":"crossref","unstructured":"Y. Freund, M. Kearns, Y. Mansour, D. Ron, and R. Rubinfeld, ?Efficient algorithms for learning to play repeated games against computationally bounded adversaries?, in, Proceeding. of the 36th Annual Symposium on Foundations of Computer Science. IEEE Computer Society Press, Los Alamitos, CA, pp. 332?341, 1995.","DOI":"10.1109\/SFCS.1995.492489"},{"issue":"1-2","key":"CR14","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1016\/S0304-3975(00)00077-3","volume":"252","author":"X. Gao","year":"2001","journal-title":"Theoretical Computer Science"},{"key":"CR15","unstructured":"X. Gao, H. Iida, J. W. H. M. Uiterwijk, and H. J. van den Herik, ?Performance of (D,d)-OM search in othello?, in, Proceedings of JSSST 14th Conference, Shikawa, Japan, pp. 229?232, 1997."},{"key":"CR16","doi-asserted-by":"crossref","first-page":"74","DOI":"10.1007\/3-540-48957-6_5","volume":"1558","author":"X. Gao","year":"1999","journal-title":"Lecture Notes in Computer Science"},{"key":"CR17","volume-title":"Proceedings of the First International Conference on Multi-Agent Systems (ICMAS-95)","author":"P.J. Gmytrasiewicz","year":"1995"},{"issue":"1-2","key":"CR18","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1023\/A:1008269427670","volume":"8","author":"P.J. Gmytrasiewicz","year":"1998","journal-title":"User Modeling and User-Adapted Interaction"},{"key":"CR19","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1007\/978-1-4613-9080-0_5","volume-title":"Computers, Chess and Cognition","author":"F.-H. Hsu","year":"1990"},{"key":"CR20","unstructured":"Y.-J. Hu, and D. F. Kibler, ?Generation of attributes for learning algorithms?, in, Proceedings of the Thirteenth National Conference on Artificial Intelligence and the Eighth Innovative Applications of Artificial Intelligence Conference, Menlo Park, AAAI Press \/ MIT Press, pp. 806?811, 1996."},{"issue":"4","key":"CR21","first-page":"201","volume":"16","author":"J.H. Iida","year":"1993","journal-title":"ICCA Journal"},{"issue":"1","key":"CR22","first-page":"10","volume":"17","author":"J.H. Iida","year":"1994","journal-title":"ICCA Journal"},{"key":"CR23","unstructured":"P. J. Jansen, ?Using knowledge about the opponent in game-tree search?, Ph.D. thesis, Carnegie Mellon University, 1992."},{"key":"CR24","unstructured":"A. Junghanns, and J. Schaeffer, ?Search versus knowledge in game-playing programs revisited?. in Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence (IJCAI-97). Nagoya, Japan, pp. 692?697, 1997."},{"key":"CR25","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1613\/jair.301","volume":"4","author":"L. P. Kaelbling","year":"1996","journal-title":"Journal of Artificial Intelligence Research"},{"key":"CR26","doi-asserted-by":"crossref","unstructured":"M.L. Littman, ?Markov games as a framework for multi-agent reinforcement learning?, in, Proceedings of the 11th International Conference on Machine Learning (ML-94), New Brunswick, NJ, Morgan Kaufmann, pp. 157?163, 1994.","DOI":"10.1016\/B978-1-55860-335-6.50027-1"},{"key":"CR27","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1023\/A:1014046307775","volume":"49","author":"S. Markovitch","year":"2001","journal-title":"Machine Learning"},{"issue":"1","key":"CR28","doi-asserted-by":"crossref","first-page":"88","DOI":"10.1111\/j.1467-8640.1996.tb00254.x","volume":"12","author":"S. Markovitch","year":"1996","journal-title":"Computational Intelligence"},{"key":"CR29","unstructured":"C. J. Matheus, and L. A. Rendell, ?Constructive induction on decision trees?, in, N. S. Sridharan (ed.), Proceedings of the 11th International Joint Conference on Artificial Intelligence, Detroit, MI, USA, Morgan Kaufmann, pp. 645?650, 1989."},{"key":"CR30","first-page":"103","volume":"13","author":"A.W. Moore","year":"1993","journal-title":"Machine Learning"},{"key":"CR31","unstructured":"Y. Mor, C. Goldman, and J. Rosenschein, ?Learn your opponent?s strategy (in polynomial time)!?, in, G. Weiss and S. Sen (eds.), Adaptation and Learning in Multi-agent Systems, Lecture Notes in Artificial Intelligence, vol. 1042. Springer-Verlag, 1996."},{"key":"CR32","unstructured":"D.S.Nau 1980A?Pathology on game trees summary of results?, in Proceedings of the First National Conference on Artificial Intelligence, Stanford, California, pp. 102?104"},{"key":"CR33","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1016\/0004-3702(82)90002-9","volume":"19","author":"D.S. Nau","year":"1982","journal-title":"Artificial Intelligence"},{"issue":"1","key":"CR34","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1023\/A:1022611825350","volume":"5","author":"G. Pagallo","year":"1990","journal-title":"Machine Learning"},{"key":"CR35","doi-asserted-by":"crossref","first-page":"427","DOI":"10.1016\/0004-3702(83)90004-8","volume":"20","author":"J. Pearl","year":"1983","journal-title":"Artificial Intelligence"},{"key":"CR36","first-page":"81","volume":"1","author":"J.R. Quinlan","year":"1986","journal-title":"In Machine Learning"},{"key":"CR37","doi-asserted-by":"crossref","unstructured":"A. Reibman, and B. Ballard, ?Non-minimax strategies for use against fallible opponents?, in, Proceedings of the international conference on artificial intelligence AAAI-83, Los Altos, CA, William Kaufman,pp. 338?343, 1983.","DOI":"10.21236\/ADA127487"},{"key":"CR38","doi-asserted-by":"crossref","DOI":"10.7551\/mitpress\/2474.001.0001","volume-title":"Do the right thing :studies in limited rationality, Artificial Intelligence","author":"S. Russell","year":"1991"},{"key":"CR39","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1016\/0303-2647(95)01551-5","volume":"37","author":"T.W. Sandholm","year":"1995","journal-title":"Biosystems Journal"},{"key":"CR40","unstructured":"R. Schapire, P. Stone, D. McAllester, M. Littman, and J. Csirik, ?Modeling auction price uncertainty using boosting-based conditional density estimation?, in Proceedings of the Nineteenth International Conference on Machine Learning, 2002."},{"key":"CR41","unstructured":"S. Sen, and N. Arora, ?Learning to take risks?, in AAAI-97 Workshop on Multiagent Learning, pp. 59?64, 1997."},{"key":"CR42","unstructured":"S. Sen, and G. Weiss, ?Learning in multiagent systems,? in, G. Weiss (ed.),Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence. Cambridge, Massachusetts, The MIT Press, Chapt. 6, pp. 259--298, 1999."},{"key":"CR43","volume-title":"Models of Bounded Rationality","author":"H. A. Simon","year":"1982"},{"key":"CR44","unstructured":"P.Stone, P. Riley, and M. Veloso, ?Defining and using ideal teammate and opponent agent models?, in, Proceedings of the 7th Conference on Artificial Intelligence (AAAI-00) and of the 12th Conference on Innovative Applications of Artificial Intelligence (IAAI-00). Menlo Park, CA, AAAI Press, pp. 1040?1045, 2000."},{"key":"CR45","doi-asserted-by":"crossref","unstructured":"R.Sutton, ?Integrated architectures for learning, planning, and reacting based on approximating dynamic programming?, in Proceedings of the Seventh International Conference on Machine Learning. pp. 216?224, 1990.","DOI":"10.1016\/B978-1-55860-141-3.50030-4"},{"key":"CR46","unstructured":"W. T. B. Uther, and M. M. Veloso, ?Generalizing adversarial reinforcement learning?, in Proceedings of the AAAI Fall Symposium on Model Directed Autonomous Systems, 1997."},{"key":"CR47","unstructured":"J. M. Vidal, and E. H. Durfee, ?The impact of nested agent models in an information economy?, in, V. Lesser (ed.), Proceedings of the Second International Conference on Multi-Agent Systems (ICMAS?96). Kyoto, Japan, The MIT Press, Cambridge, MA, USA, 1995."},{"key":"CR48","unstructured":"J. M. Vidal, and E. H. Durfee, ?Using recursive agent models effectively,? in M. Wooldridge, J. P. M\u00fcller, and M. Tambe (eds.),Proceedings on the IJCAI Workshop on Intelligent Agents II: Agent Theories, Architectures, and Languages, vol. 1037 of LNAI. Springer-Verlag, Heidelberg, Germany, pp. 171--186, 1996."},{"key":"CR49","unstructured":"C. J. Watkins, ?Learning from delayed rewards?, Ph.D. thesis, University of Cambridge, 1989."},{"key":"CR50","first-page":"279","volume":"8","author":"C.J. Watkins","year":"1992","journal-title":"Machine Learning"},{"key":"CR51","unstructured":"G. Weiss, and S. Sen, Adaptation and learning in multi-agent systems, Lectures Notes in Articial Intelligence, vol. 1042. Springer-Verlag, 1996."},{"key":"CR52","unstructured":"S. Zilberstein,?Optimizing decision quality with contract algorithms?. in, Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence. Montreal, Canada, pp. 1576?1582, 1995."}],"container-title":["Autonomous Agents and Multi-Agent Systems"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10458-004-6977-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s10458-004-6977-7\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10458-004-6977-7","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,23]],"date-time":"2024-12-23T22:12:22Z","timestamp":1734991942000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10458-004-6977-7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2005,3]]},"references-count":52,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2005,3]]}},"alternative-id":["6977"],"URL":"https:\/\/doi.org\/10.1007\/s10458-004-6977-7","relation":{},"ISSN":["1387-2532","1573-7454"],"issn-type":[{"value":"1387-2532","type":"print"},{"value":"1573-7454","type":"electronic"}],"subject":[],"published":{"date-parts":[[2005,3]]}}}