{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,9,5]],"date-time":"2024-09-05T22:06:21Z","timestamp":1725573981920},"publisher-location":"Berlin, Heidelberg","reference-count":26,"publisher":"Springer Berlin Heidelberg","isbn-type":[{"type":"print","value":"9783540407980"},{"type":"electronic","value":"9783540452171"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2003]]},"DOI":"10.1007\/978-3-540-45217-1_19","type":"book-chapter","created":{"date-parts":[[2011,1,7]],"date-time":"2011-01-07T00:35:33Z","timestamp":1294360533000},"page":"250-265","source":"Crossref","is-referenced-by-count":1,"title":["Exchanging Advice and Learning to Trust"],"prefix":"10.1007","author":[{"given":"Lu\u00eds","family":"Nunes","sequence":"first","affiliation":[]},{"given":"Eug\u00e9nio","family":"Oliveira","sequence":"additional","affiliation":[]}],"member":"297","reference":[{"key":"19_CR1","unstructured":"Nunes, L., Oliveira, E.: On learning by exchanging advice. In: Proc. of the First Symposium on Adaptive Agents and Multi-Agent Systems (AAMAS\/AISB 2002), pp. 29\u201340 (2002)"},{"key":"19_CR2","unstructured":"Dorigo, M., Colombetti, M.: The role of the trainer in reinforcement learning. In: Mahadevan, S. (ed.) Proc. of MLC-COLT 1994, pp. 37\u201345 (1994)"},{"key":"19_CR3","doi-asserted-by":"crossref","unstructured":"Clouse, J.A.: On integrating apprentice learning and reinforcement learning. PhD thesis, University of Massachusetts, Department of Computer Science (1997)","DOI":"10.1016\/S0166-4115(97)80108-2"},{"key":"19_CR4","doi-asserted-by":"crossref","unstructured":"Nunes, L., Oliveira, E.: Advice-exchange in heterogeneous groups of learning agents. Technical Report 1 12\/02, FEUP\/LIACC (2002)","DOI":"10.1145\/860575.860807"},{"key":"19_CR5","unstructured":"Nunes, L., Oliveira, E.: Advice exchange between evolutionary algorithms and reinforcement learning agents: Experimental results in the pursuit domain. In: Proc. of the Second Symposium on Adaptive Agents and Multi-Agent Systems, AAMAS\/AISB 2003 (2003)"},{"key":"19_CR6","unstructured":"Nunes, L., Oliveira, E.: Advice exchange architecture. Technical Report 3 04\/03, FEUP\/LIACC (2003)"},{"key":"19_CR7","doi-asserted-by":"crossref","unstructured":"Rumelhart, D.E., Zipser, D.: Feature discovery by competitive learning. Cognitive Science\u00a09 (1985)","DOI":"10.1207\/s15516709cog0901_5"},{"key":"19_CR8","first-page":"318","volume":"1","author":"D.E. Rumelhart","year":"1986","unstructured":"Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning internal representations by error propagation. Parallel Distributed Processing: Exploration in the Microstructure of Cognition\u00a01, 318\u2013362 (1986)","journal-title":"Parallel Distributed Processing: Exploration in the Microstructure of Cognition"},{"key":"19_CR9","first-page":"279","volume":"8","author":"C.J.C.H. Watkins","year":"1992","unstructured":"Watkins, C.J.C.H., Dayan, P.D.: Technical note: Q-learning. Machine Learning\u00a08, 279\u2013292 (1992)","journal-title":"Machine Learning"},{"key":"19_CR10","unstructured":"Whitehead, S.D.: A complexity analisys of cooperative mechanisms in reinforcement learning. In: Proc. of the 9th National Conf. on AI (AAAI 1991), pp. 607\u2013613 (1991)"},{"key":"19_CR11","unstructured":"Holland, J.H.: Adaptation in Natural and Artificial Systems. University of Michigan Press (1975)"},{"key":"19_CR12","volume-title":"Genetic programming: On the Programming of Computers by Means of Natural Selection","author":"J.R. Koza","year":"1992","unstructured":"Koza, J.R.: Genetic programming: On the Programming of Computers by Means of Natural Selection. MIT Press, Cambridge (1992)"},{"key":"19_CR13","unstructured":"Glickman, M., Sycara, K.: Evolution of goal-directed behavior using limited information in a complex environment. In: Proc. of the Genetic and Evolutionary Computation Conference (GECCO 1999) (1999)"},{"key":"19_CR14","unstructured":"Benda, M., Jagannathan, V., Dodhiawalla, R.: On optimal cooperation of knowledge resources. Technical Report BCS G-2012-28, Boeing AI Center, Boeing Computer Services, Bellevue, WA (1985)"},{"key":"19_CR15","doi-asserted-by":"crossref","unstructured":"Tan, M.: Multi-agent reinforcement learning: Independent vs. cooperative agents. In: Proc. of the Tenth Int. Conf. on Machine Learning, pp. 330\u2013337 (1993)","DOI":"10.1016\/B978-1-55860-307-3.50049-6"},{"key":"19_CR16","unstructured":"Haynes, T., Wainwright, R., Sen, S., Schoenfeld, D.: Strongly typed genetic programming in evolving cooperation strategies. In: Proc. of the Sixth Int. Conf. on Genetic Algorithms, pp. 271\u2013278 (1995)"},{"key":"19_CR17","unstructured":"Sen, S., Sekaran, M., Hale, J.: Learning to coordinate without sharing information. In: Proc. of the National Conf. on AI, pp. 426\u2013431 (1994)"},{"key":"19_CR18","first-page":"293","volume":"8","author":"L.J. Lin","year":"1992","unstructured":"Lin, L.J.: Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning\u00a08, 293\u2013321 (1992)","journal-title":"Machine Learning"},{"key":"19_CR19","doi-asserted-by":"crossref","unstructured":"Littman, M.L.: Markov games as a framework for multi-agent reinforcement learning. In: Proc. of the Eleventh International Conference on Machine Learning, pp. 157\u2013163 (1994)","DOI":"10.1016\/B978-1-55860-335-6.50027-1"},{"key":"19_CR20","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1016\/0921-8890(95)00004-Y","volume":"15","author":"S. Thrun","year":"1995","unstructured":"Thrun, S., Mitchell, T.: Lifelong robot learning. Robotics and Autonomous Systems\u00a015, 25\u201346 (1995)","journal-title":"Robotics and Autonomous Systems"},{"key":"19_CR21","first-page":"251","volume":"22","author":"R. Maclin","year":"1996","unstructured":"Maclin, R., Shavlik, J.: Creating advicetaking reinforcement learners. Machine Learning 22, 251\u2013281 (1996)","journal-title":"Machine Learning"},{"key":"19_CR22","unstructured":"Matari\u0107, M.J.: Using communication to reduce locality in distributed multi-agent learning. Computer Science Technical Report CS-96-190, Brandeis University (1996)"},{"key":"19_CR23","unstructured":"Claus, C., Boutilier, C.: The dynamics of reinforcement learning in cooperative multiagent systems. In: Proc. of the Fifteenth National Conference on Artificial Intelligence, Madison, WI, pp. 746\u2013752 (1998)"},{"key":"19_CR24","unstructured":"Price, B., Boutilier, C.: Implicit imitation in multiagent reinforcement learning. In: Proc. of the Sixteenth Int. Conf. on Machine Learning, pp. 325\u2013334 (1999)"},{"key":"19_CR25","doi-asserted-by":"crossref","unstructured":"Berenji, H.R., Vengerov, D.: Advantages of cooperation between reinforcement learning agents in difficult stochastic problems. In: Proc. of the Nineth IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2000) (2000)","DOI":"10.1109\/FUZZY.2000.839146"},{"key":"19_CR26","unstructured":"Price, B., Boutilier, C.: Imitation and reinforcement learning in agents with heterogeneous actions. In: Proc. of the Seveteenth Int. Conf. on Machine Learning (ICML2000)(2000)"}],"container-title":["Lecture Notes in Computer Science","Cooperative Information Agents VII"],"original-title":[],"link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-540-45217-1_19","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,4]],"date-time":"2023-06-04T15:34:29Z","timestamp":1685892869000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/978-3-540-45217-1_19"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2003]]},"ISBN":["9783540407980","9783540452171"],"references-count":26,"URL":"https:\/\/doi.org\/10.1007\/978-3-540-45217-1_19","relation":{},"ISSN":["0302-9743","1611-3349"],"issn-type":[{"type":"print","value":"0302-9743"},{"type":"electronic","value":"1611-3349"}],"subject":[],"published":{"date-parts":[[2003]]}}}