{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,30]],"date-time":"2026-04-30T04:18:57Z","timestamp":1777522737295,"version":"3.51.4"},"reference-count":23,"publisher":"SAGE Publications","issue":"1","license":[{"start":{"date-parts":[[2014,12,9]],"date-time":"2014-12-09T00:00:00Z","timestamp":1418083200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["Adaptive Behavior"],"published-print":{"date-parts":[[2015,2]]},"abstract":"<jats:p>When transferring knowledge between reinforcement learning agents with different state representations or actions, past knowledge must be efficiently mapped to novel tasks so that it aids learning. The majority of the existing approaches use pre-defined mappings provided by a domain expert. To overcome this limitation and enable autonomous transfer learning, this paper introduces a method for weighting and using multiple inter-task mappings based on a probabilistic framework. Experimental results show that the use of multiple inter-task mappings, accompanied with a probabilistic selection mechanism, can significantly boost the performance of transfer learning relative to 1) learning without transfer and 2) using a single hand-picked mapping. We especially introduce novel tasks for transfer learning in a realistic simulation of the iCub robot, demonstrating the ability of the method to select mappings in complex tasks where human intuition could not be applied to select them. The results verified the efficacy of the proposed approach in a real world and complex environment.<\/jats:p>","DOI":"10.1177\/1059712314559525","type":"journal-article","created":{"date-parts":[[2014,12,9]],"date-time":"2014-12-09T22:59:55Z","timestamp":1418165995000},"page":"3-19","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":13,"title":["Transfer learning with probabilistic mapping selection"],"prefix":"10.1177","volume":"23","author":[{"given":"Anestis","family":"Fachantidis","sequence":"first","affiliation":[{"name":"Department of Informatics, Aristotle University of Thessaloniki, Greece"}]},{"given":"Ioannis","family":"Partalas","sequence":"additional","affiliation":[{"name":"University of Grenoble Alpes, LIG, Grenoble, France"}]},{"given":"Matthew E","family":"Taylor","sequence":"additional","affiliation":[{"name":"School of Electrical Engineering and Computer Science, Washington State University, WA, USA"}]},{"given":"Ioannis","family":"Vlahavas","sequence":"additional","affiliation":[{"name":"Department of Informatics, Aristotle University of Thessaloniki, Greece"}]}],"member":"179","published-online":{"date-parts":[[2014,12,9]]},"reference":[{"key":"bibr1-1059712314559525","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-40991-2_29"},{"key":"bibr2-1059712314559525","volume-title":"International conference on autonomous agents and multiagent systems (AAMAS)","author":"Ammar H.B.","year":"2012"},{"key":"bibr3-1059712314559525","first-page":"213","volume":"3","author":"Brafman R.I.","year":"2003","journal-title":"Journal of Machine Learning Research"},{"key":"bibr4-1059712314559525","volume-title":"Developmental robotics: From babies to robots","author":"Cangelosi A.","year":"2012"},{"key":"bibr5-1059712314559525","doi-asserted-by":"publisher","DOI":"10.1145\/1160633.1160762"},{"key":"bibr6-1059712314559525","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-73580-9_21"},{"key":"bibr7-1059712314559525","doi-asserted-by":"publisher","DOI":"10.1145\/1390156.1390225"},{"key":"bibr8-1059712314559525","doi-asserted-by":"publisher","DOI":"10.1007\/11552246_35"},{"key":"bibr9-1059712314559525","doi-asserted-by":"publisher","DOI":"10.1109\/DevLrn.2012.6400810"},{"key":"bibr10-1059712314559525","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-77296-5_32"},{"key":"bibr11-1059712314559525","doi-asserted-by":"publisher","DOI":"10.1007\/BF00114726"},{"key":"bibr12-1059712314559525","first-page":"741","volume-title":"Proceedings of the 8th international conference on autonomous agents and multiagent systems \u2013 volume 2","author":"Sorg J.","year":"2009"},{"key":"bibr13-1059712314559525","doi-asserted-by":"publisher","DOI":"10.1109\/ADPRL.2007.368165"},{"key":"bibr14-1059712314559525","doi-asserted-by":"publisher","DOI":"10.1109\/TNN.1998.712192"},{"key":"bibr15-1059712314559525","first-page":"1065","volume-title":"20th international joint conferences on artificial intelligence","author":"Talvitie E.","year":"2007"},{"key":"bibr16-1059712314559525","first-page":"2133","volume":"10","author":"Tanner B.","year":"2010","journal-title":"Journal of Machine Learning Research"},{"key":"bibr17-1059712314559525","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-87481-2_32"},{"key":"bibr18-1059712314559525","first-page":"283","volume-title":"7th international conference on autonomous agents and multiagent systems","author":"Taylor M.E.","year":"2008"},{"issue":"1","key":"bibr19-1059712314559525","first-page":"1633","volume":"10","author":"Taylor M.E.","year":"2009","journal-title":"Journal of Machine Learning Research"},{"key":"bibr20-1059712314559525","first-page":"2125","volume":"8","author":"Taylor M.E.","year":"2007","journal-title":"Journal of Machine Learning Research"},{"key":"bibr21-1059712314559525","doi-asserted-by":"publisher","DOI":"10.1145\/1774674.1774684"},{"key":"bibr22-1059712314559525","doi-asserted-by":"publisher","DOI":"10.1007\/11564096_40"},{"key":"bibr23-1059712314559525","doi-asserted-by":"publisher","DOI":"10.1145\/1273496.1273624"}],"container-title":["Adaptive Behavior"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1059712314559525","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/1059712314559525","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1059712314559525","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,28]],"date-time":"2026-04-28T16:18:32Z","timestamp":1777393112000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/1059712314559525"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,12,9]]},"references-count":23,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2015,2]]}},"alternative-id":["10.1177\/1059712314559525"],"URL":"https:\/\/doi.org\/10.1177\/1059712314559525","relation":{},"ISSN":["1059-7123","1741-2633"],"issn-type":[{"value":"1059-7123","type":"print"},{"value":"1741-2633","type":"electronic"}],"subject":[],"published":{"date-parts":[[2014,12,9]]}}}