{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,4,1]],"date-time":"2022-04-01T05:00:15Z","timestamp":1648789215993},"reference-count":10,"publisher":"World Scientific Pub Co Pte Lt","issue":"07","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Patt. Recogn. Artif. Intell."],"published-print":{"date-parts":[[2014,11]]},"abstract":"<jats:p> Transfer learning focuses on developing methods to reuse information gathered from a source task in order to improve the learning performance in a related task. In this work, we present a novel approach to transfer knowledge between tasks in a reinforcement learning (RL) framework with continuous states and actions, where the transition and policy functions are approximated by Gaussian processes. The novelty in the proposed approach lies in the idea of transferring information about the hyper-parameters of the state transition function from the source task, which represents qualitative knowledge about the type of transition function that the target task might have, constraining the search space and accelerating the learning process. We performed experiments on relevant tasks for RL, which show a clear improvement in the overall performance when compared to state-of-the-art reinforcement learning and transfer learning algorithms for continuous state and action spaces. <\/jats:p>","DOI":"10.1142\/s0218001414600076","type":"journal-article","created":{"date-parts":[[2014,8,3]],"date-time":"2014-08-03T23:08:57Z","timestamp":1407107337000},"page":"1460007","source":"Crossref","is-referenced-by-count":1,"title":["TRANSFER LEARNING FOR CONTINUOUS STATE AND ACTION SPACES"],"prefix":"10.1142","volume":"28","author":[{"given":"ESTEBAN O.","family":"GARC\u00cdA","sequence":"first","affiliation":[{"name":"Computer Science, Instituto Nacional de Astrofisica, Optica y Electronica, Tonantzintla, Puebla 72840, Mexico"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"ENRIQUE","family":"MUNOZ DE COTE","sequence":"additional","affiliation":[{"name":"Computer Science, Instituto Nacional de Astrofisica, Optica y Electronica, Tonantzintla, Puebla 72840, Mexico"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"EDUARDO F.","family":"MORALES","sequence":"additional","affiliation":[{"name":"Computer Science, Instituto Nacional de Astrofisica, Optica y Electronica, Tonantzintla, Puebla 72840, Mexico"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"219","published-online":{"date-parts":[[2014,10,14]]},"reference":[{"key":"rf1","doi-asserted-by":"publisher","DOI":"10.1177\/0278364910371999"},{"key":"rf8","doi-asserted-by":"publisher","DOI":"10.1109\/TAC.2011.2179426"},{"key":"rf9","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1613\/jair.904","volume":"16","author":"Drummond C.","year":"2002","journal-title":"J. Artif. Intell. Res. (JAIR)"},{"key":"rf14","volume-title":"Reinforcement Learning: State of the Art","author":"Hasselt H. V.","year":"2011"},{"key":"rf18","doi-asserted-by":"publisher","DOI":"10.1016\/j.neucom.2010.07.027"},{"key":"rf19","series-title":"Adaptive Computation and Machine Learning Series","volume-title":"Machine Learning: A Probabilistic Perspective","author":"Murphy K. P.","year":"2012"},{"key":"rf24","first-page":"69","volume":"14","author":"Rasmussen C. E.","year":"2006","journal-title":"Int. J. Neural Syst."},{"key":"rf27","doi-asserted-by":"crossref","DOI":"10.1109\/TNN.1998.712192","volume-title":"Introduction to Reinforcement Learning","author":"Sutton R.","year":"1998"},{"key":"rf28","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-87481-2_32"},{"key":"rf29","first-page":"1633","volume":"10","author":"Taylor M. E.","year":"2009","journal-title":"J. Mach. Learning Res."}],"container-title":["International Journal of Pattern Recognition and Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218001414600076","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,8,6]],"date-time":"2019-08-06T11:44:01Z","timestamp":1565091841000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0218001414600076"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,10,14]]},"references-count":10,"journal-issue":{"issue":"07","published-online":{"date-parts":[[2014,10,14]]},"published-print":{"date-parts":[[2014,11]]}},"alternative-id":["10.1142\/S0218001414600076"],"URL":"https:\/\/doi.org\/10.1142\/s0218001414600076","relation":{},"ISSN":["0218-0014","1793-6381"],"issn-type":[{"value":"0218-0014","type":"print"},{"value":"1793-6381","type":"electronic"}],"subject":[],"published":{"date-parts":[[2014,10,14]]}}}