{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,9,5]],"date-time":"2024-09-05T13:05:59Z","timestamp":1725541559322},"publisher-location":"Berlin, Heidelberg","reference-count":38,"publisher":"Springer Berlin Heidelberg","isbn-type":[{"type":"print","value":"9783642051760"},{"type":"electronic","value":"9783642051777"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010]]},"DOI":"10.1007\/978-3-642-05177-7_7","type":"book-chapter","created":{"date-parts":[[2009,12,5]],"date-time":"2009-12-05T08:44:48Z","timestamp":1260002688000},"page":"147-170","source":"Crossref","is-referenced-by-count":8,"title":["Transfer Learning via Advice Taking"],"prefix":"10.1007","author":[{"given":"Lisa","family":"Torrey","sequence":"first","affiliation":[]},{"given":"Jude","family":"Shavlik","sequence":"additional","affiliation":[]},{"given":"Trevor","family":"Walker","sequence":"additional","affiliation":[]},{"given":"Richard","family":"Maclin","sequence":"additional","affiliation":[]}],"member":"297","reference":[{"key":"7_CR1","unstructured":"Asadi, M., Huber, M.: Effective control knowledge transfer through learning skill and representation hierarchies. In: International Joint Conference on Artificial Intelligence, Hyderabad, India (2007)"},{"key":"7_CR2","doi-asserted-by":"crossref","unstructured":"Bloedorn, E., Michalski, R., Wnek, J.: Multistrategy constructive induction: AQ17-MCI. In: International Workshop on Multistrategy Learning (1993)","DOI":"10.1007\/978-1-4615-3202-6"},{"key":"7_CR3","unstructured":"Croonenborghs, T., Driessens, K., Bruynooghe, M.: Learning relational skills for inductive transfer in relational reinforcement learning. In: International Conference on Inductive Logic Programming, Corvallis, OR (2007)"},{"key":"7_CR4","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1613\/jair.639","volume":"13","author":"T. Dietterich","year":"2000","unstructured":"Dietterich, T.: Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research\u00a013, 227\u2013303 (2000)","journal-title":"Journal of Artificial Intelligence Research"},{"key":"7_CR5","doi-asserted-by":"crossref","unstructured":"Fernandez, F., Veloso, M.: Probabilistic policy reuse in a reinforcement learning agent. In: Conference on Autonomous Agents and Multi-Agent Systems, Hakodate, Japan (2006)","DOI":"10.1145\/1160633.1160762"},{"key":"7_CR6","doi-asserted-by":"crossref","unstructured":"Konidaris, G., Barto, A.: Autonomous shaping: Knowledge transfer in reinforcement learning. In: International Conference on Machine Learning, Pittsburgh, PA (2006)","DOI":"10.1145\/1143844.1143906"},{"key":"7_CR7","doi-asserted-by":"crossref","unstructured":"Lazaric, A., Restelli, M., Bonarini, A.: Transfer of samples in batch reinforcement learning. In: International Conference on Machine Learning, Helsinki, Finland (2008)","DOI":"10.1145\/1390156.1390225"},{"key":"7_CR8","unstructured":"Maclin, R., Shavlik, J., Torrey, L., Walker, T.: Knowledge-based support vector regression for reinforcement learning. In: IJCAI Workshop on Reasoning, Representation, and Learning in Computer Games, Edinburgh, Scotland (2005)"},{"key":"7_CR9","unstructured":"Maclin, R., Shavlik, J., Torrey, L., Walker, T., Wild, E.: Giving advice about preferred actions to reinforcement learners via knowledge-based kernel regression. In: AAAI Conference on Artificial Intelligence, Pittsburgh, PA (2005)"},{"key":"7_CR10","unstructured":"Maclin, R., Shavlik, J., Walker, T., Torrey, L.: A simple and effective method for incorporating advice into kernel methods. In: AAAI Conference on Artificial Intelligence, Boston, MA (2006)"},{"key":"7_CR11","doi-asserted-by":"publisher","first-page":"375","DOI":"10.1023\/B:AIRE.0000036264.95672.64","volume":"21","author":"M. Madden","year":"2004","unstructured":"Madden, M., Howley, T.: Transfer of experience between reinforcement learning environments with progressive difficulty. Artificial Intelligence Review\u00a021, 375\u2013398 (2004)","journal-title":"Artificial Intelligence Review"},{"key":"7_CR12","first-page":"1127","volume":"5","author":"O. Mangasarian","year":"2004","unstructured":"Mangasarian, O., Shavlik, J., Wild, E.: Knowledge-based kernel approximation. Journal of Machine Learning Research\u00a05, 1127\u20131141 (2004)","journal-title":"Journal of Machine Learning Research"},{"key":"7_CR13","doi-asserted-by":"crossref","unstructured":"Mehta, N., Ray, S., Tadepalli, P., Dietterich, T.: Automatic discovery and transfer of MAXQ hierarchies. In: International Conference on Machine Learning, Helsinki, Finland (2008)","DOI":"10.1145\/1390156.1390238"},{"issue":"2","key":"7_CR14","doi-asserted-by":"publisher","first-page":"111","DOI":"10.1016\/0004-3702(83)90016-4","volume":"20","author":"R. Michalski","year":"1983","unstructured":"Michalski, R.: A theory and methodology of inductive learning. Artificial Intelligence\u00a020(2), 111\u2013161 (1983)","journal-title":"Artificial Intelligence"},{"key":"7_CR15","volume-title":"Readings in Knowledge Acquisition and Learning: Automating the Construction and Improvement of Expert Systems","author":"R. Michalski","year":"1993","unstructured":"Michalski, R.: Toward a unified theory of learning: Multistrategy task-adaptive learning. In: Buchanan, B.G., Wilkins, D.C. (eds.) Readings in Knowledge Acquisition and Learning: Automating the Construction and Improvement of Expert Systems. Morgan Kaufmann, San Francisco (1993)"},{"key":"7_CR16","volume-title":"Machine Learning","author":"T. Mitchell","year":"1997","unstructured":"Mitchell, T.: Machine Learning. McGraw-Hill, New York (1997)"},{"key":"7_CR17","doi-asserted-by":"publisher","first-page":"233","DOI":"10.1080\/088395198117848","volume":"12","author":"I. Noda","year":"1998","unstructured":"Noda, I., Matsubara, H., Hiraki, K., Frank, I.: Soccer server: A tool for research on multiagent systems. Applied Artificial Intelligence\u00a012, 233\u2013250 (1998)","journal-title":"Applied Artificial Intelligence"},{"key":"7_CR18","unstructured":"Perkins, T., Precup, D.: Using options for knowledge transfer in reinforcement learning. Technical Report UM-CS-1999-034, University of Massachusetts, Amherst (1999)"},{"key":"7_CR19","unstructured":"Price, B., Boutilier, C.: Implicit imitation in multiagent reinforcement learning. In: International Conference on Machine Learning, Bled, Slovenia (1999)"},{"key":"7_CR20","unstructured":"Sharma, M., Holmes, M., Santamaria, J., Irani, A., Isbell, C., Ram, A.: Transfer learning in real-time strategy games using hybrid CBR\/RL. In: International Joint Conference on Artificial Intelligence, Hyderabad, India (2007)"},{"key":"7_CR21","unstructured":"Sherstov, A., Stone, P.: Action-space knowledge transfer in MDPs: Formalism, suboptimality bounds, and algorithms. In: Conference on Learning Theory, Bertinoro, Italy (2005)"},{"issue":"3-4","key":"7_CR22","doi-asserted-by":"publisher","first-page":"323","DOI":"10.1007\/BF00992700","volume":"8","author":"S. Singh","year":"1992","unstructured":"Singh, S.: Transfer of learning by composing solutions of elemental sequential tasks. Machine Learning\u00a08(3-4), 323\u2013339 (1992)","journal-title":"Machine Learning"},{"key":"7_CR23","unstructured":"Srinivasan, A.: The Aleph manual (2001)"},{"key":"7_CR24","unstructured":"Stone, P., Sutton, R.: Scaling reinforcement learning toward RoboCup soccer. In: International Conference on Machine Learning, Williamstown, MA (2001)"},{"key":"7_CR25","first-page":"9","volume":"3","author":"R. Sutton","year":"1988","unstructured":"Sutton, R.: Learning to predict by the methods of temporal differences. Machine Learning\u00a03, 9\u201344 (1988)","journal-title":"Machine Learning"},{"key":"7_CR26","volume-title":"Reinforcement Learning: An Introduction","author":"R. Sutton","year":"1998","unstructured":"Sutton, R., Barto, A.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)"},{"issue":"5","key":"7_CR27","first-page":"1004","volume":"123","author":"F. Tanaka","year":"2003","unstructured":"Tanaka, F., Yamamura, M.: Multitask reinforcement learning on the distribution of MDPs. Transactions of the Institute of Electrical Engineers of Japan\u00a0123(5), 1004\u20131011 (2003)","journal-title":"Transactions of the Institute of Electrical Engineers of Japan"},{"key":"7_CR28","series-title":"Lecture Notes in Artificial Intelligence","doi-asserted-by":"publisher","first-page":"488","DOI":"10.1007\/978-3-540-87481-2_32","volume-title":"Machine Learning and Knowledge Discovery in Databases","author":"M. Taylor","year":"2008","unstructured":"Taylor, M., Jong, N., Stone, P.: Transferring instances for model-based reinforcement learning. In: Daelemans, W., Goethals, B., Morik, K. (eds.) ECML PKDD 2008, Part II. LNCS (LNAI), vol.\u00a05212, pp. 488\u2013505. Springer, Heidelberg (2008)"},{"key":"7_CR29","doi-asserted-by":"crossref","unstructured":"Taylor, M., Stone, P.: Cross-domain transfer for reinforcement learning. In: International Conference on Machine Learning, Corvallis, OR (2007)","DOI":"10.1145\/1273496.1273607"},{"key":"7_CR30","unstructured":"Taylor, M., Stone, P., Liu, Y.: Value functions for RL-based behavior transfer: A comparative study. In: AAAI Conference on Artificial Intelligence, Pittsburgh, PA (2005)"},{"key":"7_CR31","unstructured":"Taylor, M., Whiteson, S., Stone, P.: Transfer learning for policy search methods. In: ICML Workshop on Structural Knowledge Transfer for Machine Learning, Pittsburgh, PA (2006)"},{"key":"7_CR32","series-title":"Lecture Notes in Artificial Intelligence","doi-asserted-by":"publisher","first-page":"425","DOI":"10.1007\/11871842_41","volume-title":"Machine Learning: ECML 2006","author":"L. Torrey","year":"2006","unstructured":"Torrey, L., Shavlik, J., Walker, T., Maclin, R.: Skill acquisition via transfer learning and advice taking. In: F\u00fcrnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol.\u00a04212, pp. 425\u2013436. Springer, Heidelberg (2006)"},{"key":"7_CR33","series-title":"Lecture Notes in Artificial Intelligence","doi-asserted-by":"publisher","first-page":"412","DOI":"10.1007\/11564096_40","volume-title":"Machine Learning: ECML 2005","author":"L. Torrey","year":"2005","unstructured":"Torrey, L., Walker, T., Shavlik, J., Maclin, R.: Using advice to transfer knowledge acquired in one reinforcement learning task to another. In: Gama, J., Camacho, R., Brazdil, P.B., Jorge, A.M., Torgo, L. (eds.) ECML 2005. LNCS (LNAI), vol.\u00a03720, pp. 412\u2013424. Springer, Heidelberg (2005)"},{"key":"7_CR34","unstructured":"Z\u0306elezn\u00fd, F., Srinivasan, A., Page, D.:"},{"key":"7_CR35","unstructured":"Walsh, T., Li, L., Littman, M.: Transferring state abstractions between MDPs. In: ICML Workshop on Structural Knowledge Transfer for Machine Learning, Pittsburgh, PA (2006)"},{"key":"7_CR36","unstructured":"Watkins, C.: Learning from delayed rewards. PhD thesis, University of Cambridge (1989)"},{"key":"7_CR37","first-page":"279","volume":"8","author":"C. Watkins","year":"1992","unstructured":"Watkins, C., Dayan, P.: Q-learning. Machine Learning\u00a08, 279\u2013292 (1992)","journal-title":"Machine Learning"},{"key":"7_CR38","doi-asserted-by":"crossref","unstructured":"Wilson, A., Fern, A., Ray, S., Tadepalli, P.: Multi-task reinforcement learning: A hierarchical Bayesian approach. In: International Conference on Machine Learning, Corvallis, OR (2007)","DOI":"10.1145\/1273496.1273624"}],"container-title":["Studies in Computational Intelligence","Advances in Machine Learning I"],"original-title":[],"link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/978-3-642-05177-7_7.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,11,24]],"date-time":"2020-11-24T02:48:52Z","timestamp":1606186132000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/978-3-642-05177-7_7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010]]},"ISBN":["9783642051760","9783642051777"],"references-count":38,"URL":"https:\/\/doi.org\/10.1007\/978-3-642-05177-7_7","relation":{},"ISSN":["1860-949X","1860-9503"],"issn-type":[{"type":"print","value":"1860-949X"},{"type":"electronic","value":"1860-9503"}],"subject":[],"published":{"date-parts":[[2010]]}}}