{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:34:27Z","timestamp":1750307667776,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":20,"publisher":"ACM","license":[{"start":{"date-parts":[[2009,6,14]],"date-time":"2009-06-14T00:00:00Z","timestamp":1244937600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000185","name":"Defense Advanced Research Projects Agency","doi-asserted-by":"publisher","award":["FA8750-05-2-0249"],"award-info":[{"award-number":["FA8750-05-2-0249"]}],"id":[{"id":"10.13039\/100000185","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000082","name":"Division of Graduate Education","doi-asserted-by":"publisher","award":["DGE 0549115"],"award-info":[{"award-number":["DGE 0549115"]}],"id":[{"id":"10.13039\/100000082","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000145","name":"Division of Information and Intelligent Systems","doi-asserted-by":"publisher","award":["IIS-0713435"],"award-info":[{"award-number":["IIS-0713435"]}],"id":[{"id":"10.13039\/100000145","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2009,6,14]]},"DOI":"10.1145\/1553374.1553406","type":"proceedings-article","created":{"date-parts":[[2009,6,16]],"date-time":"2009-06-16T13:34:36Z","timestamp":1245159276000},"page":"249-256","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":14,"title":["The adaptive <i>k<\/i>-meteorologists problem and its application to structure learning and feature selection in reinforcement 
learning"],"prefix":"10.1145","author":[{"given":"Carlos","family":"Diuk","sequence":"first","affiliation":[{"name":"Rutgers University, Piscataway, NJ"}]},{"given":"Lihong","family":"Li","sequence":"additional","affiliation":[{"name":"Rutgers University, Piscataway, NJ"}]},{"given":"Bethany R.","family":"Leffler","sequence":"additional","affiliation":[{"name":"Rutgers University, Piscataway, NJ"}]}],"member":"320","published-online":{"date-parts":[[2009,6,14]]},"reference":[{"doi-asserted-by":"publisher","key":"e_1_3_2_1_1_1","DOI":"10.5555\/1248547.1248611"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_2_1","DOI":"10.5555\/3013545.3013546"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_3_1","DOI":"10.1162\/153244303765208377"},{"key":"e_1_3_2_1_4_1","volume-title":"Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI-08)","author":"Brunskill E.","year":"2008","unstructured":"Brunskill, E., Leffler, B. R., Li, L., Littman, M. L., & Roy, N. (2008). CORL: A continuous-state offset-dynamics reinforcement learner. Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI-08)."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_5_1","DOI":"10.1145\/258128.258179"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_6_1","DOI":"10.1111\/j.1467-8640.1989.tb00324.x"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_7_1","DOI":"10.5555\/1622434.1622447"},{"key":"e_1_3_2_1_9_1","first-page":"740","volume-title":"Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence (IJCAI-99)","author":"Kearns M. J.","year":"1999","unstructured":"Kearns, M. J., & Koller, D. (1999). Efficient reinforcement learning in factored MDPs. 
Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence (IJCAI-99) (pp. 740--747)."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_10_1","DOI":"10.1016\/S0022-0000(05)80062-5"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_11_1","DOI":"10.1023\/A:1017984413808"},{"key":"e_1_3_2_1_12_1","first-page":"572","volume-title":"Proceedings of the Twenty-Second Conference on Artificial Intelligence (AAAI-07)","author":"Leffler B. R.","year":"2007","unstructured":"Leffler, B. R., Littman, M. L., & Edmunds, T. (2007). Efficient reinforcement learning with relocatable action models. Proceedings of the Twenty-Second Conference on Artificial Intelligence (AAAI-07) (pp. 572--577)."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_14_1","DOI":"10.1145\/1390156.1390228"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_15_1","DOI":"10.5555\/1005332.1005355"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"crossref","DOI":"10.1002\/9780470316887","volume-title":"Markov decision processes: Discrete stochastic dynamic programming","author":"Puterman M. L.","year":"1994","unstructured":"Puterman, M. L. (1994). Markov decision processes: Discrete stochastic dynamic programming. 
New York: Wiley-Interscience."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_17_1","DOI":"10.1109\/ADPRL.2007.368176"},{"key":"e_1_3_2_1_18_1","first-page":"645","volume-title":"Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence (AAAI-07)","author":"Strehl A. L.","year":"2007","unstructured":"Strehl, A. L., Diuk, C., & Littman, M. L. (2007). Efficient structure learning in factored-state MDPs. Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence (AAAI-07) (pp. 645--650)."},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_19_1","DOI":"10.5555\/3020419.3020478"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_20_1","DOI":"10.1145\/1143844.1143955"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_21_1","DOI":"10.1145\/1968.1972"},{"doi-asserted-by":"publisher","key":"e_1_3_2_1_22_1","DOI":"10.1007\/BF00992676"}],"event":{"sponsor":["NSF","Microsoft Research","MITACS"],"acronym":"ICML '09","name":"ICML '09: The 26th Annual International Conference on Machine Learning held in conjunction with the 2007 International Conference on Inductive Logic Programming","location":"Montreal Quebec Canada"},"container-title":["Proceedings of the 26th Annual International Conference on Machine 
Learning"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1553374.1553406","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1553374.1553406","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T13:29:34Z","timestamp":1750253374000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1553374.1553406"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,6,14]]},"references-count":20,"alternative-id":["10.1145\/1553374.1553406","10.1145\/1553374"],"URL":"https:\/\/doi.org\/10.1145\/1553374.1553406","relation":{},"subject":[],"published":{"date-parts":[[2009,6,14]]},"assertion":[{"value":"2009-06-14","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}