{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,4,2]],"date-time":"2022-04-02T15:04:48Z","timestamp":1648911888478},"reference-count":8,"publisher":"World Scientific Pub Co Pte Lt","issue":"02","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Artif. Intell. Tools"],"published-print":{"date-parts":[[2002,6]]},"abstract":"<jats:p> We consider applications where agents have to cooperate without any communication taking place between them, apart from the fact that they can see part of the environment in which they act. We present a multi-agent system, defined in Golog, that needs to service tasks whose value degrades in time. Initial plans, reflecting prior knowledge about the environment, are expressed as Golog procedures, and are provided to the agents. Then the agents are trained using reinforcement learning, in order to ensure coordination both at the action level and at the plan level. This ensures better scalability and increased performance of the system. <\/jats:p>","DOI":"10.1142\/s0218213002000873","type":"journal-article","created":{"date-parts":[[2002,7,28]],"date-time":"2002-07-28T19:09:56Z","timestamp":1027883396000},"page":"233-246","source":"Crossref","is-referenced-by-count":1,"title":["DEVELOPING COLLABORATIVE GOLOG AGENTS BY REINFORCEMENT LEARNING"],"prefix":"10.1142","volume":"11","author":[{"given":"IOAN ALFRED","family":"LEITA","sequence":"first","affiliation":[{"name":"Technical University of Cluj-Napoca, Department of Computer Science, Baritiu 28, RO-3400 Cluj-Napoca, Romania"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"DOINA","family":"PRECUP","sequence":"additional","affiliation":[{"name":"McGill University, School of Computer Science, Montreal, Quebec, Canada H3A 2A7, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"219","published-online":{"date-parts":[[2011,11,21]]},"reference":[{"key":"p_5","doi-asserted-by":"publisher","DOI":"10.1016\/S0004-3702(00)00031-X"},{"issue":"4","key":"p_6","first-page":"13","volume":"20","author":"desJardins M.E.","year":"1999","journal-title":"AI Magazine"},{"key":"p_10","doi-asserted-by":"publisher","DOI":"10.1023\/A:1010046623013"},{"key":"p_12","doi-asserted-by":"publisher","DOI":"10.1016\/S0743-1066(96)00121-5"},{"key":"p_15","doi-asserted-by":"publisher","DOI":"10.1080\/095281398146806"},{"key":"p_22","doi-asserted-by":"publisher","DOI":"10.1080\/095281398146798"},{"key":"p_25","doi-asserted-by":"publisher","DOI":"10.1016\/S0004-3702(99)00052-1"},{"key":"p_28","doi-asserted-by":"publisher","DOI":"10.1080\/095281398146770"}],"container-title":["International Journal on Artificial Intelligence Tools"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218213002000873","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,8,6]],"date-time":"2019-08-06T23:13:53Z","timestamp":1565133233000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0218213002000873"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2002,6]]},"references-count":8,"journal-issue":{"issue":"02","published-online":{"date-parts":[[2011,11,21]]},"published-print":{"date-parts":[[2002,6]]}},"alternative-id":["10.1142\/S0218213002000873"],"URL":"https:\/\/doi.org\/10.1142\/s0218213002000873","relation":{},"ISSN":["0218-2130","1793-6349"],"issn-type":[{"value":"0218-2130","type":"print"},{"value":"1793-6349","type":"electronic"}],"subject":[],"published":{"date-parts":[[2002,6]]}}}