{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,11]],"date-time":"2025-12-11T07:30:17Z","timestamp":1765438217803},"reference-count":38,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2010,12,22]],"date-time":"2010-12-22T00:00:00Z","timestamp":1292976000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J Supercomput"],"published-print":{"date-parts":[[2012,3]]},"DOI":"10.1007\/s11227-010-0510-3","type":"journal-article","created":{"date-parts":[[2010,12,21]],"date-time":"2010-12-21T13:48:05Z","timestamp":1292939285000},"page":"1188-1217","source":"Crossref","is-referenced-by-count":10,"title":["A hybrid cognitive\/reactive intelligent agent autonomous path planning technique in\u00a0a\u00a0networked-distributed unstructured environment for\u00a0reinforcement learning"],"prefix":"10.1007","volume":"59","author":[{"given":"Dalila B.","family":"Megherbi","sequence":"first","affiliation":[]},{"given":"Vikram","family":"Malayia","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2010,12,22]]},"reference":[{"key":"510_CR1","volume-title":"Proceedings of the first international conference on multi-agent systems","author":"AAAI","year":"1995","unstructured":"AAAI (1995) In: Lessor V (ed) Proceedings of the first international conference on multi-agent systems, Menlo Park, CA, June. AAAI Press, Menlo Park"},{"key":"510_CR2","volume-title":"Proceedings of the 2006 international conference on machine learning; models, technologies & applications","author":"HS Al-Dayaa","year":"2006","unstructured":"Al-Dayaa HS, Megherbi DB (2006) Fast reinforcement learning technique via multiple lookahead levels. In: Proceedings of the 2006 international conference on machine learning; models, technologies & applications, Nevada, USA"},{"issue":"2","key":"510_CR3","doi-asserted-by":"crossref","first-page":"398","DOI":"10.1109\/TSMCB.2006.883264","volume":"32","author":"BN Araabi","year":"2007","unstructured":"Araabi BN, Mastoureshgh S, Ahmadabadi MN (2007) A study on expertise of agents and its effects on cooperative Q-learning. IEEE Trans Syst Man Cybern, Part B, Cybern 32(2):398\u2013409","journal-title":"IEEE Trans Syst Man Cybern, Part B, Cybern"},{"key":"510_CR4","volume-title":"Readings in distributed artificial intelligence","year":"1988","unstructured":"Bond AH, Gasser L (eds) (1988) Readings in distributed artificial intelligence. Morgan Kaufmann, San Mateo"},{"key":"510_CR5","doi-asserted-by":"crossref","unstructured":"Cao J, Spooner DP, Jarvis SA, Nudd GR (2005) Grid load balancing using intelligent agents. J\u00a0Future Gener Comput Syst","DOI":"10.1016\/j.future.2004.09.032"},{"issue":"2","key":"510_CR6","doi-asserted-by":"crossref","first-page":"279","DOI":"10.1109\/72.839000","volume":"11","author":"C Clausen","year":"2000","unstructured":"Clausen C, Wechsler H (2000) Quad-Q-learning. IEEE Trans Neural Netw 11(2):279\u2013294","journal-title":"IEEE Trans Neural Netw"},{"issue":"3","key":"510_CR7","doi-asserted-by":"crossref","first-page":"285","DOI":"10.1109\/TITS.2005.853698","volume":"6","author":"X Dai","year":"2005","unstructured":"Dai X, Li C-K, Rad AB (2005) An approach to tune fuzzy controllers based on reinforcement learning for autonomous vehicle control. IEEE Trans Intell Transp Syst 6(3):285\u2013293","journal-title":"IEEE Trans Intell Transp Syst"},{"key":"510_CR8","unstructured":"Decker KS, Williamson M (1987) Intelligent adaptive information agents. The Robotics Institute, Carnegie Mellon University (decker,sycara,mikew)@cs.cmu.edu"},{"key":"510_CR9","doi-asserted-by":"crossref","unstructured":"Durfee EH, Lesser VR, Corkill DD (1989) Trends in cooperative distributed problem solving. IEEE Trans Data Knowl Eng","DOI":"10.1109\/69.43404"},{"key":"510_CR10","volume-title":"Multi-agent systems, an introduction to distributed artificial intelligence","author":"J Ferber","year":"1999","unstructured":"Ferber J (1999) Multi-agent systems, an introduction to distributed artificial intelligence. Addison-Wesley, Reading"},{"key":"510_CR11","volume-title":"Using MPI","author":"W Gropp","year":"1999","unstructured":"Gropp W, Lusk E, Skjellum A (1999) Using MPI. MIT Press, Cambridge"},{"issue":"5","key":"510_CR12","doi-asserted-by":"crossref","first-page":"2140","DOI":"10.1109\/TSMCB.2004.832154","volume":"34","author":"M Guo","year":"2004","unstructured":"Guo M, Liu Y, Malec J (2004) A new Q-learning algorithm based on the metropolis criterion. IEEE Trans Syst Man Cybern, Part\u00a0B, Cybern 34(5):2140\u20132143","journal-title":"IEEE Trans Syst Man Cybern, Part\u00a0B, Cybern"},{"key":"510_CR13","volume-title":"IEEE Canadian conference","author":"R Hadidi","year":"2010","unstructured":"Hadidi R, Jeyasurya B (2010) Selective initial state criteria to enhance convergence rate of Q-learning algorithm in power system stability application. In: IEEE Canadian conference"},{"key":"510_CR14","unstructured":"Hartvigsen G, Johansen D (2010) Co-operation in distributed artificial intelligence environment\u2014the StromCast application"},{"issue":"3","key":"510_CR15","doi-asserted-by":"crossref","first-page":"1444","DOI":"10.1109\/TIE.2007.908526","volume":"55","author":"L Hu","year":"2008","unstructured":"Hu L, Zhou C, Sun Z (2008) Estimating biped gait using spline-based probability distribution function with Q-learning. IEEE Trans Ind Electron 55(3):1444\u20131452","journal-title":"IEEE Trans Ind Electron"},{"key":"510_CR16","doi-asserted-by":"crossref","unstructured":"Khatib O (1986) Real-time obstacle avoidance for manipulators and mobile robots. Int J Robot Res","DOI":"10.1007\/978-1-4613-8997-2_29"},{"key":"510_CR17","volume-title":"Proc. IEEE international conference of robotics and automation","author":"P Khosla","year":"1998","unstructured":"Khosla P, Volpe R (1998) Superquadratic artificial potentials for obstacle avoidance and approach. In: Proc. IEEE international conference of robotics and automation, Philadelphia, PA"},{"key":"510_CR18","volume-title":"Proceedings of the international conference on parallel and distributed processing techniques and applications","author":"DB Megherbi","year":"2007","unstructured":"Megherbi DB, Malayia V (2007) An autonomous hybrid cognitive\/reactive agent path planning technique in a networked distributed unstructured environment for reinforcement learning. In: Proceedings of the international conference on parallel and distributed processing techniques and applications, Las Vegas, June"},{"key":"510_CR19","volume-title":"IEEE international conference on computational intelligence for measurement systems and applications","author":"DB Megherbi","year":"2009","unstructured":"Megherbi DB, Radumilo-Franklin J (2009) An intelligent multi-agent distributed battlefield via multi-token message passing. In: IEEE international conference on computational intelligence for measurement systems and applications, China, May 2009"},{"key":"510_CR20","first-page":"419","volume-title":"Proceedings of the SPIE international conference on defense sensing, unmanned ground vehicle technology","author":"DB Megherbi","year":"2001","unstructured":"Megherbi DB, Teirelbar A, Boulenouar AJ (2001) A time-varying-environment machine learning technique for autonomous agent shortest path planning. In: Proceedings of the SPIE international conference on defense sensing, unmanned ground vehicle technology, Orlando, Florida, April, pp\u00a0419\u2013428"},{"key":"510_CR21","volume-title":"Machine learning","author":"TM Mitchell","year":"1997","unstructured":"Mitchell TM (1997) Machine learning. McGraw Hill, New York"},{"key":"510_CR22","volume-title":"Proc IEEE conf on robotics & automation","author":"WS Newman","year":"1985","unstructured":"Newman WS, Hogan N (1985) High-speed robot control and obstacle avoidance using dynamic potential functions. In: Proc IEEE conf on robotics & automation"},{"key":"510_CR23","doi-asserted-by":"crossref","DOI":"10.1007\/978-4-431-67919-6","volume-title":"Distributed autonomous robotic systems 4","author":"L Parker","year":"2000","unstructured":"Parker L (2000) Current state of the art in distributed robot systems. In: Parker LE, Bekey G, Barhen\u00a0J (eds) Distributed autonomous robotic systems 4. Springer, Berlin"},{"key":"510_CR24","volume-title":"Proceedings of the int conf on the simulation of adaptive behavior","author":"R Riolo","year":"1991","unstructured":"Riolo R (1991) Lookahead planning and latent learning in a classifier system. In: Proceedings of the int conf on the simulation of adaptive behavior"},{"key":"510_CR25","doi-asserted-by":"crossref","unstructured":"So Y, Durfee E (1992) A\u00a0distributed problem solving infrastructure for computer network management. Int J Intell Comp Inf Syst","DOI":"10.1142\/S0218215792000246"},{"key":"510_CR26","unstructured":"Stone P, Veloso M (2000) Multiagent system: a survey from a machine learning. Auton Robot 8"},{"key":"510_CR27","first-page":"216","volume-title":"Proceedings of the seventh international conference on machine learning","author":"RS Sutton","year":"1990","unstructured":"Sutton RS (1990) Integrated architectures for learning, planning, and reaction based on approximating dynamic programming. In: Proceedings of the seventh international conference on machine learning, pp\u00a0216\u2013224"},{"key":"510_CR28","first-page":"151","volume-title":"Working notes of 1991 AAAI spring symposium","author":"RS Sutton","year":"1991","unstructured":"Sutton RS (1991) Dyna, an integrated architecture for learning, planning, and reacting. In: Working notes of 1991 AAAI spring symposium, pp\u00a0151\u2013155"},{"key":"510_CR29","volume-title":"Reinforcement learning: an introduction","author":"RS Sutton","year":"1998","unstructured":"Sutton RS, Barto AG (1998) Reinforcement learning: an introduction. MIT Press, Cambridge"},{"issue":"2","key":"510_CR30","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1109\/37.126844","volume":"12","author":"RS Sutton","year":"1992","unstructured":"Sutton RS, Barto AG, Williams RJ (1992) Reinforcement learning is direct adaptive optimal control. IEEE Control Syst Mag 12(2):19\u201322","journal-title":"IEEE Control Syst Mag"},{"key":"510_CR31","volume-title":"Multi-agent reinforcement learning: independent vs. cooperative agents. Readings in agents","author":"M Tan","year":"1993","unstructured":"Tan M (1993) Multi-agent reinforcement learning: independent vs. cooperative agents. Readings in agents. Morgan Kaufmann, San Mateo"},{"issue":"4","key":"510_CR32","doi-asserted-by":"crossref","first-page":"1014","DOI":"10.1109\/TSMCB.2008.922018","volume":"38","author":"J Valasek","year":"2008","unstructured":"Valasek J, Doebbler J, Tandale MD, Meade AJ (2008) Improved adaptive\u2013reinforcement learning control for morphing unmanned air vehicles. IEEE Trans Syst Man Cybern, Part B, Cybern 38(4):1014\u20131020","journal-title":"IEEE Trans Syst Man Cybern, Part B, Cybern"},{"key":"510_CR33","volume-title":"ICMAS proceedings of the second international conference on multi-agent systems","author":"H Dyke Parunak Van","year":"1996","unstructured":"Van Dyke Parunak H (1996) In: ICMAS proceedings of the second international conference on multi-agent systems"},{"key":"510_CR34","first-page":"279","volume":"8","author":"C Watkins","year":"1992","unstructured":"Watkins C, Dayan P (1992) Q-learning. Mach Learn 8:279\u2013292","journal-title":"Mach Learn"},{"key":"510_CR35","unstructured":"Weiss G (1999) A multiagent framework for planning, reacting and learning. Technical Report FKI-233-99"},{"key":"510_CR36","volume-title":"Multiagent systems a modern approach to distributed artificial intelligence","author":"G Weiss","year":"1999","unstructured":"Weiss G (1999) Multiagent systems a modern approach to distributed artificial intelligence. MIT Press, Cambridge"},{"key":"510_CR37","doi-asserted-by":"crossref","unstructured":"Weiss G (1998) A multi-agent perspective of parallel and distributed machine learning. http:\/\/wwwbrauer.in.tum.de\/~weissg\/Docs\/weissgaa98.pdf","DOI":"10.1145\/280765.280806"},{"issue":"4","key":"510_CR38","doi-asserted-by":"crossref","first-page":"930","DOI":"10.1109\/TSMCB.2008.920231","volume":"38","author":"MA Wiering","year":"2008","unstructured":"Wiering MA, van Hasselt H (2008) Ensemble algorithms in reinforcement learning. IEEE Trans Syst Man Cybern, Part\u00a0B, Cybern 38(4):930\u2013936","journal-title":"IEEE Trans Syst Man Cybern, Part\u00a0B, Cybern"}],"container-title":["The Journal of Supercomputing"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11227-010-0510-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s11227-010-0510-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11227-010-0510-3","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,6,7]],"date-time":"2019-06-07T06:51:12Z","timestamp":1559890272000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s11227-010-0510-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,12,22]]},"references-count":38,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2012,3]]}},"alternative-id":["510"],"URL":"https:\/\/doi.org\/10.1007\/s11227-010-0510-3","relation":{},"ISSN":["0920-8542","1573-0484"],"issn-type":[{"value":"0920-8542","type":"print"},{"value":"1573-0484","type":"electronic"}],"subject":[],"published":{"date-parts":[[2010,12,22]]}}}