{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T15:19:00Z","timestamp":1774624740916,"version":"3.50.1"},"reference-count":92,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2008,9,7]],"date-time":"2008-09-07T00:00:00Z","timestamp":1220745600000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Auton Agent Multi-Agent Syst"],"published-print":{"date-parts":[[2009,6]]},"DOI":"10.1007\/s10458-008-9062-9","type":"journal-article","created":{"date-parts":[[2008,9,6]],"date-time":"2008-09-06T10:25:52Z","timestamp":1220696752000},"page":"342-375","source":"Crossref","is-referenced-by-count":179,"title":["Opportunities for multiagent systems and multiagent reinforcement learning in traffic control"],"prefix":"10.1007","volume":"18","author":[{"given":"Ana L. C.","family":"Bazzan","sequence":"first","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2008,9,7]]},"reference":[{"key":"9062_CR1","doi-asserted-by":"crossref","first-page":"616","DOI":"10.1145\/1160633.1160743","volume-title":"Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems","author":"G. Balan","year":"2006","unstructured":"Balan G. and Luke S. (2006). History-based traffic control. In: Nakashima, H., Wellman, M.P., Weiss, G. and Stone, P. (eds) Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems, pp 616\u2013621. ACM Press, New York"},{"key":"9062_CR2","unstructured":"Balmer, M., Cetin, N., Nagel, K., & Raney, B. (2004). Towards truly agent-based traffic and mobility simulations. In N. Jennings, C. Sierra, L. Sonenberg, & M. Tambe (Eds.), Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multi Agent Systems, AAMAS, July 2004 (Vol.\u00a01, pp.\u00a060\u201367). New York: IEEE Computer Society."},{"key":"9062_CR3","unstructured":"Bazzan, A. L. C. (1995). A game-theoretic approach to distributed control of traffic signals. In Proceedings of the 1st International Conference on Multi-Agent Systems (ICMAS) (p. 439, extended abstract). San Francisco."},{"key":"9062_CR4","unstructured":"Bazzan, A. L. C. (1997). An evolutionary game-theoretic approach for coordination of traffic signal agents. PhD thesis, University of Karlsruhe."},{"issue":"1","key":"9062_CR5","doi-asserted-by":"crossref","first-page":"131","DOI":"10.1007\/s10458-004-6975-9","volume":"10","author":"A.L.C. Bazzan","year":"2005","unstructured":"Bazzan A.L.C. (2005). A distributed approach for coordination of traffic signal agents. Autonomous Agents and Multiagent Systems 10(1): 131\u2013164","journal-title":"Autonomous Agents and Multiagent Systems"},{"key":"9062_CR6","unstructured":"Bazzan, A. L. C., de Oliveira, D., & da Silva, B. C. (2008). Learning in groups of traffic signals. Technical report, UFRGS."},{"key":"9062_CR7","first-page":"1","volume-title":"Adaptive agents and multi-agent systems III, Lecture notes in artificial intelligence (Vol 4865)","author":"A.L.C. Bazzan","year":"2008","unstructured":"Bazzan A.L.C., Kl\u00fcgl F., Nagel K. and Oliveira D. (2008). Adapt or not to adapt\u2014Consequences of adapting driver and traffic light agents. In: Tuyls, K., Nowe, A., Guessoum, Z., and Kudenko, D. (eds) Adaptive agents and multi-agent systems III, Lecture notes in artificial intelligence (Vol 4865), pp 1\u201314. Springer-Verlag, New York"},{"key":"9062_CR8","first-page":"126","volume-title":"Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems, May 2006","author":"A.L.C. Bazzan","year":"2006","unstructured":"Bazzan A.L.C. and Junges R. (2006). Congestion tolls as utility alignment between agent and system optimum. In: Nakashima, H., Wellman, M.P., Weiss, G. and Stone, P. (eds) Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems, May 2006, pp 126\u2013128. ACM Press, New York"},{"issue":"4","key":"9062_CR9","doi-asserted-by":"crossref","first-page":"299","DOI":"10.1016\/j.trc.2005.07.003","volume":"13","author":"A.L.C. Bazzan","year":"2005","unstructured":"Bazzan A.L.C. and Kl\u00fcgl F. (2005). Case studies on the Braess paradox: Simulating route recommendation and learning in abstract and microscopic models. Transportation Research C 13(4): 299\u2013319","journal-title":"Transportation Research C"},{"key":"9062_CR10","first-page":"63","volume-title":"Advances in artificial intelligence, Lecture notes in artificial intelligence (Vol. 5249)","author":"A.L.C. Bazzan","year":"2008","unstructured":"Bazzan A.L.C. and Kl\u00fcgl F. (2008). Re-routing agents in an abstract traffic scenario. In: Zaverucha, G. and da Costa, A.L. (eds) Advances in artificial intelligence, Lecture notes in artificial intelligence (Vol. 5249), pp 63\u201372. Springer-Verlag, Berlin"},{"key":"9062_CR11","first-page":"195","volume-title":"EPIA 2007, Lecture notes in artificial intelligence (Vol. 4874)","author":"A.L.C. Bazzan","year":"2007","unstructured":"Bazzan A.L.C., Kl\u00fcgl F. and Nagel K. (2007). Adaptation in games with many co-evolving agents. In: Neves, J., Santos, M., and Machado, J. (eds) EPIA 2007, Lecture notes in artificial intelligence (Vol. 4874), pp 195\u2013206. Springer-Verlag, Berlin"},{"key":"9062_CR12","unstructured":"Bazzan, A. L. C., Wahle, J., & Kl\u00fcgl, F. (1999). Agents in traffic modelling\u2014From reactive to social behavior. In Advances in artificial intelligence, Lecture notes in artificial intelligence (Vol.\u00a01701, pp.\u00a0303\u2013306). Berlin\/Heidelberg: Springer. Extended version appeared in Proceedings of the U.K. Special Interest Group on Multi-Agent Systems (UKMAS), Bristol, UK."},{"key":"9062_CR13","first-page":"1021","volume-title":"Proceedings of the 17th International Joint Conference on Artificial Intelligence","author":"M.H. Bowling","year":"2001","unstructured":"Bowling M.H. and Veloso M.M. (2001). Rational and convergent learning in stochastic games. In: Nebel, B. (eds) Proceedings of the 17th International Joint Conference on Artificial Intelligence., pp 1021\u20131026. Morgan Kaufmann, Seattle"},{"key":"9062_CR14","first-page":"258","volume":"12","author":"D. Braess","year":"1968","unstructured":"Braess D. (1968). \u00fcber ein Paradoxon aus der Verkehrsplanung. Unternehmensforschung 12: 258","journal-title":"Unternehmensforschung"},{"issue":"5","key":"9062_CR15","doi-asserted-by":"crossref","first-page":"056132","DOI":"10.1103\/PhysRevE.64.056132","volume":"64","author":"E. Brockfeld","year":"2001","unstructured":"Brockfeld E., Barlovic R., Schadschneider A. and Schreckenberg M. (2001). Optimizing traffic lights in a cellular automaton model for city traffic. Physical Review E 64(5): 056132","journal-title":"Physical Review E"},{"key":"9062_CR16","first-page":"276","volume-title":"Applications of learning classifier systems, Studies in fuzziness and soft computing (Vol. 150)","author":"L. Bull","year":"2004","unstructured":"Bull L., Sha\u2019Aban J., Tomlinson A., Addison J.D. and Heydecker B.G. (2004). Towards distributed adaptive control for road traffic junction signals using learning classifier systems. In: Bull, L. (eds) Applications of learning classifier systems, Studies in fuzziness and soft computing (Vol. 150), pp 276\u2013299. Springer, New York"},{"key":"9062_CR17","first-page":"79","volume":"14","author":"B. Burmeister","year":"1997","unstructured":"Burmeister B., Doormann J. and Matylis G. (1997). Agent-oriented traffic simulation. Transactions Society for Computer Simulation 14: 79\u201386","journal-title":"Transactions Society for Computer Simulation"},{"key":"9062_CR18","first-page":"324","volume-title":"EPIA","author":"E. Camponogara","year":"2003","unstructured":"Camponogara E. and Kraus W. (2003). Distributed learning agents in urban traffic control. In: Moura-Pires, F. and Abreu, S. (eds) EPIA., pp 324\u2013335. Portugal, Beja"},{"key":"9062_CR19","first-page":"264","volume-title":"Sequence learning: paradigms, algorithms, and applications","author":"S.P.M. Choi","year":"2001","unstructured":"Choi S.P.M., Yeung D.-Y. and Zhang N.L. (2001). Hidden-mode markov decision processes for nonstationary sequential decision making. In: Sun, R. and Giles, C.L. (eds) Sequence learning: paradigms, algorithms, and applications, pp 264\u2013287. Springer, Berlin"},{"key":"9062_CR20","doi-asserted-by":"crossref","first-page":"199","DOI":"10.1016\/S0370-1573(99)00117-9","volume":"329","author":"D. Chowdhury","year":"2000","unstructured":"Chowdhury D., Santen L. and Schadschneider A. (2000). Statistical physics of vehicular traffic and some related systems. Physics Reports 329: 199\u2013329","journal-title":"Physics Reports"},{"issue":"2","key":"9062_CR21","doi-asserted-by":"crossref","first-page":"R1311","DOI":"10.1103\/PhysRevE.59.R1311","volume":"59","author":"D. Chowdhury","year":"1999","unstructured":"Chowdhury D. and Schadschneider A. (1999). Self-organization of traffic jams in cities: Effects of stochastic dynamics and signal periods. Physical Review E 59(2): R1311\u2013R1314","journal-title":"Physical Review E"},{"key":"9062_CR22","unstructured":"Claus, C., & Boutilier, C. (1998). The dynamics of reinforcement learning in cooperative multiagent systems. In Proceedings of the 15th National Conference on Artificial Intelligence (pp.\u00a0746\u2013752). Madison, Wisconsin."},{"key":"9062_CR23","unstructured":"Di Taranto, M. (1989). UTOPIA. In Proceedings of the IFAC-IFIP-IFORS Conference on Control, Computers, Communication in Transportation, Paris. ifac."},{"issue":"2","key":"9062_CR24","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1016\/S0967-0661(01)00121-6","volume":"10","author":"C. Diakaki","year":"2002","unstructured":"Diakaki C., Papageorgiou M. and Aboudolas K. (2002). A multivariable regulator approach to traffic-responsive network-wide signal control. Control Engineering Practice 10(2): 183\u2013195","journal-title":"Control Engineering Practice"},{"issue":"6","key":"9062_CR25","doi-asserted-by":"crossref","first-page":"1347","DOI":"10.1162\/089976602753712972","volume":"14","author":"K. Doya","year":"2002","unstructured":"Doya K., Samejima K., Katagiri K. and Kawato M. (2002). Multiple model-based reinforcement learning. Neural Computation 14(6): 1347\u20131369","journal-title":"Neural Computation"},{"key":"9062_CR26","first-page":"530","volume-title":"The 3rd International Joint Conference on Autonomous Agents and Multiagent Systems, July 2004","author":"K. Dresner","year":"2004","unstructured":"Dresner K. and Stone P. (2004). Multiagent traffic management: A reservation-based intersection control mechanism. In: Jennings, N., Sierra, C., Sonenberg, L. and Tambe, M. (eds) The 3rd International Joint Conference on Autonomous Agents and Multiagent Systems, July 2004, pp 530\u2013537. IEEE Computer Society, New York"},{"key":"9062_CR27","volume-title":"The 4th International Joint Conference on Autonomous Agents and Multiagent Systems, July 2005","author":"K. Dresner","year":"2005","unstructured":"Dresner K. and Stone P. (2005). Multiagent traffic management: An improved intersection control mechanism. In: Dignum, F., Dignum, V., Koenig, S., Kraus, S., Singh, M.P. and Wooldridge, M. (eds) The 4th International Joint Conference on Autonomous Agents and Multiagent Systems, July 2005, pp. ACM Press, New York"},{"key":"9062_CR28","first-page":"129","volume-title":"LAMAS 2005, Lecture notes in artificial intelligence (Vol 3898)","author":"K. Dresner","year":"2006","unstructured":"Dresner K. and Stone P. (2006). Multiagent traffic management: Opportunities for multiagent learning. In: Tuyls, K., Hoen, P.J., Verbeeck, K. and Sen, S. (eds) LAMAS 2005, Lecture notes in artificial intelligence (Vol 3898)., pp 129\u2013138. Springer Verlag, Berlin"},{"key":"9062_CR29","unstructured":"Dresner, K., Stone, P. (2007). Sharing the road: Autonomous vehicles meet human drivers. In The 20th International Joint Conference on Artificial Intelligence, January 2007 (pp.\u00a01263\u20131268). Hyderabad, India."},{"key":"9062_CR30","unstructured":"Elhadouaj, S., Drogoul, A., & Espi\u00e9, S. (2000). How to combine reactivity and anticipation: The case of conflicts resolution in a simulated road traffic. In Proceedings of the Multiagent Based Simulation (MABS) (pp.\u00a082\u201396). New York: Springer."},{"key":"9062_CR31","doi-asserted-by":"crossref","unstructured":"France, J., & Ghorbani, A. A. (2003). A multiagent system for optimizing urban traffic. In Proceedings of the IEEE\/WIC International Conference on Intelligent Agent Technology (pp.\u00a0411\u2013414). Washington, DC: IEEE Computer Society.","DOI":"10.1109\/IAT.2003.1241110"},{"key":"9062_CR32","first-page":"75","volume":"906","author":"N.H. Gartner","year":"1983","unstructured":"Gartner N.H. (1983). OPAC\u2014A demand-responsive strategy for traffic signal control. Transportation Research Record 906: 75\u201381","journal-title":"Transportation Research Record"},{"issue":"1","key":"9062_CR33","doi-asserted-by":"crossref","first-page":"29","DOI":"10.25088\/ComplexSystems.16.1.29","volume":"16","author":"C. Gershenson","year":"2005","unstructured":"Gershenson C. (2005). Self-organizing traffic lights. Complex Systems 16(1): 29\u201353","journal-title":"Complex Systems"},{"key":"9062_CR34","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1038\/380121a0","volume":"380","author":"D. Gordon","year":"1996","unstructured":"Gordon D. (1996). The organization of work in social insect colonies. Nature 380: 121\u2013124","journal-title":"Nature"},{"key":"9062_CR35","volume-title":"Handbook of transportation science","year":"2003","unstructured":"Hall R.W. (2003). Handbook of transportation science 2nd ed. Kluwer Academic Pub., Dordrecht","edition":"2"},{"key":"9062_CR36","unstructured":"Haugeneder, H., & Steiner, D. (1993). MECCA\/UTS: A multi-agent scenario for cooperation in urban traffic. In Proceedings of the Special Interest Group on Cooperating Knowledge Based Systems (pp. 83\u201398). Keele, UK."},{"key":"9062_CR37","doi-asserted-by":"crossref","first-page":"738","DOI":"10.1038\/25499","volume":"396","author":"D. Helbing","year":"1998","unstructured":"Helbing D. and Huberman B.A. (1998). Coherent moving states in highway traffic. Nature 396: 738","journal-title":"Nature"},{"key":"9062_CR38","doi-asserted-by":"crossref","unstructured":"Henry, J., Farges, J. L., & Tuffal, J. (1983). The PRODYN real time traffic algorithm. In Proceedings of the International Federation of Automatic Control (IFAC) Conference, Baden-Baden: IFAC.","DOI":"10.1016\/S1474-6670(17)62577-1"},{"key":"9062_CR39","unstructured":"Hu, J., & Wellman, M. P. (1998). Multiagent reinforcement learning: Theoretical framework and an algorithm. In Proceedings of the 15th International Conference on Machine Learning (pp.\u00a0242\u2013250). Los Altos: Morgan Kaufmann."},{"key":"9062_CR40","unstructured":"Hunt, P. B., Robertson, D. I., Bretherton, R. D., & Winton, R. I. (1981). SCOOT\u2014A traffic responsive method of coordinating signals. TRRL Lab. Report 1014, Transport and Road Research Laboratory, Berkshire, 1981."},{"key":"9062_CR41","doi-asserted-by":"crossref","first-page":"285","DOI":"10.1007\/978-3-662-07809-9_12","volume-title":"Human behaviour and traffic networks","author":"F. Kl\u00fcgl","year":"2004","unstructured":"Kl\u00fcgl F. and Bazzan A.L.C. (2004). Simulated route decision behaviour: Simple heuristics and adaptation. In: Selten, R. and Schreckenberg, M. (eds) Human behaviour and traffic networks., pp 285\u2013304. Springer, New York"},{"key":"9062_CR42","unstructured":"Kl\u00fcgl, F., Bazzan, A. L. C., & Wahle, J. (2003). Selection of information types based on personal utility\u2014A testbed for traffic information markets. In Proceedings of the 2nd International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS) (pp.\u00a0377\u2013384). Melbourne, Australia: ACM Press."},{"key":"9062_CR43","first-page":"192","volume-title":"Proceedings of Operations Research (OR), Operations Research Proceedings","author":"E. K\u00f6hler","year":"2004","unstructured":"K\u00f6hler E., M\u00f6hring R.H. and W\u00fcnsch G. (2004). Minimizing total delay in fixed-time controlled traffic networks. In: Fleuren, H., den Hertog, D., and Kort, P. (eds) Proceedings of Operations Research (OR), Operations Research Proceedings., pp 192. Springer, Tilburg"},{"issue":"5","key":"9062_CR44","doi-asserted-by":"crossref","first-page":"389","DOI":"10.1016\/S0968-090X(03)00032-9","volume":"11","author":"I. Kosonen","year":"2003","unstructured":"Kosonen I. (2003). Multi-agent fuzzy signal control based on real-time simulation. Transportation Research C 11(5): 389\u2013403","journal-title":"Transportation Research C"},{"key":"9062_CR45","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-642-61353-1","volume-title":"Introduction to the theory of traffic flow","author":"W. Leutzbach","year":"1988","unstructured":"Leutzbach W. (1988). Introduction to the theory of traffic flow. Springer, Berlin"},{"key":"9062_CR46","doi-asserted-by":"crossref","unstructured":"Littman, M. L. (1994). Markov games as a framework for multi-agent reinforcement learning. In Proceedings of the 11th International Conference on Machine Learning, ML (pp.\u00a0157\u2013163). New Brunswick, NJ: Morgan Kaufmann.","DOI":"10.1016\/B978-1-55860-335-6.50027-1"},{"key":"9062_CR47","unstructured":"Lowrie, P. (1982). The Sydney coordinate adaptive traffic system\u2014Principles, methodology, algorithms. In Proceedings of the International Conference on Road Traffic Signalling, Sydney, Australia."},{"key":"9062_CR48","doi-asserted-by":"crossref","unstructured":"M\u00f6hring, R. H., N\u00f6kel, K., & W\u00fcnsch, G. (2006). A model and fast optimization method for signal coordination in a network. In Proceedings of the 11th IFAC Symposium on Control in Transportation Systems, August 2006. Delft, The Netherlands.","DOI":"10.3182\/20060829-3-NL-2908.00013"},{"key":"9062_CR49","first-page":"103","volume":"13","author":"A.W. Moore","year":"1993","unstructured":"Moore A.W. and Atkeson C.G. (1993). Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning 13: 103\u2013130","journal-title":"Machine Learning"},{"key":"9062_CR50","doi-asserted-by":"crossref","first-page":"897","DOI":"10.1287\/opre.12.6.896","volume":"12","author":"J.T. Morgan","year":"1964","unstructured":"Morgan J.T. and Little J.D.C. (1964). Synchronizing traffic signals for maximal bandwidth. Operations Research 12: 897\u2013912","journal-title":"Operations Research"},{"key":"9062_CR51","doi-asserted-by":"crossref","first-page":"2221","DOI":"10.1051\/jp1:1992277","volume":"2","author":"K. Nagel","year":"1992","unstructured":"Nagel K. and Schreckenberg M. (1992). A cellular automaton model for freeway traffic. Journal de Physique I 2: 2221","journal-title":"Journal de Physique I"},{"key":"9062_CR52","unstructured":"Nunes, L., & Oliveira, E. C. (2004). Learning from multiple sources. In N. Jennings, C. Sierra, L. Sonenberg, & M. Tambe (Eds.), Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multi Agent Systems, AAMAS, July 2004 (Vol.\u00a03, pp.\u00a01106\u20131113). New York: IEEE Computer Society."},{"key":"9062_CR53","first-page":"520","volume-title":"Proceedings of the 5th International Workshop on Ant Colony Optimization and Swarm Intelligence, ANTS 2006, Lecture notes in computer science, September 2006","author":"D. Oliveira","year":"2006","unstructured":"Oliveira D. and Bazzan A.L.C. (2006). Traffic lights control with adaptive group formation based on swarm intelligence. In: Dorigo, M., Gambardella, L.M., Birattari, M., Martinoli, A., Poli, R. and Stuetzle, T. (eds) Proceedings of the 5th International Workshop on Ant Colony Optimization and Swarm Intelligence, ANTS 2006, Lecture notes in computer science, September 2006., pp 520\u2013521. Springer, Berlin"},{"key":"9062_CR54","doi-asserted-by":"crossref","unstructured":"Oliveira, D., Bazzan, A. L. C., & Lesser, V. (2005). Using cooperative mediation to coordinate traffic lights: A case study. In Proceedings of the 4th International Joint Conference on Autonomous Agents and Multi Agent Systems (AAMAS), July 2005 (pp.\u00a0463\u2013470). New York: IEEE Computer Society.","DOI":"10.1145\/1082473.1082544"},{"key":"9062_CR55","doi-asserted-by":"crossref","unstructured":"Oliveira, D., Ferreira, P. R., Jr., Bazzan, A. L. C., & Kl\u00fcgl, F. (2004). A swarm-based approach for selection of signal plans in urban scenarios. In Proceedings of 4th International Workshop on Ant Colony Optimization and Swarm Intelligence\u2014ANTS 2004, Lecture notes in computer science (Vol.\u00a03172, pp.\u00a0416\u2013417). Berlin, Germany.","DOI":"10.1007\/978-3-540-28646-2_43"},{"key":"9062_CR56","volume-title":"Urban travel demand modeling: From individual choices to general equilibrium","author":"N. Oppenheim","year":"1995","unstructured":"Oppenheim N. (1995). Urban travel demand modeling: From individual choices to general equilibrium. Wiley, New York, NY"},{"key":"9062_CR57","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1007\/3-7643-7363-6_4","volume-title":"Applications of agent technology in traffic and transportation, Whitestein series in software agent technologies and autonomic computing","author":"S. Ossowski","year":"2005","unstructured":"Ossowski S., Fern\u00e1ndez A., Serrano J.M., P\u00e9rez-de-la-Cruz J.L., Belmonte M.V. and Hern\u00e1ndez J.Z. (2005). Designing multiagent decision support systems for traffic management. In: Kl\u00fcgl, F., Bazzan, A.L.C., and Ossowski, S. (eds) Applications of agent technology in traffic and transportation, Whitestein series in software agent technologies and autonomic computing., pp 51\u201367. Birkh\u00e4user, Basel"},{"issue":"3","key":"9062_CR58","doi-asserted-by":"crossref","first-page":"387","DOI":"10.1007\/s10458-005-2631-2","volume":"11","author":"L. Panait","year":"2005","unstructured":"Panait L. and Luke S. (2005). Cooperative multi-agent learning: The state of the art. Autonomous Agents and Multi-Agent Systems 11(3): 387\u2013434","journal-title":"Autonomous Agents and Multi-Agent Systems"},{"key":"9062_CR59","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1007\/0-306-48058-1_8","volume-title":"Handbook of transportation science (Chap. 8)","author":"M. Papageorgiou","year":"2003","unstructured":"Papageorgiou M. (2003). Traffic control. In: Hall, R.W. (eds) Handbook of transportation science (Chap. 8)., pp 243\u2013277. Kluwer Academic Pub, Dordrecht"},{"issue":"12","key":"9062_CR60","doi-asserted-by":"crossref","first-page":"2043","DOI":"10.1109\/JPROC.2003.819610","volume":"91","author":"M. Papageorgiou","year":"2003","unstructured":"Papageorgiou M., Diakaki C., Dinopoulou V., Kotsialos A. and Wang Y. (2003). Review of road traffic control strategies. Proceedings of the IEEE 91(12): 2043\u20132067","journal-title":"Proceedings of the IEEE"},{"key":"9062_CR61","doi-asserted-by":"crossref","unstructured":"Paruchuri, P., Pullalarevu, A. R., & Karlapalem, K. (2002). Multi agent simulation of unorganized traffic. In Proceedings of the 1st International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS) (Vol.\u00a01, pp.\u00a0176\u2013183). Bologna, Italy: ACM Press.","DOI":"10.1145\/544741.544786"},{"key":"9062_CR62","doi-asserted-by":"crossref","first-page":"449","DOI":"10.1145\/1082473.1082542","volume-title":"Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems","author":"M. Rigolli","year":"2005","unstructured":"Rigolli M. and Brady M. (2005). Towards a behavioural traffic monitoring system. In: Dignum, F., Dignum, V., Koenig, S., Kraus, S., Singh, M.P. and Wooldridge, M. (eds) Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems., pp 449\u2013454. ACM Press, New York"},{"key":"9062_CR63","unstructured":"Robertson, D. I. (1969). TRANSYT: A traffic network study tool. Rep. LR 253, Road Res. Lab., London."},{"key":"9062_CR64","doi-asserted-by":"crossref","first-page":"637","DOI":"10.1146\/annurev.en.37.010192.003225","volume":"37","author":"G.E. Robinson","year":"1992","unstructured":"Robinson G.E. (1992). Regulation of division of labor in insect societies. Annual Review of Entomology 37: 637\u2013665","journal-title":"Annual Review of Entomology"},{"key":"9062_CR65","first-page":"120","volume-title":"Proceedings of the Informatik 2006\u2014Informatik f\u00fcr Menschen, Lecture notes in informatics (Vol P-93)","author":"F. Rochner","year":"2006","unstructured":"Rochner F., Prothmann H., Branke J., M\u00fcller-Schloer C. and Schmeck H. (2006). An organic architecture for traffic light controllers. In: Hochberger, C. and Liskowsky, R. (eds) Proceedings of the Informatik 2006\u2014Informatik f\u00fcr Menschen, Lecture notes in informatics (Vol P-93)., pp 120\u2013127. K\u00f6llen Verlag, Berlin"},{"key":"9062_CR66","volume-title":"Traffic engineering","author":"R.P. Roess","year":"2004","unstructured":"Roess R.P., Prassas E.S. and McShane W.R. (2004). Traffic engineering. Prentice Hall, Englewood Cliffs, NJ"},{"key":"9062_CR67","doi-asserted-by":"crossref","first-page":"181","DOI":"10.1007\/3-7643-7363-6_12","volume-title":"Applications of agent technology in traffic and transportation, Whitestein series in software agent technologies and autonomic computing","author":"R. Rossetti","year":"2005","unstructured":"Rossetti R. and Liu R. (2005). A dynamic network simulation model based on multi-agent systems. In: Kl\u00fcgl, F., Bazzan, A.L.C. and Ossowski, S. (eds) Applications of agent technology in traffic and transportation, Whitestein series in software agent technologies and autonomic computing. pp 181\u2013192. Birkh\u00e4user, Basel"},{"issue":"5\u20136","key":"9062_CR68","first-page":"47","volume":"10","author":"R.J.F. Rossetti","year":"2002","unstructured":"Rossetti R.J.F., Bordini R.H., Bazzan A.L.C., Bampi S., Liu R. and Van Vliet D. (2002). Using BDI agents to improve driver modelling in a commuter scenario. Transportation Research Part C: Emerging Technologies 10(5\u20136): 47\u201372","journal-title":"Transportation Research Part C: Emerging Technologies"},{"key":"9062_CR69","unstructured":"Shoham, Y., Powers, R., & Grenager, T. (2003). Multi-agent reinforcement learning: A critical survey. Unpublished survey."},{"issue":"7","key":"9062_CR70","doi-asserted-by":"crossref","first-page":"365","DOI":"10.1016\/j.artint.2006.02.006","volume":"171","author":"Y. Shoham","year":"2007","unstructured":"Shoham Y., Powers R. and Grenager T. (2007). If multi-agent learning is the answer, what is the question?. Artificial Intelligence 171(7): 365\u2013377","journal-title":"Artificial Intelligence"},{"key":"9062_CR71","first-page":"217","volume-title":"Proceedings of the 23rd International Conference on Machine Learning ICML, June 2006","author":"B.C.d. Silva","year":"2006","unstructured":"Silva B.C.d., Basso E.W., Bazzan A.L.C. and Engel P.M. (2006). Dealing with non-stationary environments using context detection. In: Cohen, W.W. and Moore, A. (eds) Proceedings of the 23rd International Conference on Machine Learning ICML, June 2006, pp 217\u2013224. ACM Press, New York"},{"key":"9062_CR72","unstructured":"Silva, B. C. d., Oliveira, D. d., Bazzan, A. L. C., & Basso, E. W. (2006). Adaptive traffic control with reinforcement learning. In A. L. C. Bazzan, B. Chaib-Draa, F. Kl\u00fcgl, & S. Ossowski (Eds.), Proceedings of the 4th Workshop on Agents in Traffic and Transportation (at AAMAS 2006), May 2006 (pp.\u00a080\u201386). Hakodate, Japan."},{"key":"9062_CR73","unstructured":"Steingrover, M., Schouten, R., Peelen, S., Nijhuis, E., & Bakker, B. (2005). Reinforcement learning of traffic light controllers adapting to traffic congestion. In K. Verbeeck, K. Tuyls, A. Now\u00e9, B. Manderick, & B. Kuijpers (Eds.), Proceedings of the 17th Belgium-Netherlands Conference on Artificial Intelligence (BNAIC 2005), October 2005 (pp.\u00a0216\u2013223). Brussels, Belgium: Koninklijke Vlaamse Academie van Belie voor Wetenschappen en Kunsten."},{"key":"9062_CR74","unstructured":"Stone, P. (2007). Learning and multiagent reasoning for autonomous agents. In The 20th International Joint Conference on Artificial Intelligence, January 2007 (pp.\u00a013\u201330)."},{"issue":"7","key":"9062_CR75","doi-asserted-by":"crossref","first-page":"402","DOI":"10.1016\/j.artint.2006.12.005","volume":"171","author":"P. Stone","year":"2007","unstructured":"Stone P. (2007). Multiagent learning is not the answer. It is the question. Artificial Intelligence 171(7): 402\u2013405","journal-title":"It is the question. Artificial Intelligence"},{"key":"9062_CR76","doi-asserted-by":"crossref","unstructured":"Sutton, R. S. (1990). Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. In Proceedings of the 7th International Conference on Machine Learning (pp.\u00a0216\u2013224). Austin, Texas.","DOI":"10.1016\/B978-1-55860-141-3.50030-4"},{"key":"9062_CR77","doi-asserted-by":"crossref","unstructured":"Tan, M. (1993). Multi-agent reinforcement learning: Independent vs. cooperative agents. In Proceedings of the 10th International Conference on Machine Learning (ICML 1993), June 1993 (pp.\u00a0330\u2013337). Los Altos, CA: Morgan Kaufmann.","DOI":"10.1016\/B978-1-55860-307-3.50049-6"},{"key":"9062_CR78","unstructured":"TRANSYT-7F. (1988). TRANSYT-7F user\u2019s manual. Transportation Research Center, University of Florida."},{"key":"9062_CR79","unstructured":"Tumer, K., Welch, Z. T., & Agogino, A. (2008). Aligning social welfare and agent preferences to alleviate traffic congestion. In L. Padgham, D. Parkes, J. M\u00fcller, & S. Parsons (Eds.), Proceedings of the 7th International Conference on Autonomous Agents and Multiagent Systems, May 2008 (pp.\u00a0655\u2013662). Estoril: IFAAMAS."},{"key":"9062_CR80","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/978-1-4419-8909-3_1","volume-title":"Collectives and the design of complex systems","author":"K. Tumer","year":"2004","unstructured":"Tumer K. and Wolpert D. (2004). A survey of collectives. In: Tumer, K. and Wolpert, D. (eds) Collectives and the design of complex systems., pp 1\u201342. Springer, New York"},{"key":"9062_CR81","unstructured":"Tuyls, K. (2004). Learning in multi-agent systems, an evolutionary game theoretic approach. PhD thesis, Vrije Universiteit Brussel."},{"issue":"1","key":"9062_CR82","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1007\/s10458-005-3783-9","volume":"12","author":"K. Tuyls","year":"2006","unstructured":"Tuyls K., Hoen P.J. and Vanschoenwinkel B. (2006). An evolutionary dynamical analysis of multi-agent learning in iterated games. Autonomous Agents and Multiagent Systems 12(1): 115\u2013153","journal-title":"Autonomous Agents and Multiagent Systems"},{"issue":"7","key":"9062_CR83","doi-asserted-by":"crossref","first-page":"406","DOI":"10.1016\/j.artint.2007.01.004","volume":"171","author":"K. Tuyls","year":"2007","unstructured":"Tuyls K. and Parsons S. (2007). What evolutionary game theory tells us about multiagent learning. Artificial Intelligence 171(7): 406\u2013416","journal-title":"Artificial Intelligence"},{"key":"9062_CR84","unstructured":"van Katwijk, R. T., van Koningsbruggen, P., Schutter, B. D., & Hellendoorn, J. (2005). A test bed for multi-agent control systems in road traffic management. In F. Kl\u00fcgl, A. L. C. Bazzan, & S. Ossowski (Eds.), Applications of agent technology in traffic and transportation, Whitestein series in software agent technologies and autonomic computing (pp.\u00a0113\u2013131). Basel: Birkh\u00e4user."},{"key":"9062_CR85","unstructured":"Verbeeck, K., Now\u00e9, A., Peeters, M., & Tuyls, K. (2005). Multi-agent reinforcement learning in stochastic games and multi-stage games. In D. K. et\u00a0al. (Eds.), Adaptive agents and MAS II, LNAI (Vol.\u00a03394, pp.\u00a0275\u2013294). Berlin: Springer."},{"key":"9062_CR86","doi-asserted-by":"crossref","first-page":"752","DOI":"10.1145\/1160633.1160766","volume-title":"Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems","author":"T. Vu","year":"2006","unstructured":"Vu T., Powers R. and Shoham Y. (2006). Learning against multiple opponents. In: Nakashima, H., Wellman, M.P., Weiss, G., and Stone, P. (eds) Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems., pp 752\u2013760. Hakodate, Japan"},{"issue":"5\u20136","key":"9062_CR87","first-page":"73","volume":"10","author":"J. Wahle","year":"2002","unstructured":"Wahle J., Bazzan A.L.C. and Kluegl F. (2002). The impact of real time information in a two route scenario using agent based simulation. Transportation Research Part C: Emerging Technologies 10(5\u20136): 73\u201391","journal-title":"Transportation Research Part C: Emerging Technologies"},{"key":"9062_CR88","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1007\/978-3-642-59751-0_8","volume-title":"Traffic and granular flow","author":"J. Wahle","year":"2000","unstructured":"Wahle J., Bazzan A.L.C., Kl\u00fcgl F. and Schreckenberg M. (2000). Anticipatory traffic forecast using multi-agent techniques. In: Helbing, D., Hermann, H.J., Schreckenberg, M., and Wolf, D. (eds) Traffic and granular flow., pp 87\u201392. Springer, New york"},{"issue":"3\u20134","key":"9062_CR89","doi-asserted-by":"crossref","first-page":"669","DOI":"10.1016\/S0378-4371(00)00510-0","volume":"287","author":"J. Wahle","year":"2000","unstructured":"Wahle J., Bazzan A.L.C., Kl\u00fcgl F. and Schreckenberg M. (2000). Decision dynamics in a traffic scenario. Physica A 287(3\u20134): 669\u2013681","journal-title":"Physica A"},{"key":"9062_CR90","doi-asserted-by":"crossref","unstructured":"Wardrop, J. G. (1952). Some theoretical aspects of road traffic research. In Proceedings of the Institute of Civil Engineers (Vol.\u00a02, pp.\u00a0325\u2013378). UK.","DOI":"10.1680\/ipeds.1952.11362"},{"issue":"3","key":"9062_CR91","first-page":"279","volume":"8","author":"C.J.C.H. Watkins","year":"1992","unstructured":"Watkins C.J.C.H. and Dayan P. (1992). Q-learning. Machine Learning 8(3): 279\u2013292","journal-title":"Machine Learning"},{"key":"9062_CR92","unstructured":"Wiering, M. (2000). Multi-agent reinforcement learning for traffic light control. In Proceedings of the 17th International Conference on Machine Learning (ICML 2000) (pp.\u00a01151\u20131158). Stanford."}],"container-title":["Autonomous Agents and Multi-Agent Systems"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10458-008-9062-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s10458-008-9062-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10458-008-9062-9","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,31]],"date-time":"2025-01-31T21:11:57Z","timestamp":1738357917000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10458-008-9062-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,9,7]]},"references-count":92,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2009,6]]}},"alternative-id":["9062"],"URL":"https:\/\/doi.org\/10.1007\/s10458-008-9062-9","relation":{},"ISSN":["1387-2532","1573-7454"],"issn-type":[{"value":"1387-2532","type":"print"},{"value":"1573-7454","type":"electronic"}],"subject":[],"published":{"date-parts":[[2008,9,7]]}}}