Learning Automata as a Basis for Multi Agent Reinforcement Learning

Nowé, Ann; Verbeeck, Katja; Peeters, Maarten

doi:10.1007/11691839_3

Ann Nowé²²,
Katja Verbeeck²² &
Maarten Peeters²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3898))

Included in the following conference series:

International Workshop on Learning and Adaption in Multi-Agent Systems

1170 Accesses
37 Citations

Abstract

In this paper we summarize some important theoretical results from the domain of Learning Automata. We start with single stage, single agent learning schema’s, and gradually extend the setting to multi-stage multi agent systems. We argue that the theory of Learning Automata is an ideal basis to build multi agent learning algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Multi-agent deep reinforcement learning: a survey

Article Open access 15 April 2021

A New Approach for Multi-agent Reinforcement Learning

Multi-agent Path Finding for Timed Tasks Using Evolutionary Games

References

Bonabeau, E., Dorigo, M., Theraulaz, G.: Swarm Intelligence, From Natural to Artificial Systems. Santa Fe Institute studies in the sciences of complexity. Oxford University Press, Oxford (1999)
MATH Google Scholar
Boutilier, C.: Planning, learning and coordination in multiagent decision processes. In: Proceedings of the 6th Conference on Theoretical Aspects of Rationality and Knowledge, Renesse, Holland, pp. 195–210 (1996)
Google Scholar
Boutilier, C.: Sequential optimality and coordination in multiagent systems. In: Proceedings of the 16th International Joint Conference on Artificial Intelligence, Stockholm, Sweden, pp. 478–485 (1999)
Google Scholar
Bush, R.R., Mosteller, F.: Stochastic Models for Learning. Wiley, New York (1958)
MATH Google Scholar
Claus, C., Boutilier, C.: The dynamics of reinforcement learning in cooperative multiagent systems. In: Proceedings of the 15th National Conference on Artificial Intelligence, pp. 746–752 (1998)
Google Scholar
Colorni, A., Dorigo, M., Maffioli, F., Maniezzo, V., Righini, G., Trubian, M.: Heuristics from nature for hard combinatorial optimization problems. International Transactions in Operational Research (1996)
Google Scholar
Dorigo, M., Caro, G.D., Gambardella, L.M.: Ant algorithms for discrete optimization. Artificial Life 5, 137–172 (1999)
Article Google Scholar
Dorigo, M., Caro, G.D.: The ant colony optimization meta-heuristic. In: Corne, D., Dorigo, M., Glover, F. (eds.) New Ideas In Optimization. McGraw-Hill, Maidenhaid (1999)
Google Scholar
Dorigo, M., Maniezzo, V., Colorni, A.: The ant system: Optimization by a colony of cooperating agents. IEE Transactions on Systems, Man, and Cybernetics (1996)
Google Scholar
Dorigo, M., Stützle, T.: Ant Colony Optimization. MIT Press, Cambridge (2004)
MATH Google Scholar
Littman, M.: Markov games as a framework for multi-agent reinforcement learning. In: Proceedings of the 11th International Conference on Machine Learning, pp. 322–328 (1994)
Google Scholar
Narendra, K., Thathachar, M.: Learning Automata: An Introduction. Prentice- Hall International, Inc., Upper Saddle River (1989)
MATH Google Scholar
Narendra, K.S., Parthasarathy, K.: Learning automata approach to hierarchical multiobjective analysis. Technical Report Report No. 8811, Electrical Engineering Yale University, New Haven, Connecticut (1988)
Google Scholar
Oommen, B.J., Roberts, T.D.: Continuous learning automata solutions to the capacity assignment problem. IEEE Transactions on Computations 49, 608–620 (2000)
Article Google Scholar
Sutton, R., Barto, A.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
Google Scholar
Tsetlin, M.L.: Automaton theory and modelling of biological systems. Mathematics in Science and Engineering, vol. 102 (1973)
Google Scholar
Unsal, C., Kachroo, P., Bay, J.S.: Multiple stochastic learning automata for vehicule path control in an automated highway system. IEEE Transactions on Systems, Man, and Cybernetics, Part A 29, 120–128 (1999)
Article Google Scholar
Verbeeck, K.: Coordinated Exploration in Multi-Agent Reinforcement Learning. PhD thesis, Computational Modeling Lab, Vrije Universiteit Brussel, Belgium (2004)
Google Scholar
Verbeeck, K., Nowé, A., Tuyls, K., Peeters, M.: Multi-agent reinforcement learning in stochastic single and multi-stage games. In: Kudenko, D., Kazakov, D., Alonso, E. (eds.) AAMAS 2004. LNCS (LNAI), vol. 3394, pp. 275–294. Springer, Heidelberg (2005)
Chapter Google Scholar
Wheeler, R.M., Narendra, K.S.: Decentralized learning in finite markov chains. IEEE Transactions on Automatic Control AC-31, 519–526 (1986)
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

Computational Modeling Lab, Vrije Universiteit Brussel, Pleinlaan 2, Brussel, 1050, Belgium
Ann Nowé, Katja Verbeeck & Maarten Peeters

Authors

Ann Nowé
View author publications
Search author on:PubMed Google Scholar
Katja Verbeeck
View author publications
Search author on:PubMed Google Scholar
Maarten Peeters
View author publications
Search author on:PubMed Google Scholar

Editor information

Editors and Affiliations

MICC-IKAT, Universiteit Maastricht, The Netherlands
Karl Tuyls
Center for Mathematics and Computer Science (CWI), Kruislaan 413, P.O. Box 94079, 1090, Amsterdam, GB, The Netherlands
Pieter Jan’t Hoen
KaHo Sint-Lieven, Information Technology Group, Gebr. Desmetstraat 1, 9000, Gent, Belgium
Katja Verbeeck
Department of Mathematical and Computer Science, University of Tulsa, USA
Sandip Sen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nowé, A., Verbeeck, K., Peeters, M. (2006). Learning Automata as a Basis for Multi Agent Reinforcement Learning. In: Tuyls, K., Hoen, P.J., Verbeeck, K., Sen, S. (eds) Learning and Adaption in Multi-Agent Systems. LAMAS 2005. Lecture Notes in Computer Science(), vol 3898. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11691839_3

Download citation

DOI: https://doi.org/10.1007/11691839_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33053-0
Online ISBN: 978-3-540-33059-2
eBook Packages: Computer ScienceComputer Science (R0)Springer Nature Proceedings Computer Science

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Publish with us

Policies and ethics