An Online Learning Approach to a Multi-player N-armed Functional Bandit

Research Output

Congestion games admit at least one pure Nash equilibrium and have a long history of practical use in transport modelling. In this paper we approach the problem of modelling equilibrium in congestion games using a decentralised, multi-player probabilistic approach based on stochastic bandit feedback. Restricting the strategies available to players under the assumption of bounded rationality, we explore an online multi-player exponential weights algorithm for unweighted atomic routing games and compare it with an ϵ-greedy algorithm.

Date:

14 February 2020
Publication Status:

Published
Publisher:

Springer International Publishing
DOI:

10.1007/978-3-030-40616-5_41
Funders:

Historic Funder (pre-Worktribe)

http://researchrepository.napier.ac.uk/output/2560144

Citation

O’Neill, S., Bagdasar, O., & Liotta, A. (2020). An Online Learning Approach to a Multi-player N-armed Functional Bandit. In Numerical Computations: Theory and Algorithms (pp. 438-445). https://doi.org/10.1007/978-3-030-40616-5_41

Authors

Prof Antonio Liotta PhD MSc

Professor of Data Science and Intelligent Systems
School of Computing

0131 455 2850

A.Liotta@napier.ac.uk

Keywords

Congestion games, Online learning, Multi-armed bandit

Available Documents

Files are currently unavailable for download; please contact repository@napier.ac.uk to request a copy.