Thursday September 22nd, 2022 / Last updated : Monday May 22nd, 2023 Working Paper

[UTMD-029] Mutation-Driven Follow the Regularized Leader for Last-Iterate Convergence in Zero-Sum Games (by Kenshi Abe, Mitsuki Sakamoto, Atsushi Iwasaki)

Author

Kenshi Abe, Mitsuki Sakamoto, Atsushi Iwasaki

Abstract

In this study, we consider a variant of the Follow the Regularized Leader (FTRL) dynamics in twoplayer zero-sum games. FTRL is guaranteed to converge to a Nash equilibrium when time-averaging the strategies, while a lot of variants suffer from the issue of limit cycling behavior, i.e., lack the last-iterate convergence guarantee. To this end, we propose mutant FTRL (M-FTRL), an algorithm that introduces mutation for the perturbation of action probabilities. We then investigate the continuous-time dynamics of M-FTRL and provide the strong convergence guarantees toward stationary points that approximate Nash equilibria under full-information feedback. Furthermore, our simulation demonstrates that M-FTRL can enjoy faster convergence rates than FTRL and optimistic FTRL under full-information feedback and surprisingly exhibits clear convergence under bandit feedback.

PDF

Categories: Working Paper

[UTMD-029] Mutation-Driven Follow the Regularized Leader for Last-Iterate Convergence in Zero-Sum Games (by Kenshi Abe, Mitsuki Sakamoto, Atsushi Iwasaki)

Author

Abstract

Professor Michihiro Kandori (Vice-director of UTMD) elected as President of the Game Theory Society

[UTMD-030] Anytime Capacity Expansion in Medical Residency Match by Monte Carlo Tree Search (by Kenshi Abe, Junpei Komiyama, Atsushi Iwasaki)