Toll-Based Learning for Minimising Congestion under Heterogeneous Preferences

Toll-Based Learning for Minimising Congestion under Heterogeneous Preferences ■

Gabriel De Oliveira Ramos, Roxana Radulescu, Ann Nowe, Anderson Tavares

Abstract ■

Multiagent reinforcement learning has shown its potential for tackling real world problems, like traffic. We consider the toll-based route choice problem, where self-interested agents repeatedly commute attempting to minimise their travel costs. In this paper, we introduce Generalised Toll-based Q-learning (GTQ-learning), a multiagent reinforcement learning algorithm capable of realigning agents' heterogeneous preferences over travel time and monetary expenses to obtain a system-efficient equilibrium. GTQ-learning also includes a mechanism to enforce agents to truthfully report their preferences. Our theoretical analysis and empirical results show that GTQ-learning minimises congestion on realistic road networks.