Toll-Based Learning for Minimising Congestion under Heterogeneous Preferences
 
Toll-Based Learning for Minimising Congestion under Heterogeneous Preferences 
 
Gabriel De Oliveira Ramos, Roxana Radulescu, Ann Nowe, Anderson Tavares
 
Abstract 

Multiagent reinforcement learning has shown its potential for tackling real world problems, like traffic. We consider the toll-based route choice problem, where self-interested agents repeatedly commute attempting to minimise their travel costs. In this paper, we introduce Generalised Toll-based Q-learning (GTQ-learning), a multiagent reinforcement learning algorithm capable of realigning agents' heterogeneous preferences over travel time and monetary expenses to obtain a system-efficient equilibrium. GTQ-learning also includes a mechanism to enforce agents to truthfully report their preferences. Our theoretical analysis and empirical results show that GTQ-learning minimises congestion on realistic road networks.