On Following Pareto-Optimal Policies in Multi-Objective Planning and Reinforcement Learning
 
On Following Pareto-Optimal Policies in Multi-Objective Planning and Reinforcement Learning 
 
Diederik M. Roijers, Willem Röpke, Ann Nowe, Roxana Radulescu