Multi-objective Reinforcement Learning for the Expected Utility of the Return
 
Diederik Roijers, Denis Steckelmacher, Ann Nowé