Model-Based Reinforcement Learning in Multi-Objective Environments with a Distributional Critic
 
Model-Based Reinforcement Learning in Multi-Objective Environments with a Distributional Critic 
 
Willem Röpke, Diederik M Roijers, Ann Nowé, Roxana Radulescu, Hendrik Baier