Multi-Scale Reward Shaping via an Off-Policy Ensemble
 
Anna Harutyunyan, Tim Brys, Peter Vrancx, Ann Nowé