Reinforcement Learning in POMDPs with Memoryless Options and Option-Observation Initiation Sets
 
Reinforcement Learning in POMDPs with Memoryless Options and Option-Observation Initiation Sets 
 
Denis Steckelmacher, Diederik Roijers, Anna Harutyunyan, Peter Vrancx, Helene Plisnier, Ann Nowe