Most current implementations of Reinforcement Learning agents consider that one agent interacts with one environment, and that the agent and environment run on the same machines. Previous work, such as RL-Glue, went a step in the direction of allowing the agent and environment to be different processes on a computer, but a wider separation of the agent and environment is much less common. In this demonstration, we illustrate how Shepherd, a web-service that allows clients to remotely query a Reinforcement Learning agent for actions, allows multiple people to interact at the same time with a single agent, on their phone, over the Internet, without having to install anything. Shepherd ensures that knowledge obtained from one client (one person in this demonstration) is quickly leveraged to improve the performance of the agent for the other clients.
Plisnier, H, Steckelmacher, D & Nowe, A 2021, 'Shepherd: Reinforcement Learning as a Service with Distributed Execution', Paper presented at 33rd Benelux Conference on Artificial Intelligence and 30th Belgian-Dutch Conference on Machine Learning, Luxembourg, 10/11/21 - 12/11/21 pp. 726-728.
Plisnier, H., Steckelmacher, D., & Nowe, A. (2021). Shepherd: Reinforcement Learning as a Service with Distributed Execution. 726-728. Paper presented at 33rd Benelux Conference on Artificial Intelligence and 30th Belgian-Dutch Conference on Machine Learning, Luxembourg.
@conference{f6e1c45f870547c6b08d6a9be6568ba9,
title = "Shepherd: Reinforcement Learning as a Service with Distributed Execution",
abstract = "Most current implementations of Reinforcement Learning agents consider that one agent interacts with one environment, and that the agent and environment run on the same machines. Previous work, such as RL-Glue, went a step in the direction of allowing the agent and environment to be different processes on a computer, but a wider separation of the agent and environment is much less common. In this demonstration, we illustrate how Shepherd, a web-service that allows clients to remotely query a Reinforcement Learning agent for actions, allows multiple people to interact at the same time with a single agent, on their phone, over the Internet, without having to install anything. Shepherd ensures that knowledge obtained from one client (one person in this demonstration) is quickly leveraged to improve the performance of the agent for the other clients.",
keywords = "Reinforcement Learning",
author = "Helene Plisnier and Denis Steckelmacher and Ann Nowe",
year = "2021",
month = nov,
day = "10",
language = "English",
pages = "726--728",
note = "33rd Benelux Conference on Artificial Intelligence and 30th Belgian-Dutch Conference on Machine Learning, BNAIC/BeneLearn 2021 ; Conference date: 10-11-2021 through 12-11-2021",
url = "https://bnaic2021.uni.lu/",
}