We propose a new method to generate a program from a Reinforcement Learning policy. Compared to previous methods, we exploit more RL-specific elements such as the critic value-network. Improved actions from the critic are used to steer a Genetic Programming process via a fitness function.
Deproost, S, Steckelmacher, D & Nowe, A 2024, 'Programmatic Reinforcement Learning using Critic-Moderated Evolution', BNAIC/BeNeLearn 2024: Joint International Scientific Conferences on AI and Machine Learning, Utrecht, Netherlands, 18/11/24 - 20/11/24.
Deproost, S., Steckelmacher, D., & Nowe, A. (2024). Programmatic Reinforcement Learning using Critic-Moderated Evolution. Poster session presented at BNAIC/BeNeLearn 2024: Joint International Scientific Conferences on AI and Machine Learning, Utrecht, Netherlands.
@conference{9b2db174ac0641ba9bec1057285a292d,
title = "Programmatic Reinforcement Learning using Critic-Moderated Evolution",
abstract = "We propose a new method to generate a program from a Reinforcement Learning policy. Compared to previous methods, we exploit more RL-specific elements such as the critic value-network. Improved actions from the critic are used to steer a Genetic Programming process via a fitness function.",
keywords = "Deep Reinforcement Learning, Genetic Programming, Explainable AI",
author = "Senne Deproost and Denis Steckelmacher and Ann Nowe",
year = "2024",
month = nov,
day = "18",
language = "English",
note = "BNAIC/BeNeLearn 2024: Joint International Scientific Conferences on AI and Machine Learning, BNAIC/BeNeLearn 2024 ; Conference date: 18-11-2024 Through 20-11-2024",
url = "https://bnaic2024.sites.uu.nl/, https://bnaic2024.sites.uu.nl",
}