Online Learning in Non-Cooperative Configurable Markov Decision Process
Online Learning in Non-Cooperative Configurable Markov Decision Process Authors: Giorgia Ramponi, Alberto Maria Metelli, Alessandro Concetti, Marcello Restelli Conference: AAAI 2021 Abstract: In the Configurable Markov Decision Processes there are two entities, a Reinforcement Learning agent and a configurator which can modify some parameters of the environment to improve the performance of the agent. What […]