CSAIL Event Calendar: Previous Series
Balancing Multiple Sources of Reward in Reinforcement Learning
Speaker: Christian Shelton , MIT AI Lab/Center for Biological and Computation Learning
For many problems which would be natural for reinforcement learning, the reward signal is not a single scalar value but has multiple scalar components. Examples of such problems include agents with multiple goals (e.g., a financial portfolio manager balancing profit and risk) and agents with multiple users (e.g., a home entertainment system). Creating a single reward value by combining the multiple components can throw away vital information and can lead to incorrect solutions. We describe the multiple reward source problem and discuss the problems with applying traditional reinforcement learning. We then present an new algorithm for finding a solution and results on simulated environments.