Reinforcement learning
Introduction and Markov decision processes
Reinforcement learning deals with sequential decision-making problems where an agent: Observes t...
Value function and Q-function
The total reward is typically defined as the expected discounted cumulative reward. We can define...
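To make the discounted cumulative reward concrete, here is a minimal sketch that computes it for a single sampled trajectory. The function name `discounted_return`, the discount factor `gamma`, and the sample rewards are illustrative assumptions, not taken from the text. The expectation in the definition would then be approximated by averaging this quantity over many trajectories.

```python
def discounted_return(rewards, gamma):
    """Return sum over t of gamma**t * rewards[t] for one finite trajectory."""
    total = 0.0
    # Walk the trajectory backwards so each step applies
    # G_t = r_t + gamma * G_{t+1}, starting from G = 0 after the last step.
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total


# Hypothetical three-step trajectory: 1 + 0.5 * 0 + 0.25 * 2
example_return = discounted_return([1.0, 0.0, 2.0], gamma=0.5)
```

With a discount factor below 1, rewards received later contribute less to the total, which is what keeps the sum finite on long trajectories when rewards are bounded.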
Q-learning
We apply a policy to explore the environment in order to collect information and we keep a progre...
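The idea of exploring with a policy while maintaining a running estimate can be sketched with tabular Q-learning on a toy problem. Everything specific here is an assumption made for illustration: the five-state chain environment, the epsilon-greedy exploration policy, and the values of the learning rate `alpha`, discount factor `gamma`, and exploration rate `epsilon`.

```python
import random


def q_learning(n_states=5, episodes=500, alpha=0.1, gamma=0.9,
               epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical chain: states 0..n_states-1,
    action 0 moves left, action 1 moves right, reward 1 on reaching the end."""
    rng = random.Random(seed)
    n_actions = 2
    goal = n_states - 1
    q = [[0.0] * n_actions for _ in range(n_states)]

    def step(state, action):
        # Deterministic toy dynamics; the episode ends at the goal state.
        next_state = min(state + 1, goal) if action == 1 else max(state - 1, 0)
        reward = 1.0 if next_state == goal else 0.0
        return next_state, reward, next_state == goal

    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy exploration, breaking ties between equal values at random.
            if rng.random() < epsilon:
                action = rng.randrange(n_actions)
            else:
                best = max(q[state])
                action = rng.choice(
                    [a for a in range(n_actions) if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Bootstrapped target: no future value once the episode has ended.
            target = reward if done else reward + gamma * max(q[next_state])
            # Move the running estimate a fraction alpha toward the target.
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


q_table = q_learning()
```

In this sketch the learned table ends up preferring the "move right" action in every non-terminal state, because the update propagates the goal reward backwards through the chain one step at a time.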