Reinforcement learning

Reinforcement learning deals with sequential decision-making problems where an agent: Observes t...

The total reward is typically defined as the expected discounted cumulative reward. We can define...
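
As a rough illustration of the expected discounted cumulative reward, here is a minimal sketch in Python. It assumes finite episodes given as lists of per-step rewards and a discount factor called gamma; the function names, the gamma symbol, and its default value are illustrative assumptions, not taken from the text. The expectation is approximated by averaging over sampled episodes.

```python
def discounted_return(rewards, gamma=0.99):
    """Sum of gamma**t * rewards[t] over one finite episode.

    `gamma` is an assumed discount factor in [0, 1].
    """
    total = 0.0
    # Walk backwards so each step applies G_t = r_t + gamma * G_{t+1}.
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total


def expected_return(episodes, gamma=0.99):
    """Monte Carlo estimate: average discounted return over sampled episodes."""
    returns = [discounted_return(episode, gamma) for episode in episodes]
    return sum(returns) / len(returns)
```

For example, with gamma = 0.5 the reward sequence [1, 1, 1] gives 1 + 0.5 + 0.25 = 1.75. A smaller gamma weights immediate rewards more heavily, while a gamma close to 1 makes distant rewards count almost as much as immediate ones.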

We apply a policy to explore the environment and collect information, and we keep a progre...