Reinforcement Learning

Course load: 80 hours

Prerequisites

Reinforcement Learning (RL). RL Algorithms. How to build a reinforcement learning solution.

At the end of the course, the student should be able to:

Build a Reinforcement Learning system for sequential decision-making.
Formalize a task as a Reinforcement Learning problem and implement a solution to it.
Understand the space of RL algorithms (Sarsa, Q-learning, Policy Gradients, and more).
Understand how RL fits under the broader umbrella of machine learning, and how it complements supervised and unsupervised learning.

Introduction to Reinforcement Learning.
Implementation of autonomous agents using reinforcement learning.
Temporal-Difference learning.
Q-Learning algorithm.
Sarsa algorithm.
Policy Gradients and Proximal Policy Optimization (PPO).
Deep Q-Learning algorithms.
Implementations of autonomous agents using OpenAI's Gym project and Kaggle's library for RL.
Reinforcement learning use cases.
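
As an illustration of how two of the topics above differ, the following is a minimal sketch of tabular Q-Learning and Sarsa. It is not course material: the five-state chain environment, the function names (step, epsilon_greedy, train) and the hyperparameter values are all assumptions chosen to keep the example self-contained, and it deliberately avoids any dependency on Gym. The only difference between the two methods is the bootstrap target: Q-Learning uses the greedy value of the next state, Sarsa uses the value of the action that was actually selected.

```python
import random

N_STATES = 5      # hypothetical toy chain: states 0..4, state 4 is terminal
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Toy dynamics: reaching the last state gives reward 1 and ends the episode."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def epsilon_greedy(q, state, epsilon, rng):
    """Pick a random action with probability epsilon, else a greedy one (ties broken at random)."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    best = max(q[state])
    return rng.choice([a for a in ACTIONS if q[state][a] == best])


def train(method="q_learning", episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Return a Q-table learned with either 'q_learning' or 'sarsa'."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        action = epsilon_greedy(q, state, epsilon, rng)
        done = False
        while not done:
            next_state, reward, done = step(state, action)
            # The next action is selected before the update, as in Sarsa.
            next_action = epsilon_greedy(q, next_state, epsilon, rng)
            if done:
                target = reward  # no bootstrapping from a terminal state
            elif method == "sarsa":
                # On-policy target: value of the action actually selected.
                target = reward + gamma * q[next_state][next_action]
            else:
                # Off-policy target: value of the greedy action.
                target = reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state, action = next_state, next_action
    return q


if __name__ == "__main__":
    for method in ("q_learning", "sarsa"):
        q = train(method)
        greedy = [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]
        print(method, greedy)
```

Running the script prints the greedy action for each non-terminal state under both methods. In an environment this small the two tend to agree; the difference between on-policy and off-policy targets matters more when exploratory actions are costly.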

GÉRON, A. Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd ed., O'Reilly, 2021.
SUTTON, R.; BARTO, A. Reinforcement Learning: An Introduction. Second Edition. The MIT Press, 2018.
VAN HASSELT, H.; GUEZ, A.; SILVER, D. Deep reinforcement learning with double Q-learning. Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 30, No. 1, 2016.
SCHULMAN, J.; WOLSKI, F.; DHARIWAL, P.; RADFORD, A.; KLIMOV, O. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347, 2017.
BROCKMAN, G. et al. OpenAI Gym. arXiv preprint arXiv:1606.01540, 2016.

NORVIG, P.; RUSSELL, S. Inteligência Artificial, 3rd ed., Campus Elsevier, 2013.
SILVER, D.; SINGH, S.; PRECUP, D.; SUTTON, R. Reward is enough. Artificial Intelligence, Vol. 299, 2021.
MuZero: Mastering Go, chess, shogi and Atari without rules. Published December 2020.
SILVER, D.; HUBERT, T.; SCHRITTWIESER, J.; ANTONOGLOU, I.; LAI, M.; GUEZ, A. et al. A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play. Science 362, 1140-1144, 2018.
MNIH, V.; KAVUKCUOGLU, K.; SILVER, D.; GRAVES, A.; ANTONOGLOU, I.; WIERSTRA, D.; RIEDMILLER, M. Playing Atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602, 2013.