Course load: 80
- Proficiency in Python.
- Basic machine learning knowledge.
Reinforcement Learning (RL). RL Algorithms. How to build a reinforcement learning solution.
At the end of the course, the student should be able to:
- Build a Reinforcement Learning system for sequential decision-making.
- Understand how to formalize your task as a Reinforcement Learning problem, and how to implement a solution.
- Understand the space of RL algorithms (Sarsa, Q-learning, Policy Gradients, and more).
- Understand how RL fits under the broader umbrella of machine learning, and how it complements supervised and unsupervised learning.
- Introduction to Reinforcement Learning.
- Implementation of autonomous agents using reinforcement learning.
- Temporal-Difference learning.
- Q-Learning algorithm.
- Sarsa algorithm.
- Policy Gradients and Proximal Policy Optimization (PPO).
- Deep Q-Learning algorithms.
- Implementations of autonomous agents using OpenAI's Gym project and Kaggle's library for RL.
- Reinforcement learning use cases.
- GÉRON, A. Hands-on Machine Learning with Scikit-learn, Keras, and TensorFlow, 2ª ed., O'Reilly, 2021.
- SUTTON, R.; BARTO, A. Reinforcement Learning: An Introduction. Second Edition. The MIT Press, 2018.
- Van Hasselt, H., Guez, A. and Silver, D., 2016, March. Deep reinforcement learning with double q-learning. In Proceedings of the AAAI conference on artificial intelligence (Vol. 30, No. 1).
- Schulman, J., Wolski, F., Dhariwal, P., Radford, A. and Klimov, O., 2017. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347.
- Brockman, G. et al., 2016. Openai gym. arXiv preprint arXiv:1606.01540.
- NORVIG, P.; RUSSELL, S., Inteligência Artificial, 3ª ed., Campus Elsevier, 2013.
- SILVER, D.; SINGH S.; PRECUP D.; SUTTON R. Reward is enough. Artificial Intelligence. Vol 299, 2021.
- MuZero: Mastering Go, chess, shogi and Atari without rules. Publicado em Dezembro, 2020.
- SILVER, D.; HUBERT T.; SCHRITTWIESER, J.; ANTONOGLOU, I.; LAI, M.; GUEZ, A. A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play. Science 362, 1140-1144 (2018).
- Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I., Wierstra, D. and Riedmiller, M., 2013. Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602.