In this Brev.dev Concepts video, Harper Carroll (Head of AI/ML) covers the basics of reinforcement learning, exploration and exploitation, model-based vs. model-free methods, Q-learning, Q*, and temporal difference learning. It is accessible to those of all backgrounds, and includes a little math for those interested.
Find me on 𝕏: [ Ссылка ]
Join our community on Discord: [ Ссылка ]
AI/ML Tutorial Notebooks: [ Ссылка ]
Intro: (0:00)
Reinforcement Learning: (1:10)
Exploration & Exploitation: (2:00)
Model-Based Methods: (3:36)
Model-Free Methods: (4:26)
Temporal Difference Learning (estimating Q): (4:36)
Q-Learning: (6:16)
Q* at OpenAI?: (7:46)
Conclusion: (8:24)
Ещё видео!