Course content Foundations of Reinforcement Learning Markov Decision Processes and Value Functions Bellman Equations and Dynamic Programming Model-Free Learning Function Approximation Introduction to Deep RL and DQN Policy Gradients: REINFORCE and Actor-Critic Proximal Policy Optimization Published on Apr 25, 2026 Comments Share Copy link Share to X Share to Facebook Share to Linkedin Copied