📚 Course

Reinforcement Learning

VidInsights Curriculum Team · Updated 24 Jul 2026

Agents that learn from rewards: Q-learning, policy gradients, and deep RL with PPO.

📖 3 lessons 🎯 Advanced

Deep RL & PPO

Apply An introduction to Policy Gradient methods - Deep Reinforcement Learning 12m

Recommended

Reinforcement Learning: An Introduction, 2nd Ed Check price on Amazon AU →

RL Environments

Analyze Getting Started With OpenAI Gym 12m

RL Fundamentals

Understand Reinforcement Learning from Human Feedback (RLHF) Explained 12m

🧭 You might also like

AI Agents — ReAct & AutoGPT 3 lessons AI Cost Optimization 3 lessons AI Observability — LangSmith & Arize 3 lessons AI Red Teaming & Prompt Security 3 lessons AI Safety & Alignment 3 lessons Android Jetpack Compose Mastery 3 lessons Apache Airflow Deep Dive 3 lessons Apache Cassandra — Wide-Column Store 3 lessons Apache Druid — Real-Time OLAP 3 lessons Apache Flink — Real-Time Stream Processing 3 lessons