Archive

Reinforcement Learning Course

Proximal Policy Optimization

RL Part 8: Trust regions, the clipped surrogate, and the workhorse of modern RL.

Jun 14

Reinforcement Learning Course

Policy Gradients: REINFORCE and Actor-Critic

RL Part 7: Learning the policy directly, from REINFORCE to actor-critic.

Jun 7

Policy Gradients: REINFORCE and Actor-Critic

Reinforcement Learning Course

Introduction to Deep RL and DQN

RL Part 6: From linear features to neural networks, and the engineering choices that makes deep value-based RL possible.

May 31

Reinforcement Learning Course

Function Approximation

RL Part 5: From tables to parameterized value functions.

May 24

Memory

[Hands-on] Agent memory is only as good as its schema

A deep dive on building production-grade memory for Agents.

May 23

[Hands-on] Agent memory is only as good as its schema

Reinforcement Learning Course

Model-Free Learning

RL Part 4: Learning value functions and policies without a model. Monte Carlo methods, TD(0), SARSA, Q-learning, and the bias-variance bridge between them.

May 17

Agents

Hermes Agent Masterclass

Everything you need to understand and customize Hermes Agent.

May 14

LLMs

Speculative Decoding in LLMs

...explained with code and tradeoffs.

May 13