Topic · 28 episodes across 6 reviews
Training and Reinforcement Learning for LLM Agents
A cluster of papers probed what RL actually does to agents — when it teaches genuinely new skills, how it silently fails, how a model could sabotage it, and how broken baselines distorted a whole subfield.