Glossary · Term

AlphaGo

← all terms

Definition

DeepMind's Go-playing AI that beat human champions and inspired a lot of later RL work.

DeepMind's neural-net-plus-MCTS system that defeated top human Go players and became a reference point for the claim that RL discovers novel strategies.

Mentioned in 2 episodes

  1. 026
    What RL Actually Does to Language Models, at the Token Level
  2. 013
    Why Search Keeps Rediscovering the Same Workflow, and What That Means

Related concepts