Glossary · Term

GitHub

← all terms

Definition

The dominant online home for code projects and where most open-source software lives.

The widely used hosted version-control and collaboration platform for software development; many agent benchmarks draw from real GitHub issues and pull requests.

Mentioned in 9 episodes

  1. 072
    A Robot Made Graphene Without Help, And Caught Itself Hallucinating
  2. 068
    The OS Trick That Makes Tree Search Practical for Coding Agents
  3. 061
    When Helpful Agents Go Sideways: A 404 Error, Campus Security, and Why Alignment Misses This
  4. 058
    Why Upgrading Your AI Auditor to a Smarter Model Can Make Your System Less Safe
  5. 047
    When Agent Benchmarks Lie: The Harness Problem in Open-Source AI
  6. 029
    Why Forty-Eight Percent on FrontierMath Isn't the Real Story in DeepMind's New Math Paper
  7. 012
    Why AI Coding Agents Keep Trying to Debug Without a Debugger
  8. 005
    Why a Debugger Designed for Humans Is the Wrong Tool for an AI Agent
  9. 003
    How to Pick the Best of Sixteen Coding Agent Rollouts

Related concepts