Glossary · Term

MultiArith

Definition

A small benchmark dataset of multi-step arithmetic word problems, used to test math reasoning and commonly run as an out-of-distribution check on math-reasoning agent workflows.

Mentioned in 1 episode

  1. 013
    Why Search Keeps Rediscovering the Same Workflow, and What That Means