Definition
The current version of a benchmark of small visual reasoning puzzles designed to be easy for humans and hard for AI.
The second iteration of the Abstraction and Reasoning Corpus benchmark, a harder set of grid-based abstract reasoning tasks used to probe general intelligence in AI systems.