Definition
Tool use is the model’s ability to call external functions (a calculator, a search engine, a code interpreter, an API) and use the results in its response. It’s what turns a chat model into something that can act in the world.
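To make the definition concrete, here is a minimal sketch of the loop behind tool use: the model emits a structured request, the host program runs the matching function, and the result is folded back into the reply. Everything named here is an assumption for illustration: `fake_model` is a stand-in for a real model call, the `{"tool": ..., "arguments": ...}` JSON shape is a made-up convention rather than any vendor's API, and `add` and `word_count` are hypothetical tools.

```python
import json
from typing import Callable, Dict


# Hypothetical tools; real systems would expose calculators, search, etc.
def add(a: float, b: float) -> float:
    return a + b


def word_count(text: str) -> int:
    return len(text.split())


# Registry mapping tool names to the functions the host is willing to run.
TOOLS: Dict[str, Callable] = {"add": add, "word_count": word_count}


def fake_model(prompt: str) -> str:
    """Stand-in for a model: always asks for the `add` tool (illustrative only)."""
    return json.dumps({"tool": "add", "arguments": {"a": 2, "b": 3}})


def run_tool_call(raw: str) -> str:
    """Parse the model's request, run the tool, and return the result as JSON."""
    call = json.loads(raw)
    name = call["tool"]
    if name not in TOOLS:
        # Unknown tools are reported back rather than executed.
        return json.dumps({"error": f"unknown tool: {name}"})
    result = TOOLS[name](**call["arguments"])
    return json.dumps({"tool": name, "result": result})


def answer(prompt: str) -> str:
    """One round trip: model requests a tool, host runs it, result shapes the reply."""
    raw_call = fake_model(prompt)
    observation = json.loads(run_tool_call(raw_call))
    return f"The result is {observation['result']}."


if __name__ == "__main__":
    print(answer("What is 2 + 3?"))
```

In this sketch the host, not the model, executes the function, and only tools present in the registry can run. That boundary is where the injection attacks covered in some of the papers below become relevant, since tool results re-enter the model's context as text.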
Episodes covering this
Worth reading next
Papers we haven't done a deep dive on yet, but would recommend on this topic.
- Search-o1: Agentic Search-Enhanced Large Reasoning Models
- AgentDojo: A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents
- Not What You've Signed Up For: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection
- ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs (the paper that introduces the ToolBench dataset)
- τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains
- AlphaProof and AlphaGeometry 2: AI achieves silver-medal standard solving International Mathematical Olympiad problems