harness · Glossary · AI Papers: A Deep Dive

Definition

Plain language

The wrapper of code, tools, and prompts around an AI agent that shapes how it actually behaves.

As stated in the literature

The surrounding software stack for an LLM agent — tool definitions, parsers, system prompts, scratchpad conventions, and orchestration logic — that determines in-context behavior beyond raw model weights.
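To make the listed components concrete, here is a minimal sketch of a harness in Python. Everything in it is hypothetical and chosen only for illustration: the `Harness` class, the JSON tool-call convention, the `Observation:` scratchpad format, and `toy_model`, a scripted stand-in for a real LLM. It is not taken from any particular paper or framework.

```python
import json
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional

# Hypothetical stand-in for an LLM: any function from prompt text to reply text.
Model = Callable[[str], str]


@dataclass
class Harness:
    system_prompt: str                      # instructions shown on every step
    tools: Dict[str, Callable[..., str]]    # tool definitions, keyed by name
    max_steps: int = 5
    scratchpad: List[str] = field(default_factory=list)

    def build_prompt(self, task: str) -> str:
        # Scratchpad convention (assumed): prior observations are appended as lines.
        return "\n".join([self.system_prompt, f"Task: {task}", *self.scratchpad])

    def parse(self, reply: str) -> Optional[dict]:
        # Parser (assumed): a reply that is a JSON object with a "tool" key is a tool call.
        try:
            call = json.loads(reply)
        except json.JSONDecodeError:
            return None
        return call if isinstance(call, dict) and "tool" in call else None

    def run(self, model: Model, task: str) -> str:
        # Orchestration: loop until the model replies with something that is not a tool call.
        reply = ""
        for _ in range(self.max_steps):
            reply = model(self.build_prompt(task))
            call = self.parse(reply)
            if call is None:
                return reply
            result = self.tools[call["tool"]](**call.get("args", {}))
            self.scratchpad.append(f"Observation: {result}")
        return reply


def toy_model(prompt: str) -> str:
    # Scripted behavior: call the tool once, then answer from the observation.
    if "Observation:" not in prompt:
        return json.dumps({"tool": "add", "args": {"a": 2, "b": 3}})
    return "The answer is " + prompt.rsplit("Observation: ", 1)[1]


harness = Harness(
    system_prompt="You may call tools by replying with JSON.",
    tools={"add": lambda a, b: str(a + b)},
)
answer = harness.run(toy_model, "What is 2 + 3?")
print(answer)
```

The model function never changes in this sketch; the prompt layout, the parser, and the loop all live in the harness, which is the sense in which the harness shapes behavior beyond the weights.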

Also called: agent harness, harnesses

Why it matters: Much of what looks like model capability can come from harness engineering, which is why fair agent comparisons require holding the harness fixed.

For example, two papers might use the same base model but report very different agent scores because their harnesses parse tool calls and format prompts differently.
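The effect of parsing differences can be sketched in a few lines of Python. The reply text and both parsers below are invented for illustration, assuming a setup where the model emits tool calls as JSON: the same raw reply counts as a valid tool call under one harness and as a failure under the other.

```python
import json
import re
from typing import Optional

# Hypothetical raw model reply: a JSON tool call preceded by a sentence of prose.
RAW_REPLY = 'I will look that up. {"tool": "search", "args": {"q": "harness"}}'


def strict_parse(reply: str) -> Optional[dict]:
    # Harness A (assumed): the entire reply must be a JSON object.
    try:
        call = json.loads(reply)
    except json.JSONDecodeError:
        return None
    return call if isinstance(call, dict) else None


def lenient_parse(reply: str) -> Optional[dict]:
    # Harness B (assumed): extract the span from the first "{" to the last "}" and parse it.
    match = re.search(r"\{.*\}", reply, re.DOTALL)
    if match is None:
        return None
    try:
        call = json.loads(match.group(0))
    except json.JSONDecodeError:
        return None
    return call if isinstance(call, dict) else None


strict_call = strict_parse(RAW_REPLY)    # rejected: the reply is not pure JSON
lenient_call = lenient_parse(RAW_REPLY)  # accepted: the embedded JSON is recovered

print("strict harness sees a tool call:", strict_call is not None)
print("lenient harness sees a tool call:", lenient_call is not None)
```

If a benchmark scores a step as failed whenever no tool call is recognized, these two harnesses would report different results for identical model output.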

Heard on the show

“So they built a real harness — an actual client and server talking to each other — and for each of the eight attacks they recorded four yes-or-no facts.”

Episode 208 — The Blank Space in Your AI Approval Box That Isn't Empty

Mentioned in 48 episodes

Related terms

agent, scratchpad, system prompt, weights