Definition
Deciding per-query whether to use chain-of-thought reasoning or a direct answer.
In EDRM, the per-query decision pattern using a 64-token entropy probe to select between Direct, Standard, and CoT decoding modes for each individual input.
Deciding per-query whether to use chain-of-thought reasoning or a direct answer.
In EDRM, the per-query decision pattern using a 64-token entropy probe to select between Direct, Standard, and CoT decoding modes for each individual input.