Definition
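pass@k is the probability that at least one of k sampled solutions to a problem is correct, where "correct" in code-generation benchmarks usually means passing the problem's unit tests. It is used pervasively in those benchmarks. The headline numbers (pass@1, pass@10) capture different things: pass@1 asks “does the model get it right on a single attempt,” while pass@k for larger k asks “can the model produce a correct solution at all within k attempts,” which is closer to “does the model know it.”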
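To make the definition concrete, here is a minimal sketch of how pass@k is commonly estimated for a single problem. It assumes the usual setup of drawing n samples, counting the c that are correct, and computing 1 - C(n-c, k) / C(n, k), the unbiased estimator popularized alongside the HumanEval benchmark. The function name `pass_at_k` and its argument names are illustrative, not taken from any particular library.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem (illustrative helper, hypothetical name).

    n: total number of samples drawn for the problem
    c: number of those samples that were correct
    k: number of attempts the metric allows (must satisfy 1 <= k <= n)
    """
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    # Probability that a random size-k subset of the n samples contains
    # no correct sample is C(n - c, k) / C(n, k); pass@k is its complement.
    # math.comb returns 0 when k > n - c, so that case yields 1.0.
    return 1.0 - comb(n - c, k) / comb(n, k)


if __name__ == "__main__":
    # Assumed example numbers: 10 samples, 3 of them correct.
    for k in (1, 5, 10):
        print(f"pass@{k} = {pass_at_k(10, 3, k):.3f}")
```

A benchmark score would then be the average of this per-problem value over all problems. With k = 1 the estimate reduces to c / n, the fraction of samples that were correct, which is why pass@1 reads as single-attempt accuracy.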
Episodes covering this
Worth reading next
Papers we haven't done a deep dive on yet, but would recommend on this topic.