Definition
A benchmark for testing whether AI systems can write fast custom GPU code.
A benchmark of CUDA kernel generation tasks measuring whether generated kernels match or beat hand-written PyTorch references in correctness and speedup.
A benchmark for testing whether AI systems can write fast custom GPU code.
A benchmark of CUDA kernel generation tasks measuring whether generated kernels match or beat hand-written PyTorch references in correctness and speedup.