Definition
Weight exfiltration is the unauthorized copying of a trained model’s parameters out of the environment where they’re meant to be kept. It’s a top-tier security concern for frontier AI labs: the weights are the model, and anyone who holds a copy can run it outside the lab’s controls, which makes exfiltration the most direct way to bypass every other safeguard.