Definition
A hidden mechanism left in a system that lets someone bypass its normal protections.
A covert pathway intentionally embedded in software (or model weights) that allows an attacker or operator to bypass security or behavioral controls.
A hidden mechanism left in a system that lets someone bypass its normal protections.
A covert pathway intentionally embedded in software (or model weights) that allows an attacker or operator to bypass security or behavioral controls.