Policy & Safety · UK AI Security Institute
Can AI agents escape their sandboxes? A benchmark for safely measuring container breakout capabilities
UK AI Security Institute introduces Sandbox Escape Bench, the first benchmark to systematically evaluate whether AI agents can break out of their sandboxes, and some early results.