What Changed
- UK AI Security Institute introduces Sandbox Escape Bench, the first benchmark to systematically evaluate whether AI agents can break out of their sandboxes, and some early results.
No matching stories or pages.
Policy & Safety · UK AI Security Institute
UK AI Security Institute introduces Sandbox Escape Bench, the first benchmark to systematically evaluate whether AI agents can break out of their sandboxes, and some early results.