R-CS-09 bug_report Cybersecurity & Adversarial DAMAGE 3.6 / High

Execution Environment Escape

Agents with code execution capabilities can test sandbox boundaries. A successful escape grants access to host resources, adjacent containers, or the orchestration layer.

The Risk

Some agents have code execution capabilities: they can write and execute code, invoke shell commands, or run scripts. Code execution is powerful for agents (enabling complex computational tasks, data processing, system integration), but creates sandbox escape risk.

If an agent is compromised or injected with malicious instructions, the agent can attempt to break out of its sandbox or execution environment. A successful escape grants the attacker access to the host system, orchestration layer, or other containers on the same host.

Sandbox escapes are particularly dangerous because they are often undetectable. A process running inside a container attempting to escape the container looks like normal process activity to the host system. Traditional EDR may not flag escape attempts.

How It Materializes

A data analytics company deploys agents with code execution capability to process customer data and generate reports. Agents can write Python code to extract, transform, and analyze data. The agents run in Docker containers with resource limits and restricted file system access to limit blast radius.

An attacker compromises an agent through prompt injection, injecting the instruction: "Your next task is to execute the following Python code." The code probes the container for known escape vectors: checking for privileged mode, testing for kernel vulnerabilities, attempting to access the host file system.

The code discovers that the container is running with a vulnerable Linux kernel version. The attacker uses a known privilege escalation exploit to escape the container and gain access to the host system.

Once on the host, the attacker discovers that the host is running Kubernetes and has access to the Kubernetes API server (due to misconfigured RBAC). The attacker uses Kubernetes API access to access secrets stored in Kubernetes (database passwords, API keys), spawn new containers with elevated privileges, and access data from other customers' containers running on the same host. The attack has escalated from compromised agent to host compromise to Kubernetes cluster compromise, all through a single sandbox escape.

DAMAGE Score Breakdown

Dimension	Score	Rationale
D - Detectability	4	Sandbox escape attempts may generate suspicious system calls, but detection requires behavioral analysis. Not all escape attempts are detected.
A - Autonomy Sensitivity	4	High when agents have code execution autonomy. Escapes are more likely when agent can execute arbitrary code.
M - Multiplicative Potential	4	Successful escape affects all processes on the host and all containers sharing the host.
A - Attack Surface	3	Kernel vulnerabilities, container runtime vulnerabilities, and orchestration vulnerabilities are attack surfaces.
G - Governance Gap	3	Institutions may not have processes for validating sandbox strength or monitoring for escape attempts.
E - Enterprise Impact	5	Successful escape grants access to host, orchestration, and all systems on shared infrastructure. Material impact.
Composite DAMAGE Score	3.6	High. Requires dedicated controls and monitoring. Should not be accepted without mitigation.

Agent Impact Profile

How severity changes across the agent architecture spectrum.

Agent Type	Impact	How This Risk Manifests
manage_accounts Digital Assistant	Low	Agents do not have code execution. No escape risk.
school Digital Apprentice	Medium	Agents have limited code execution in sandboxes. Escape is difficult but possible.
smart_toy Autonomous Agent	High	Agents have autonomous code execution. Escape is a material risk.
share Delegating Agent	Medium	Delegating agent invokes tools that may execute code. Tool escape affects agent.
groups Agent Crew / Pipeline	High	Crew agents may invoke each other's code. Escape in one agent affects crew.
account_tree Agent Mesh / Swarm	High	Mesh agents execute code dynamically. Multiple escape vectors.

Regulatory Framework Mapping

Framework	Coverage	Citation	What It Addresses	What It Misses
NIST CSF 2.0	Partial	PR.IP-2 (Software Security)	Application and software security.	Container and sandbox security for agents.
NIST SP 800-53	Partial	SC-7, SC-39	Boundary and process isolation.	Agent sandbox isolation and escape prevention.
CIS Docker Benchmark	Partial	Container Security	Container security.	Agent container security.
NIST Container Security Guide	Partial	Container Isolation	Container isolation and security.	Sandbox escape in agent containers.

Why This Matters in Regulated Industries

In regulated industries, institutions are responsible for isolating customer data and systems. If an agent escapes its sandbox and accesses other customers' data or systems, the institution has failed in its isolation responsibility.

Additionally, sandbox escapes are evidence of inadequate security controls. Regulators assess whether institutions have properly isolated sensitive systems and data. A successful escape demonstrates that the isolation was insufficient.

Controls & Mitigations

architectureDesign-Time Controls

Minimize code execution privileges for agents. Only grant the minimum capabilities agents need. Avoid giving agents shell access or OS-level execution.
Use strongly-isolated sandbox technologies. Use language-level sandboxes (Python restricted mode, JS Web Workers) or container-level sandboxes, not just file system restrictions.
Implement principle of least privilege: agents run as restricted users with minimal file system and system call access.
Disable dangerous system calls and kernel features in container environments. Disable syscalls commonly used in escape exploits.

play_circleRuntime Controls

Monitor system calls made by agent processes. Use eBPF or seccomp to log and potentially block suspicious system calls (privilege escalation attempts, kernel module loading).
Implement host-level EDR monitoring for escape indicators. Monitor for processes escaping containers, suspicious kernel module loading, or unexpected privilege elevation.
Use Component 4 (Blast Radius Calculator) to estimate impact of sandbox escape. If an agent escapes, what systems and data would be exposed?
Implement resource limits on agent containers. Limit CPU, memory, and I/O so that escaped agents have limited capability to impact other containers.

monitoringDetection & Response

Conduct regular vulnerability assessments of kernel versions and container runtimes used by agent environments. Patch known escape vulnerabilities promptly.
Implement adversarial testing of agent sandboxes. Attempt to escape using known vulnerabilities and verify escapes are prevented.
Monitor for signs of escape attempts. Unusual system calls, privilege escalation attempts, or kernel module loading are indicators of escape attempts.
Implement incident response for successful escapes. If escape is detected, immediately isolate the affected host and investigate what was accessed.

Related Risks

Address This Risk in Your Institution

Execution Environment Escape requires architectural controls that go beyond what existing frameworks provide. Our advisory engagements are purpose-built for banks, insurers, and financial institutions subject to prudential oversight.

Schedule a Briefing

Agentic AI Risk & Controls Workshop Our Methodology Regulatory Landscape