Security Model

How it works

Source code goes through five stages:

Parse -- ast.parse() produces an AST
Validate -- the rewriter rejects unrecognized AST nodes (fail-closed)
Rewrite -- attribute access, imports, loops, and function/class definitions are transformed to route through gate functions
Compile -- the rewritten AST is compiled to bytecode
Execute -- bytecode runs with restricted __builtins__ and gate functions in the namespace

Gate functions

The rewriter injects calls to these internal functions:

Gate	Purpose
`__st_getattr__`	Policy-checked attribute read (`obj.attr`)
`__st_setattr__`	Policy-checked attribute write (`obj.attr = x`)
`__st_delattr__`	Policy-checked attribute delete (`del obj.attr`)
`__st_import__`	Module import (`import x`)
`__st_importfrom__`	From-import (`from x import y`)
`__st_checkpoint__`	Timeout, tick limit, memory, and cancellation check
`__st_defun__`	Function definition wrapping (wrapped mode)
`__st_defclass__`	Class definition wrapping (wrapped mode)

All obj.attr access in sandboxed code -- including in f-strings and augmented assignments -- goes through the getattr gate.

Builtins whitelist

Sandboxed code gets a restricted __builtins__ (frozen via _FrozenBuiltins, a read-only dict subclass). Access to __builtins__ itself is blocked at the AST level — sandboxed code cannot reference it. The builtins it contains:

Available: abs, all, any, ascii, bin, bool, bytearray, bytes, callable, chr, classmethod, complex, dict, divmod, enumerate, filter, float, format, frozenset, getattr (policy-gated), hasattr (policy-gated), hash, hex, id, int, isinstance, issubclass, iter, len, list, locals, map, max, min, next, object, oct, ord, pow, property, range, repr, reversed, round, set, slice, sorted, staticmethod, str, sum, super, tuple, type (single-arg only), zip, plus ~40 exception types.

Not available: exec, eval, compile, __import__, globals, vars, open (unless filesystem provided), dir, help, breakpoint, exit, quit, input, memoryview.

getattr() and hasattr() are routed through the attribute policy -- they respect the same allow/deny rules as obj.attr syntax.

Not available as names: BaseException, KeyboardInterrupt, GeneratorExit, SystemExit.

What's blocked

Arbitrary imports -- only policy-registered modules and VFS files
Private attributes -- _name and __dunder__ (except allowed dunders) blocked by default
Network I/O -- socket operations blocked unless allow_network=True
File I/O -- routes through VFS when filesystem provided, otherwise open unavailable
type(name, bases, dict) -- three-arg form blocked (prevents dynamic class creation outside the rewriter)
str.format traversal -- "{0.__class__}".format(obj) blocked
__builtins__ access -- blocked at the AST level; sandboxed code cannot read __builtins__
__st_* names -- reserved namespace rejected at validation time
globals() -- not available
Bare except: -- automatically rewritten to except Exception:. Without this, sandboxed code could swallow BaseException subclasses that the sandbox relies on for control flow (StTimeout, StCancelled, KeyboardInterrupt, SystemExit), defeating timeouts and cancellation. This is a deliberate semantic change -- Python's bare except: normally catches everything, but in the sandbox it only catches Exception and below. No warning is emitted; the rewrite is silent and unconditional

Checkpoint enforcement

Checkpoints are injected at:

Start of every loop body (for, while)
Start of every function/method body
Every comprehension iteration ([x for x in ...])
Every call to a non-type builtin function (len, sorted, sum, etc.)

Type builtins (str, int, dict, range, etc.) do not fire checkpoints -- they are real types so that library code receiving them (e.g. df.astype(str)) works correctly. This means a single type construction like list(range(10**8)) won't checkpoint before allocating. In practice this is not a gap: the allocation is a single C-level call that no per-call checkpoint could interrupt mid-flight, and the memory limit is enforced at the next checkpoint.

Each checkpoint increments the tick counter and checks: cancellation flag, tick limit, wall-clock timeout, and memory limit (in that order).

Memory limits

When Policy.memory_limit is set (in MB), two layers of enforcement apply:

RLIMIT_AS (Linux only) -- kernel-enforced virtual address space cap. Set via setrlimit for the duration of the sandbox execution. The kernel refuses allocations beyond current_RSS + limit, raising MemoryError. This catches single large allocations that happen between checkpoints. This is process-wide -- concurrent sandboxes share the limit.
Checkpoint-based detection -- at each checkpoint, peak RSS (ru_maxrss) is compared against the baseline. If it exceeds the limit, MemoryError is raised. This works on both Linux and macOS but only fires at checkpoint boundaries.

macOS does not support RLIMIT_AS (setrlimit returns EINVAL), so only checkpoint-based detection is available. Windows lacks the resource module entirely -- memory limits are a no-op.

Threat model

sandtrap is a "walled garden" -- it controls what sandboxed code can access, not what the Python runtime can do. It is designed to prevent accidental or casual misuse by LLM-generated code.

In-process (`isolation="none"`)

The default mode runs in-process and shares the host's memory space. It is not a security boundary against a determined attacker with full CPython knowledge. For hard isolation, use isolation="kernel".

Subprocess (`isolation="process"` / `isolation="kernel"`)

isolation="kernel" adds a process boundary with kernel-level restrictions:

Filesystem -- Landlock (Linux) or Seatbelt (macOS) restricts access to the IsolatedFS root directory
Syscalls -- seccomp (Linux) or Seatbelt (macOS) blocks process spawning, and blocks network when the policy doesn't need it
Process isolation -- crashes, OOM, or segfaults in the child don't affect the host

Kernel restrictions are applied once at worker startup and cannot be loosened afterward (seccomp/Landlock are strictly monotonic; Seatbelt is one-shot). If any part of the policy grants network_access=True or host_fs_access=True to any registration, the corresponding kernel restriction is completely off for the entire worker process. In that case, only the Python-level ContextVar gating enforces per-callable access control. See process.md for details.

isolation="process" provides crash protection without kernel restrictions.

Both are a meaningful security improvement over isolation="none", but kernel isolation is best-effort -- it degrades gracefully when platform features are unavailable (warnings are emitted).

What's defended

Attribute traversal attacks (MRO walking, __subclasses__(), __globals__)
Dynamic class creation via type(name, bases, dict)
Import of unregistered modules
str.format field traversal ({0.__class__})
Swallowing control exceptions via bare except:
Infinite loops and runaway computation (tick limits, timeouts)
Unauthorized file and network I/O
Process spawning and filesystem escape (isolation="kernel", kernel-enforced)

What's out of scope

These vectors are not defended against by the Python-level policy. isolation="kernel" mitigates some of them via kernel enforcement, but they remain concerns for in-process execution:

C extensions -- a registered module with C code can do anything (call ctypes, access raw memory, spawn processes). Only register modules you trust. With isolation="kernel", seccomp blocks execve and Landlock/Seatbelt restricts filesystem access, limiting the blast radius.
ctypes / cffi -- if registered, these provide unrestricted access to the C layer. Never register them.
gc.get_objects() -- if the gc module is registered, sandboxed code can enumerate all live Python objects. Don't register gc.
Signal handlers -- signal module access would allow overriding the host's signal handling. With process isolation, this only affects the child process.
Shared mutable state -- objects passed into the sandbox namespace are not copied. Sandboxed code can mutate them in place. With process isolation (isolation="process" or "kernel"), namespaces are serialized across the process boundary, so mutations don't propagate back to the host.
Side channels -- timing attacks, cache probing, and other side channels are not mitigated.
CPython internals -- bytecode manipulation, sys._getframe(), ctypes.pythonapi, and other CPython-specific escape hatches are blocked by the attribute gate and builtins whitelist, but novel CPython exploits may bypass AST-level controls.

Process-global patches

Filesystem interception is provided by monkeyfs, which monkey-patches builtins.open, os.stat, os.path.exists, and 20+ other stdlib functions at the process level. Network interception patches socket.socket.connect etc. similarly. Patches are installed once on first use and remain active permanently. They dispatch via ContextVar -- when no sandbox is executing, all calls fall through to the original functions transparently. This is necessary so that registered libraries (e.g., pd.read_csv) see the virtual filesystem during sandbox execution.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Security

docs/security.md

Security Model

How it works

Gate functions

Builtins whitelist

What's blocked

Checkpoint enforcement

Memory limits

Threat model

In-process (`isolation="none"`)

Subprocess (`isolation="process"` / `isolation="kernel"`)

What's defended

What's out of scope

Process-global patches

There aren’t any published security advisories

Security: ashenfad/sandtrap

Security

docs/security.md

Security Model

How it works

Gate functions

Builtins whitelist

What's blocked

Checkpoint enforcement

Memory limits

Threat model

In-process (isolation="none")

Subprocess (isolation="process" / isolation="kernel")

What's defended

What's out of scope

Process-global patches

There aren’t any published security advisories

In-process (`isolation="none"`)

Subprocess (`isolation="process"` / `isolation="kernel"`)