Secure bydesign,not by accident.

The Harness treats security as infrastructure, not an afterthought. Sandboxed execution, explicit review gates, content capture disabled by default, and a clear production checklist.

Security Architecture

The harness lives outside the sandbox.

Most AI agent frameworks place the orchestration layer — credentials, API keys, access policies — inside or alongside the execution environment. When the model can reach them through tool calls or environment variables, so can an attacker with a prompt injection. PURISTA takes the opposite approach: the Harness runs in your application process. Only typed tool inputs ever cross the process boundary. Secrets never enter the sandbox.

API keys and credentials stay in your application process — never injected into the sandbox environment.
Only typed, schema-validated tool inputs cross the boundary. No raw strings, no env vars, no secrets.
The sandbox handles execution only: LLM calls, file ops, and commands — nothing sensitive is visible there.
Even under successful prompt injection, the model cannot reach keys it was never given.

Security Model

Four pillars of safe AI

Sandboxed Execution

Every agent runs in a controlled sandbox with explicit resource limits and timeout boundaries. File operations, tool calls, and MCP servers all respect the same boundary.

inMemorySandbox() — file-only, no command execution
bashSandbox() — command execution through just-bash
Declared skills mount under /skills/<name> and are read on demand
Custom adapters for containers or remote execution
Explicit executor availability: unavailable vs available

Human-in-the-Loop

No mutation before approval. Review gates and harness governance approval providers block sensitive tools until a human or policy system approves. Full OTel trace for every decision.

Visible review questions with recommended answers
Immediate answer capture on selection
Idempotent final decision submission
Optional governance events for policy and approval audit
Stale run/review IDs fail with clear errors

Content Privacy

Content capture is disabled by default. Prompts and responses are not persisted in traces or logs unless explicitly enabled for local diagnostics.

contentCaptureMode: 'NO_CONTENT' by default
Enable only for deliberate local diagnostics
OTel spans include metadata, not content
Application owns data residency decisions

Type Safety & Validation

Every agent input, output, and tool call is schema-validated. No unvalidated data flows from the model to your application code.

Zod schemas for all inputs and outputs
TypeScript inference through the entire chain
Tool input/output validation with normalized errors
MCP schema validation before tool registration

Sandbox boundaryTools run through the selected isolation layer.

Sandbox

Controlled execution

Choose the right sandbox for your risk profile. File-only for safe agents. Command execution only where needed. Custom adapters for containers or remote isolation.

inMemorySandbox() — file operations, no commands
bashSandbox() — command execution through just-bash
Skills are mounted as files and the prompt lists only metadata plus /skills/<name>/SKILL.md
Custom Sandbox/SandboxSession for containers
Snapshot, resume, and hibernate capabilities opt-in
Preview ports and browser routing stay application-owned

Human-in-the-loopThe write path only opens after approval.

Review Gates

No mutations without approval

When an agent proposes a change, the workflow creates a ReviewRequest. The user sees questions, selects answers, and submits a decision. Only approved changes execute write tools.

Proposal agent drafts the change
Workflow creates ReviewRequest with questions
User selects answers in questionnaire UI
Decision submitted — approved, revise, or reject
Write tool executes only on approved
All steps traced and logged

Production Checklist

Eight decisions before go-live

Each item is a boundary. Get it right once, and every agent you build inherits the safety.

Durable StateStore

Use a durable adapter so sessions, run history, and OTel traces survive process restart. In-memory state is fine for local development. Production needs persistence.

Tenant-safe Session IDs

Define session IDs that include tenant boundaries. A session ID like tenant-a:user-42 prevents cross-tenant data leakage by design.

Explicit Timeout Budgets

Set runTimeoutMs, modelTimeoutMs, and toolTimeoutMs for each path. User-facing routes get smaller budgets. Background research gets larger ones.

Content Capture Off

Keep contentCaptureMode: 'NO_CONTENT' outside local diagnostics. Never log PII, secrets, or model prompts in production traces. OTel spans include metadata, not content.

Permission and Governance Gates

Use review gates, explicit permission checks, or opt-in harness governance before sensitive tools. No write should happen without workflow approval, service authorization, or an enforced tool policy.

Minimal Sandbox Permissions

Use inMemorySandbox() unless command execution is explicitly required. Bind skills explicitly, keep project discovery trusted-root gated, and never grant more than needed. The sandbox is an adapter — choose the right isolation level.

Test Failure Modes

Test provider failures, validation failures, cancellation mid-run, and graceful shutdown paths. The Harness provides test harnesses and scripted models for deterministic failure simulation.

Validate Tool Outputs

Never trust model-generated tool inputs. The Harness validates with Zod before execution, but your tool handler should also defend against edge cases. Return structured data. Respect signals.

Build AI that your team trusts.

The Harness gives you the structure, safety, and observability that enterprise AI requires — without the SaaS lock-in.

Get started→Explore Use Cases