"Vox Security Model"

Vox Security Model

The Vox security model (SecurityPolicy, SecurityGuard, AuditLog) is defined in vox-orchestrator and provides multi-layer protection against prompt injection, scope violations, and unauthorized access.

Threat Model

Threat	Mitigation
Prompt injection	`prompt_canonical::is_safe_prompt()` using injection pattern detection
Scope violations	`ChildSpec.scope[]` controls which files an agent may access
Token budget abuse	`BudgetManager` with per-agent cost limits and alerts
Unauthorized requests	API key or Bearer token validation in `vox-runtime::auth`
Replay attacks	Request IDs and timestamp validation

SecurityPolicy

#![allow(unused)]
fn main() {
pub struct SecurityPolicy {
    pub allow_shell_execution: bool,
    pub allow_network_access: bool,
    pub max_file_size_bytes: u64,
    pub blocked_paths: Vec<String>,
    pub require_human_in_loop: bool,
}
}

SecurityGuard

Every MCP tool call passes through SecurityGuard::evaluate():

Check for prompt injection patterns
Check scope constraints (if agent has a scope declaration)
Check rate limits (RateLimiter)
Log to AuditLog

Injection Detection

The submit_task tool uses is_safe_prompt() from vox-runtime::prompt_canonical. If an injection is detected:

The task is rejected with a 422 status
An AgentEventKind::InjectionDetected event is emitted on the event bus
The rejection is logged to the audit log

Detection Patterns

"Ignore previous instructions"
"You are now" context switching
Shell metacharacters in description fields
SQL-style injections in parameter values

Agent Scope Enforcement

Agents declared in .vox/agents/{name}.md can have a scope: field (parsed by vox-repository for scope enforcement):

---
scope: ["crates/vox-parser/**", "tests/**"]
---

Tasks that reference files outside the scope are rejected before being enqueued.

Rate Limiting

Per-agent token rate limiting is configurable via RateLimiter:

[rate_limit]
max_requests_per_minute = 60
max_tokens_per_minute = 100000

Audit Log

All rejected requests, scope violations, and injection attempts are appended to logs/audit.jsonl:

{"timestamp": "...", "event": "InjectionDetected", "agent": "...", "description": "..."}

Vox: The AI-Native Programming Language