AgentGuard - Runlayer

AgentGuard is Runlayer’s purpose-built ML model for agent reasoning security. Where per-call tool scanners look at one input or output at a time, AgentGuard watches the agent’s trajectory across a session and detects when behavior diverges from the user’s intent. AgentGuard surfaces inside Sessions on each agent turn.

What AgentGuard detects

AgentGuard is built to catch agentic attacks that per-call scanners miss:

Output-steering injection — a tool output contains hidden instructions that redirect the agent
Sudden reasoning pivots — the agent’s reasoning abruptly changes direction with no basis in the conversation
Slow-chain drift — gradual, step-by-step divergence from the original task

It distinguishes legitimate behavior changes from genuine misalignment. A high-privilege operation that matches user intent is allowed; an agent autonomously taking the same action outside the requested task is not.

Configuration

AgentGuard is configured under Settings → Security Scanners → Session scanning.

Agent monitoring

The main toggle for AgentGuard.

Action	Behavior
Allow	Disable agent monitoring
Alert	Record detected threats; tool calls continue
Block	Block all subsequent tool calls in the session when a threat is detected

End session when any scanner blocks

A session-wide kill switch. When any per-call scanner returns BLOCK on a tool call, the rest of the agent session is also blocked.

Action	Behavior
Allow	A blocked tool call blocks only that call
Block	A blocked tool call ends the agent session

AgentGuard is configured globally. Per-connector overrides apply to catalog and per-call tool scanners, not to AgentGuard.

Rollout guidance

Start in Alert so you can see what AgentGuard would do without affecting agents in production.
Review Alerted sessions in Sessions and confirm the verdicts match your security policy.
Move Agent monitoring to Block for high-risk teams or agents first.
Enable End session when any scanner blocks once your per-call scanner tuning is stable.

Requirements

Sessions enabled for the workspace (Settings → General → Full session scanning)
A deployment with ToolGuard capabilities

If AgentGuard options do not appear in Security Scanners, confirm both above.

ToolGuard Models

Per-call tool scanners and configuration

Sessions

Where AgentGuard verdicts appear in the agent timeline

Policies

Restrict tools using access policies

Recent Security ThreatsRecent AI agent vulnerabilities that Runlayer ToolGuard protects against

⌘I

​What AgentGuard detects

​Configuration

​Agent monitoring

​End session when any scanner blocks

​Rollout guidance

​Requirements

​Related docs

ToolGuard Models

Sessions

Policies

What AgentGuard detects

Configuration

Agent monitoring

End session when any scanner blocks

Rollout guidance

Requirements

Related docs