Security

The Security system evaluates agent actions for potential risks before execution. It provides pluggable security analyzers that assess action risk levels, and confirmation policies that decide from those levels whether user approval is required.

Source: openhands-sdk/openhands/sdk/security/

Core Responsibilities

The Security system has four primary responsibilities:

Risk Assessment - Capture and validate LLM-provided risk levels for actions
Confirmation Policy - Determine when user approval is required based on risk
Action Validation - Enforce security policies before execution
Audit Trail - Record security decisions in event history

Architecture

Key Components

Component	Purpose	Design
`SecurityAnalyzerBase`	Abstract interface	Defines `security_risk()` contract
`LLMSecurityAnalyzer`	Inline risk assessment	Returns LLM-provided risk from action arguments
`NoOpSecurityAnalyzer`	Passthrough analyzer	Always returns UNKNOWN
`SecurityRisk`	Risk enum	LOW, MEDIUM, HIGH, UNKNOWN
`ConfirmationPolicy`	Decision logic	Maps risk levels to confirmation requirements

Risk Levels

Security analyzers return one of four risk levels: LOW, MEDIUM, HIGH, or UNKNOWN.

Risk Level Definitions

Level	Characteristics	Examples
LOW	Read-only, no state changes	File reading, directory listing, search
MEDIUM	Modifies user data	File editing, creating files, API calls
HIGH	Dangerous operations	File deletion, system commands, privilege escalation
UNKNOWN	Not analyzed or indeterminate	Complex commands, ambiguous operations
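As a rough illustration, the four levels can be modeled as a string-valued enum. This is a standalone sketch, not the SDK's own `SecurityRisk` definition. The `parse_risk` helper and its fallback to UNKNOWN for unrecognized input are assumptions made for this example.

```python
from enum import Enum


class SecurityRisk(str, Enum):
    """Illustrative stand-in for the SDK's risk enum."""

    LOW = "LOW"
    MEDIUM = "MEDIUM"
    HIGH = "HIGH"
    UNKNOWN = "UNKNOWN"


def parse_risk(value: object) -> SecurityRisk:
    """Hypothetical helper: map raw input to a level, defaulting to UNKNOWN."""
    if isinstance(value, str):
        try:
            return SecurityRisk(value.strip().upper())
        except ValueError:
            pass  # not one of the four known levels
    # Assumption: anything missing or unrecognized is treated as indeterminate.
    return SecurityRisk.UNKNOWN
```

Treating unparseable input as UNKNOWN rather than LOW keeps the decision with the confirmation policy instead of silently allowing the action.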

Security Analyzers

LLMSecurityAnalyzer

Leverages the LLM’s inline risk assessment during action generation.

Analysis Process:

Schema Enhancement: A required security_risk parameter is added to each tool’s schema
LLM Generation: The LLM generates tool calls with security_risk as part of the arguments
Risk Extraction: The agent extracts the security_risk value from the tool call arguments
ActionEvent Creation: The security risk is stored on the ActionEvent
Analyzer Query: LLMSecurityAnalyzer.security_risk() returns the pre-assigned risk level
No Additional LLM Calls: Risk assessment happens inline—no separate analysis step
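The schema enhancement step above might look roughly like the following. This is a sketch under stated assumptions: `add_security_risk` is a hypothetical helper (not an SDK function), the tool schema is assumed to be a JSON-Schema-style dict, and whether UNKNOWN is offered to the LLM as a choice is a guess.

```python
import copy
from typing import Any

# Assumption: the LLM may pick any of the four documented levels.
RISK_LEVELS = ["LOW", "MEDIUM", "HIGH", "UNKNOWN"]


def add_security_risk(schema: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of a tool parameter schema with a required security_risk field."""
    enhanced = copy.deepcopy(schema)  # leave the caller's schema untouched
    properties = enhanced.setdefault("properties", {})
    properties["security_risk"] = {
        "type": "string",
        "enum": RISK_LEVELS,
        "description": "Risk level of this action, as judged by the LLM.",
    }
    required = enhanced.setdefault("required", [])
    if "security_risk" not in required:
        required.append("security_risk")
    return enhanced


bash_schema = {
    "type": "object",
    "properties": {"command": {"type": "string"}},
    "required": ["command"],
}
enhanced_schema = add_security_risk(bash_schema)
```

Which tools receive the extra field is left to the caller here; the sketch only shows the shape of the modification.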

Example Tool Call:

{
  "name": "execute_bash",
  "arguments": {
    "command": "rm -rf /tmp/cache",
    "security_risk": "HIGH"
  }
}

The LLM reasons about risk in context when generating the action, eliminating the need for a separate security analysis call.

Configuration:

Enabled When: An LLMSecurityAnalyzer is configured for the agent
Schema Modification: Automatically adds security_risk field to non-read-only tools
Zero Overhead: No additional LLM calls or latency beyond normal action generation
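To make the extraction and query steps concrete, here is a minimal sketch that parses the example tool call shown earlier, stores the risk on an event, and has the analyzer return it. The `ActionEvent` dataclass is a simplified stand-in, `action_from_tool_call` is a hypothetical helper, and defaulting to UNKNOWN when the field is absent is an assumption.

```python
import json
from dataclasses import dataclass
from enum import Enum


class SecurityRisk(str, Enum):
    LOW = "LOW"
    MEDIUM = "MEDIUM"
    HIGH = "HIGH"
    UNKNOWN = "UNKNOWN"


@dataclass
class ActionEvent:
    """Simplified stand-in; the real event carries more fields."""

    tool_name: str
    arguments: dict
    security_risk: SecurityRisk = SecurityRisk.UNKNOWN


class LLMSecurityAnalyzer:
    """Sketch: returns the risk already stored on the event, with no LLM call."""

    def security_risk(self, action: ActionEvent) -> SecurityRisk:
        return action.security_risk


def action_from_tool_call(raw: str) -> ActionEvent:
    """Hypothetical helper: split security_risk from the tool's own arguments."""
    call = json.loads(raw)
    arguments = dict(call["arguments"])
    # Assumption: a missing field is treated as UNKNOWN.
    risk = SecurityRisk(arguments.pop("security_risk", "UNKNOWN"))
    return ActionEvent(call["name"], arguments, risk)


TOOL_CALL = (
    '{"name": "execute_bash", "arguments": '
    '{"command": "rm -rf /tmp/cache", "security_risk": "HIGH"}}'
)
event = action_from_tool_call(TOOL_CALL)
risk = LLMSecurityAnalyzer().security_risk(event)
```

In this sketch the tool itself never sees `security_risk`; only the remaining arguments are passed on for execution.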

NoOpSecurityAnalyzer

Passthrough analyzer that skips analysis.

Use Case: Development, trusted environments, or when confirmation mode handles all actions
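A passthrough analyzer of this kind could be sketched as follows. The class names mirror the components listed earlier, but the bodies are illustrative only, and the untyped `action` parameter is an assumption about the `security_risk()` contract.

```python
from abc import ABC, abstractmethod
from enum import Enum


class SecurityRisk(str, Enum):
    LOW = "LOW"
    MEDIUM = "MEDIUM"
    HIGH = "HIGH"
    UNKNOWN = "UNKNOWN"


class SecurityAnalyzerBase(ABC):
    """Sketch of the abstract interface: one method, one risk level out."""

    @abstractmethod
    def security_risk(self, action: object) -> SecurityRisk:
        """Return the risk level for the given action."""


class NoOpSecurityAnalyzer(SecurityAnalyzerBase):
    """Skips analysis entirely and reports every action as UNKNOWN."""

    def security_risk(self, action: object) -> SecurityRisk:
        return SecurityRisk.UNKNOWN
```

Because every action comes back as UNKNOWN, the outcome depends entirely on the confirmation policy: with `confirm_unknown=True`, ConfirmRisky would ask for approval on every action.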

Confirmation Policy

The confirmation policy determines when user approval is required. There are three policy implementations.

Source: confirmation_policy.py

Policy Types

Policy	Behavior	Use Case
`AlwaysConfirm`	Requires confirmation for all actions	Maximum safety, interactive workflows
`NeverConfirm`	Never requires confirmation	Fully autonomous agents, trusted environments
`ConfirmRisky`	Configurable risk-based policy	Balanced approach, production use
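The two unconditional policies are simple enough to sketch in a few lines. The method name `should_confirm` and the plain-string risk argument are assumptions made for this example, not a statement of the SDK's interface.

```python
from abc import ABC, abstractmethod


class ConfirmationPolicy(ABC):
    """Sketch of the decision interface: risk level in, yes/no out."""

    @abstractmethod
    def should_confirm(self, risk: str) -> bool:
        """Return True when the user must approve the action."""


class AlwaysConfirm(ConfirmationPolicy):
    def should_confirm(self, risk: str) -> bool:
        return True  # every action waits for approval


class NeverConfirm(ConfirmationPolicy):
    def should_confirm(self, risk: str) -> bool:
        return False  # no action waits for approval


LEVELS = ["LOW", "MEDIUM", "HIGH", "UNKNOWN"]
always = [AlwaysConfirm().should_confirm(level) for level in LEVELS]
never = [NeverConfirm().should_confirm(level) for level in LEVELS]
```

Neither policy looks at the risk level, so the analyzer's output only matters once a risk-based policy such as ConfirmRisky is in use.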

ConfirmRisky (Default Policy)

The most flexible policy, with a configurable threshold.

Configuration:

threshold (default: HIGH) - Risk level at or above which confirmation is required
  - Cannot be set to UNKNOWN
  - Uses reflexive comparison: risk.is_riskier(threshold) returns True if risk >= threshold
confirm_unknown (default: True) - Whether UNKNOWN risk requires confirmation

Confirmation Rules by Policy

ConfirmRisky with threshold=HIGH (Default)

Risk Level	`confirm_unknown=True` (default)	`confirm_unknown=False`
LOW	✅ Allow	✅ Allow
MEDIUM	✅ Allow	✅ Allow
HIGH	🔒 Require confirmation	🔒 Require confirmation
UNKNOWN	🔒 Require confirmation	✅ Allow

ConfirmRisky with threshold=MEDIUM

Risk Level	`confirm_unknown=True`	`confirm_unknown=False`
LOW	✅ Allow	✅ Allow
MEDIUM	🔒 Require confirmation	🔒 Require confirmation
HIGH	🔒 Require confirmation	🔒 Require confirmation
UNKNOWN	🔒 Require confirmation	✅ Allow

ConfirmRisky with threshold=LOW

Risk Level	`confirm_unknown=True`	`confirm_unknown=False`
LOW	🔒 Require confirmation	🔒 Require confirmation
MEDIUM	🔒 Require confirmation	🔒 Require confirmation
HIGH	🔒 Require confirmation	🔒 Require confirmation
UNKNOWN	🔒 Require confirmation	✅ Allow

Key Rules:

Risk comparison is reflexive: HIGH.is_riskier(HIGH) returns True
UNKNOWN handling is configurable via confirm_unknown flag
Threshold cannot be UNKNOWN - validated at policy creation time
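The rules and tables above can be reproduced with a short sketch. It is written from the documented behavior only: the numeric ordering, the `should_confirm` method name, and raising `ValueError` (both for an UNKNOWN threshold and for ordering UNKNOWN against other levels) are assumptions, not the SDK's actual implementation.

```python
from dataclasses import dataclass
from enum import Enum


class SecurityRisk(str, Enum):
    LOW = "LOW"
    MEDIUM = "MEDIUM"
    HIGH = "HIGH"
    UNKNOWN = "UNKNOWN"

    def is_riskier(self, other: "SecurityRisk") -> bool:
        """Reflexive comparison: True when self >= other."""
        order = {SecurityRisk.LOW: 1, SecurityRisk.MEDIUM: 2, SecurityRisk.HIGH: 3}
        if self not in order or other not in order:
            # Assumption: UNKNOWN has no position in the ordering.
            raise ValueError("UNKNOWN cannot be ordered against other levels")
        return order[self] >= order[other]


@dataclass(frozen=True)
class ConfirmRisky:
    threshold: SecurityRisk = SecurityRisk.HIGH
    confirm_unknown: bool = True

    def __post_init__(self) -> None:
        # Validated at policy creation time, as the rules above require.
        if self.threshold is SecurityRisk.UNKNOWN:
            raise ValueError("threshold cannot be UNKNOWN")

    def should_confirm(self, risk: SecurityRisk) -> bool:
        if risk is SecurityRisk.UNKNOWN:
            return self.confirm_unknown  # handled by the flag, not the threshold
        return risk.is_riskier(self.threshold)


policy = ConfirmRisky()  # threshold=HIGH, confirm_unknown=True
decisions = {level.value: policy.should_confirm(level) for level in SecurityRisk}
```

Handling UNKNOWN before the threshold comparison is what lets `confirm_unknown` act independently of `threshold`, matching the last row of each table.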

Component Relationships

Relationship Characteristics:

Agent → Security: Validates actions before execution
Security → Tools: Examines tool characteristics (annotations)
Security → MCP: Uses MCP hints for risk assessment
Conversation → Agent: Pauses for user confirmation when required
Optional Component: Security analyzer can be disabled for trusted environments
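Put together, the flow between these components might be sketched as a single gate that the agent calls before executing an action. Everything here is hypothetical: `gate_action`, the callable-based wiring, the decision strings, and the dict-based history are illustrative choices, not the SDK's event model.

```python
from typing import Callable


def gate_action(
    action: dict,
    analyze: Callable[[dict], str],
    should_confirm: Callable[[str], bool],
    history: list[dict],
) -> str:
    """Assess an action, decide whether to pause, and record the decision."""
    risk = analyze(action)
    decision = "await_confirmation" if should_confirm(risk) else "execute"
    # Audit trail: keep the assessed risk next to the outcome.
    history.append({"action": action["name"], "risk": risk, "decision": decision})
    return decision


def read_llm_risk(action: dict) -> str:
    """Stand-in analyzer: use the LLM-provided value, else UNKNOWN."""
    return action.get("security_risk", "UNKNOWN")


def confirm_high_or_unknown(risk: str) -> bool:
    """Stand-in policy: mirrors threshold=HIGH with confirm_unknown=True."""
    return risk in ("HIGH", "UNKNOWN")


history: list[dict] = []
first = gate_action(
    {"name": "execute_bash", "security_risk": "HIGH"},
    read_llm_risk,
    confirm_high_or_unknown,
    history,
)
second = gate_action(
    {"name": "read_file", "security_risk": "LOW"},
    read_llm_risk,
    confirm_high_or_unknown,
    history,
)
```

Passing the analyzer and policy in as plain callables keeps both swappable, which matches the pluggable design described above.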
