The Condenser system manages conversation history compression to keep agent context within LLM token limits. It reduces long event histories into condensed summaries while preserving the critical information the agent needs for reasoning.

Source: openhands-sdk/openhands/sdk/context/condenser/

Core Responsibilities

The Condenser system has four primary responsibilities:
  1. History Compression - Reduce event lists to fit within context windows
  2. Threshold Detection - Determine when condensation should trigger
  3. Summary Generation - Create meaningful summaries via LLM or heuristics
  4. View Management - Transform event history into LLM-ready views

Architecture

Key Components

| Component | Purpose | Design |
| --- | --- | --- |
| CondenserBase | Abstract interface | Defines the condense() contract |
| RollingCondenser | Rolling window base | Implements threshold-based triggering |
| LLMSummarizingCondenser | LLM summarization | Uses an LLM to generate summaries |
| NoOpCondenser | No-op implementation | Returns the view unchanged |
| PipelineCondenser | Multi-stage pipeline | Chains multiple condensers |
| View | Event view | Represents history for the LLM |
| Condensation | Condensation event | Metadata about a compression |

Condenser Types

NoOpCondenser

A pass-through condenser that performs no compression: condense() simply returns the view unchanged.
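A minimal sketch of this pass-through behaviour (illustrative stand-in, not the SDK class definition):
```python
# Illustrative sketch only -- not the SDK source. It shows the pass-through
# behaviour described above: condense() returns the view unchanged.
class NoOpCondenserSketch:
    def condense(self, view):
        # No threshold check and no summarization: the full event history
        # is handed to the LLM as-is.
        return view
```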

LLMSummarizingCondenser

Uses an LLM to generate summaries of conversation history.

Process:
  1. Check Threshold: Compare view size to configured limit (e.g., event count > max_size)
  2. Select Events: Identify events to keep (first N + last M) and events to summarize (middle)
  3. LLM Call: Generate summary of middle events using dedicated LLM
  4. Create Event: Wrap summary in Condensation event with forgotten_event_ids
  5. Add to History: Agent adds Condensation to event log and returns early
  6. Next Step: View.from_events() filters forgotten events and inserts summary
Configuration (see the usage sketch after this list):
  • max_size: Event count threshold before condensation triggers (default: 120)
  • keep_first: Number of initial events to preserve verbatim (default: 4)
  • llm: LLM instance for summarization (often cheaper model than reasoning LLM)
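A hedged usage example. The field names (llm, max_size, keep_first) come from this section, but the import paths, the constructor signature, and the model name are assumptions, not verified SDK API:
```python
# Hypothetical usage: import paths and constructor signature are assumptions
# inferred from the module path and the configuration fields documented above.
from openhands.sdk.context.condenser import LLMSummarizingCondenser  # assumed import path
from openhands.sdk.llm import LLM  # assumed import path

condenser = LLMSummarizingCondenser(
    llm=LLM(model="gpt-4o-mini"),  # placeholder model; often cheaper than the reasoning LLM
    max_size=120,   # condense once the view grows past 120 events
    keep_first=4,   # always preserve the first 4 events verbatim
)
```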

PipelineCondenser

Chains multiple condensers in sequence.

Use case: multi-stage compression (e.g., remove old events, then summarize, then truncate).
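One way the chaining could look, as a sketch. The class name is a stand-in, and short-circuiting when a stage returns a Condensation is an assumption consistent with the flow described below:
```python
# Illustrative sketch of the chaining pattern (not SDK source).
class PipelineCondenserSketch:
    def __init__(self, stages):
        self.stages = stages  # condensers applied in order

    def condense(self, view):
        for stage in self.stages:
            result = stage.condense(view)
            if hasattr(result, "forgotten_event_ids"):
                # Assumption: a Condensation short-circuits the chain, since the
                # agent must add it to history before later stages can see the
                # reduced view.
                return result
            view = result
        return view
```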

Condensation Flow

Trigger Mechanisms

Condensers can be triggered in two ways (both checks are sketched after this list).

Automatic Trigger:
  • When: Threshold exceeded (e.g., event count > max_size)
  • Who: Agent calls condenser.condense() each step
  • Purpose: Proactively keep context within limits
Manual Trigger:
  • When: CondensationRequest event added to history (via view.unhandled_condensation_request)
  • Who: Agent (on LLM context window error) or application code
  • Purpose: Force compression when context limit exceeded
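A sketch of how the two triggers can be combined into a single check, using only the view fields documented in this section (this is not the SDK's actual should_condense() logic):
```python
# Illustrative trigger check (not SDK source).
def needs_condensation(view, max_size=120):
    if len(view.events) > max_size:              # automatic: threshold exceeded
        return True
    return view.unhandled_condensation_request   # manual: pending CondensationRequest
```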

Condensation Workflow

Key Steps (see the sketch after this list):
  1. Threshold Check: should_condense() determines if condensation needed
  2. Event Selection: Identify events to keep (head + tail) vs forget (middle)
  3. Summary Generation: LLM creates compressed representation of forgotten events
  4. Condensation Creation: Create Condensation event with forgotten_event_ids and summary
  5. Return to Agent: Condenser returns Condensation (not View)
  6. History Update: Agent adds Condensation to event log and exits step
  7. Next Step: View.from_events() processes the Condensation to filter forgotten events and insert the summary
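A sketch of steps 5-7 from the agent's side. The duck-typing check on forgotten_event_ids stands in for whatever type check the SDK actually performs, and event_log / proceed are hypothetical parameters:
```python
# Illustrative handling of the condenser's return value (not SDK source).
def handle_condenser_result(result, event_log, proceed):
    if hasattr(result, "forgotten_event_ids"):   # the condenser returned a Condensation
        event_log.append(result)                 # step 6: add it to the event log...
        return None                              # ...and exit the step early
    return proceed(result)                       # otherwise it is a View: continue the step
```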

View and Condensation

View Structure

A View represents the conversation history as it will be sent to the LLM.

View Components (see the sketch after this list):
  • events: List of LLMConvertibleEvent objects (filtered by Condensation)
  • unhandled_condensation_request: Flag for pending manual condensation
  • condensations: List of all Condensation events processed
  • Methods: from_events() creates view from raw events, handling Condensation semantics
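A sketch of the from_events() semantics described above, not the SDK implementation. It assumes the Condensation events themselves are also excluded from the returned list, and make_summary_event is a hypothetical factory for the event that wraps the summary text:
```python
# Illustrative sketch: drop forgotten events and splice in summaries (not SDK source).
def view_events_sketch(events, make_summary_event):
    forgotten, summaries = set(), []
    for event in events:
        if hasattr(event, "forgotten_event_ids"):           # a Condensation event
            forgotten.update(event.forgotten_event_ids)
            summaries.append((event.summary_offset, event.summary))
    # Keep everything that was not forgotten and is not a Condensation itself...
    kept = [e for e in events
            if e.id not in forgotten and not hasattr(e, "forgotten_event_ids")]
    # ...then insert each summary at its recorded offset.
    for offset, text in summaries:
        kept.insert(offset, make_summary_event(text))
    return kept
```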

Condensation Event

When condensation occurs, a Condensation event is created.

Condensation Fields (modelled in the sketch after this list):
  • forgotten_event_ids: List of event IDs to filter out
  • summary: Compressed text representation of forgotten events
  • summary_offset: Index where summary event should be inserted
  • Inherits from Event: id, timestamp, source
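A minimal model of the documented fields (illustrative only; the inherited Event fields are omitted, and this is not the SDK class definition):
```python
# Illustrative model of the documented Condensation fields (not SDK source).
from dataclasses import dataclass

@dataclass
class CondensationSketch:
    # id, timestamp, and source are inherited from Event in the SDK and omitted here.
    forgotten_event_ids: list[str]  # event IDs filtered out of the view
    summary: str                    # compressed text for the forgotten events
    summary_offset: int             # index at which the summary event is inserted
```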

Rolling Window Pattern

RollingCondenser implements a common pattern for threshold-based condensation.

Rolling Window Strategy (the index arithmetic is sketched after this list):
  1. Keep Head: Preserve first keep_first events (default: 4) - usually system prompts
  2. Keep Tail: Preserve last target_size - keep_first - 1 events - recent context
  3. Summarize Middle: Compress events between head and tail into summary
  4. Target Size: After condensation, view has max_size // 2 events (default: 60)
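The index arithmetic implied by this strategy, as a stand-in sketch (the real RollingCondenser may differ in edge-case handling):
```python
# Illustrative head/middle/tail split for the rolling window (not SDK source).
def split_events(events, max_size=120, keep_first=4):
    target_size = max_size // 2              # view size after condensation (default: 60)
    tail_len = target_size - keep_first - 1  # reserve one slot for the summary event
    head = events[:keep_first]               # kept verbatim (system prompt, first messages)
    tail = events[-tail_len:]                # most recent context, kept verbatim
    middle = events[keep_first:len(events) - tail_len]  # events to be summarized
    return head, middle, tail
```
With the defaults, head (4) + summary (1) + tail (55) yields the 60-event target noted above.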

Component Relationships

How Condenser Integrates

Relationship Characteristics (see the loop sketch after this list):
  • Agent → State: Calls View.from_events() to get current view
  • Agent → Condenser: Calls condense(view) each step if condenser registered
  • Condenser → Agent: Returns View (proceed) or Condensation (defer)
  • Agent → Events: Adds Condensation event to log when returned
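A compact loop that ties these relationships together (illustrative only; the import path, state.events, run_llm_step, and the loop structure are assumptions, and the Condensation check reuses the duck-typing shortcut from the earlier sketches):
```python
# Illustrative agent loop; each relationship above maps to one line (not SDK source).
from openhands.sdk.context.condenser import View  # assumed import path

def run_steps(state, condenser, run_llm_step, max_steps=100):
    for _ in range(max_steps):
        view = View.from_events(state.events)       # Agent -> State: build current view
        result = condenser.condense(view)           # Agent -> Condenser
        if hasattr(result, "forgotten_event_ids"):  # Condenser -> Agent: Condensation (defer)
            state.events.append(result)             # Agent -> Events: record, retry next step
            continue
        run_llm_step(result)                        # Condenser -> Agent: View (proceed)
```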

See Also