Skip to main content
Version: 2.24-unstable

Pipeline Breakpoints

Learn how to pause and resume Haystack pipeline or Agent execution using breakpoints to debug, inspect, and continue workflows from saved snapshots.

Introduction​

Haystack pipelines support breakpoints for debugging complex execution flows. A Breakpoint allows you to pause the execution at specific components, inspect the pipeline state, and resume execution from saved snapshots. This feature works for any regular component as well as an Agent component.

You can set a Breakpoint on any component in a pipeline with a specific visit count. When triggered, the system stops the execution of the Pipeline and captures a snapshot of the current pipeline state. The state can be saved to a JSON file when snapshot file saving is enabled, see Snapshot file saving below. You can inspect and modify the snapshot and use it to resume execution from the exact point where it stopped.

You can also set breakpoints on an Agent, specifically on the ChatGenerator component or on any of the Tool specified in the ToolInvoker component .

Setting a Breakpoint on a Regular Component​

Create a Breakpoint by specifying the component name and the visit count at which to trigger it. This is useful for pipelines with loops. The default visit_count value is 0.

python
from haystack.dataclasses.breakpoints import Breakpoint
from haystack.core.errors import BreakpointException

## Create a breakpoint that triggers on the first visit to the "llm" component
break_point = Breakpoint(
component_name="llm",
visit_count=0, # 0 = first visit, 1 = second visit, etc.
snapshot_file_path="/path/to/snapshots" # Optional: save snapshot to file
)

## Run pipeline with breakpoint
try:
result = pipeline.run(data=input_data, break_point=break_point)
except BreakpointException as e:
print(f"Breakpoint triggered at component: {e.component}")
print(f"Component inputs: {e.inputs}")
print(f"Pipeline results so far: {e.results}")

A BreakpointException is raised containing the component inputs and the outputs of the pipeline up until the moment where the execution was interrupted, such as just before the execution of component associated with the breakpoint – the llm in the example above.

If a snapshot_file_path is specified in the Breakpoint and snapshot file saving is enabled, the system saves a JSON snapshot with the same information as in the BreakpointException. Snapshot file saving to disk is disabled by default; see Snapshot file saving below.

To access the pipeline state during the breakpoint we can both catch the exception raised by the breakpoint as well as specify where the JSON file should be saved, note that file saving is enabled must be enabled.

Using a custom snapshot callback​

You can pass a snapshot_callback to Pipeline.run() or Agent.run() to handle snapshots yourself instead of saving to a file. When a breakpoint is triggered or a snapshot is created on error, the callback is invoked with the PipelineSnapshot object. This is useful for saving snapshots to a database, sending them to a remote service, or custom logging.

python
from haystack.core.errors import BreakpointException
from haystack.dataclasses.breakpoints import Breakpoint, PipelineSnapshot

def my_snapshot_callback(snapshot: PipelineSnapshot) -> None:
# Custom handling: e.g. save to DB, send to API, or log
print(f"Snapshot at component: {snapshot.break_point}")

break_point = Breakpoint(component_name="llm", visit_count=0)
try:
result = pipeline.run(
data=input_data,
break_point=break_point,
snapshot_callback=my_snapshot_callback,
)
except BreakpointException as e:
print(f"Breakpoint triggered: {e.component}")

When snapshot_callback is provided, file-saving is skipped and the callback is responsible for handling the snapshot. The same parameter is available on Agent.run() for agent breakpoints.

Snapshot file saving​

Snapshot file saving to disk is disabled by default. To save snapshots as JSON files when a breakpoint is triggered or on pipeline failure, set the environment variable HAYSTACK_PIPELINE_SNAPSHOT_SAVE_ENABLED to "true" or "1" (case-insensitive). When enabled, snapshots are written to the path given by snapshot_file_path on the breakpoint, or to the default directory in Error Recovery with Snapshots when a run fails.

Custom snapshot_callback functions are always invoked when provided, regardless of this setting.

python
import os

# Enable saving snapshot files to disk
os.environ["HAYSTACK_PIPELINE_SNAPSHOT_SAVE_ENABLED"] = "true"

break_point = Breakpoint(
component_name="llm",
visit_count=0,
snapshot_file_path="/path/to/snapshots",
)
# When the breakpoint triggers, a JSON file will be written to /path/to/snapshots/

Resuming a Pipeline Execution from a Breakpoint​

To resume the execution of a pipeline from the breakpoint, pass the path to the generated JSON file at the run time of the pipeline, using the pipeline_snapshot.

Use the load_pipeline_snapshot() to first load the JSON and then pass it to the pipeline.

python
from haystack.core.pipeline.breakpoint import load_pipeline_snapshot

## Load the snapshot
snapshot = load_pipeline_snapshot("llm_2025_05_03_11_23_23.json")

## Resume execution from the snapshot
result = pipeline.run(data={}, pipeline_snapshot=snapshot)
print(result["llm"]["replies"])

Setting a Breakpoint on an Agent​

You can also set breakpoints in an Agent component. An Agent supports two types of breakpoints:

  1. Chat Generator Breakpoint: Pauses before LLM calls.
  2. Tool Invoker Breakpoint: Pauses before any tool execution.

A ChatGenerator breakpoint is defined as shown below. You need to define a Breakpoint as for a pipeline breakpoint and then an AgentBreakpoint where you pass the breakpoint defined before and the name of Agent component.

python
from haystack.dataclasses.breakpoints import AgentBreakpoint, Breakpoint, ToolBreakpoint

## Break at chat generator (LLM calls)
chat_bp = Breakpoint(component_name="chat_generator", visit_count=0)
agent_breakpoint = AgentBreakpoint(
break_point=chat_bp,
agent_name="my_agent"
)

To set a breakpoint on a Tool in an Agent, do the following:

First, define a ToolBreakpoint specifying the ToolInvoker component whose name is tool_invoker and then the tool associated with the breakpoint, in this case – a weather_tool .

Then, define an AgentBreakpoint passing the ToolBreakpoint defined before as the breakpoint.

python
from haystack.dataclasses.breakpoints import AgentBreakpoint, Breakpoint, ToolBreakpoint

## Break at tool invoker (tool calls)
tool_bp = ToolBreakpoint(
component_name="tool_invoker",
visit_count=0,
tool_name="weather_tool" # Specific tool, or None for any tool
)
agent_breakpoint = AgentBreakpoint(
break_point=tool_bp,
agent_name="my_agent"
)

Resuming Agent Execution​

When an Agent breakpoint is triggered, you can resume execution using the saved snapshot. Similar to the regular component in a pipeline, pass the JSON file with the snapshot to the run() method of the pipeline.

python
from haystack.core.pipeline.breakpoint import load_pipeline_snapshot

## Load the snapshot
snapshot_file = "./agent_debug/agent_chat_generator_2025_07_11_23_23.json"
snapshot = load_pipeline_snapshot(snapshot_file)

## Resume pipeline execution
result = pipeline.run(data={}, pipeline_snapshot=snapshot)
print("Pipeline resumed successfully")
print(f"Final result: {result}")

Error Recovery with Snapshots​

Pipelines automatically create a snapshot of the last valid state if a run fails. The snapshot contains inputs, visit counts, and intermediate outputs up to the failure. You can inspect it, fix the issue, and resume execution from that checkpoint instead of restarting the whole run.

Access the Snapshot on Failure​

Wrap pipeline.run() in a try/except block and retrieve the snapshot from the raised PipelineRuntimeError:

python
from haystack.core.errors import PipelineRuntimeError

try:
pipeline.run(data=input_data)
except PipelineRuntimeError as e:
snapshot = e.pipeline_snapshot
if snapshot is not None:
intermediate_outputs = snapshot.pipeline_state.pipeline_outputs
# Inspect intermediate_outputs to diagnose the failure

When snapshot file saving is enabled (see Snapshot file saving), Haystack also saves the same snapshot as a JSON file on disk. The directory is chosen automatically in this order:

  • ~/.haystack/pipeline_snapshot
  • /tmp/haystack/pipeline_snapshot
  • ./.haystack/pipeline_snapshot

Filenames will have the following pattern: {component_name}_{visit_nr}_{YYYY_MM_DD_HH_MM_SS}.json.

Resume from a Snapshot​

You can resume directly from the in-memory snapshot or load it from disk.

Resume from memory:

python
result = pipeline.run(data={}, pipeline_snapshot=snapshot)

Resume from disk:

python
from haystack.core.pipeline.breakpoint import load_pipeline_snapshot

snapshot = load_pipeline_snapshot("/path/to/.haystack/pipeline_snapshot/reader_0_2025_09_20_12_33_10.json")
result = pipeline.run(data={}, pipeline_snapshot=snapshot)