Version: 0.3

Building Your First Agent

This guide is the longer cousin of the Quick Start. We'll build a single-tool agent end-to-end, then layer on the things you'll want next: streaming UI, error handling, and a cancel button.

Step 1 — set up the provider and model

The provider is the connection to the LLM API; the Model describes which model you want to invoke and a few capabilities.

import os
from cubepi import Model
from cubepi.providers.anthropic import AnthropicProvider

provider = AnthropicProvider(api_key=os.environ["ANTHROPIC_API_KEY"])
model = Model(
    id="claude-sonnet-4-5-20250929",
    provider="anthropic",
    max_tokens=4096,        # response cap
    context_window=200_000,  # hard model limit; defaults are usually fine
    temperature=0.7,
)

The provider field on Model is a string label (e.g. "anthropic", "openai"). It's used by the framework to clamp thinking levels and tag responses — keep it stable, it doesn't have to match the provider's internal name.

Step 2 — declare a tool

Every tool is a Pydantic param model + an async execute function:

from pydantic import BaseModel
from cubepi import AgentTool, AgentToolResult, TextContent


class GetWeatherParams(BaseModel):
    city: str


async def get_weather(tool_call_id, params: GetWeatherParams, *, signal=None, on_update=None):
    # do real work here — call an HTTP API, query a DB, etc.
    return AgentToolResult(
        content=[TextContent(text=f"72°F and sunny in {params.city}")]
    )


weather_tool = AgentTool(
    name="get_weather",
    description="Get current weather for a city. Returns a short text summary.",
    parameters=GetWeatherParams,
    execute=get_weather,
)

A few details:

The Pydantic model is auto-converted to JSON Schema and sent to the model as part of the tool definition.
The execute signature is fixed: (tool_call_id, params, *, signal, on_update). The two keyword-only args are always passed — keep them in your signature even if you ignore them.
signal is an asyncio.Event that's set when the user cancels. Check it inside long-running work and bail out early.
on_update(partial) lets you stream incremental progress (covered in Tool Use).

Step 3 — assemble the agent

from cubepi import Agent

agent = Agent(
    provider=provider,
    model=model,
    system_prompt="You are a concise weather assistant.",
    tools=[weather_tool],
)

You can pass tools=[] (or omit it) for a plain chat agent.

agent.subscribe(listener) is how you observe the run. The listener receives every AgentEvent:

def on_event(event, signal=None):
    if event.type == "message_update" and event.stream_event.type == "text_delta":
        print(event.stream_event.delta, end="", flush=True)
    elif event.type == "tool_execution_start":
        print(f"\n→ calling {event.tool_name}({event.args})")
    elif event.type == "tool_execution_end":
        print(f"  ✓ done")

agent.subscribe(on_event)

You can register multiple listeners and they all receive every event. Subscribe before prompt() — events fire as soon as the run starts.

Step 5 — prompt and run

import asyncio

async def main():
    await agent.prompt("What's the weather in Tokyo?")

asyncio.run(main())

agent.prompt() does not return any value. The result lives on agent.state.messages (the full history) and agent.state.streaming_message (the current in-flight message, or None between turns).

Adding error handling

When provider.stream() raises, the agent loop still produces an AssistantMessage with stop_reason="error" and error_message filled in. The event sequence is: message_start → message_end → turn_end → agent_end.

You can either:

Catch in the subscriber, looking for event.type == "agent_end" and the last message's stop_reason:

def on_event(event, signal=None):
    if event.type == "agent_end":
        last = event.messages[-1]
        if getattr(last, "stop_reason", "") == "error":
            print(f"\nerror: {last.error_message}")

Or inspect agent.state.error_message after await agent.prompt(...) returns.

Adding a cancel button

agent.abort() sets the run-level signal. The provider stream short-circuits to "aborted", in-flight tools see signal.is_set() == True, and the loop emits agent_end cleanly.

async def main():
    task = asyncio.create_task(agent.prompt("Search for…"))
    await asyncio.sleep(0.5)
    agent.abort()
    await task              # always completes — never raises
    await agent.wait_for_idle()

Common pitfalls

RuntimeError: Agent is already processing a prompt. — You called prompt() twice without awaiting the first. Use await agent.wait_for_idle() or queue with steer() / follow_up() instead.
No text_delta events — Did you subscribe before calling prompt()? Listeners only see events emitted after registration.
Tool not found — The model invoked a tool whose name doesn't match any tool in tools=[...]. CubePi reports this as a tool result with is_error=True rather than crashing — check the tool_execution_end event's result.
Pydantic ValidationError swallowed — If the model produces malformed JSON, CubePi captures the validation error and feeds it back as a tool error result. The model usually corrects itself on the next turn.

Tool Use & Parallel Execution — multiple tools at once, sequential mode, terminate, incremental progress.
Streaming Events — the full event taxonomy.
Multi-turn Conversations — keeping state across turns, steer, follow_up, resume.

Step 1 — set up the provider and model​

Step 2 — declare a tool​

Step 3 — assemble the agent​

Step 4 — subscribe to events​

Step 5 — prompt and run​

Adding error handling​

Adding a cancel button​

Common pitfalls​

Next​