On this page

Agent Analytics taxonomy

Early Access

This feature is in Early Access. During this time, aspects of the functionality may still be developed, and this documentation may not always be up to date. If you have any questions, contact Amplitude Support.

This page is the reference for the data Agent Analytics produces: every [Agent] event, the properties on the enrichment events, and the default signals. To read these results in the UI, go to Analyze agent results. To emit the events from your code, go to the Agent Analytics SDK.

The taxonomy is configurable and still evolving during Open Beta. Treat the lists below as the default shape, and confirm the live set against your own event stream.

Data hierarchy

Agent Analytics models each agent interaction as a hierarchy:

  • Session: One job a user hands the agent, from start to finish. Amplitude identifies a session with the [Agent] Session ID property. The agent session differs from Amplitude's standard-analytics session ($session_id), which is the user's app or web visit.
  • Turn: A single back-and-forth exchange within a session: a user message, the agent's tool calls, and the AI response.
  • Span: A sub-turn step, such as a tool call, vector search, rerank, or guardrail.

Every user message, AI response, and tool call lands as an independent Amplitude event, so you can use agent data in funnels, cohorts, and retention charts without decomposing a trace first.

Event inventory

Agent Analytics produces these events. Your SDK instrumentation produces the first seven. The server enrichment pipeline produces the rest after a session closes.

SDK events

Your instrumentation produces these events as the agent runs. The [Agent] AI Response event carries the per-response model, provider, token, latency, and cost properties.

  • [Agent] User Message: a message the user sends to the agent.
  • [Agent] AI Response: the agent's response, with model, provider, tokens, latency, and cost.
  • [Agent] Tool Call: a function or tool the agent invokes.
  • [Agent] Embedding: an embedding or vector-search step.
  • [Agent] Span: any other pipeline step, such as a rerank or guardrail.
  • [Agent] Session End: marks the end of a session.
  • [Agent] Session Enrichment: your own session labels, sent in customer_enriched privacy mode.

Server enrichment events

After a session closes, the enrichment pipeline assesses it and writes two events back to your event stream.

Session Record

[Agent] Session Record lands once per session. It carries the session rollups, the always-on signal results, and quality flags.

Evaluator Result

[Agent] Evaluator Result lands once per evaluator per session. It is the unified event for every server-side evaluation: signal detectors, topic classifiers, and rubric scorers.

User feedback scores

[Agent] Score records explicit user feedback, such as a thumbs up or down on a response. Scores come from your application through the SDK's score() method, not from the enrichment pipeline. To send scores, go to Send user feedback (scores).

Signals

Signals are the default, always-on evaluators that Amplitude runs on every closed session. They land as [Agent] Evaluator Result events. You don't configure them, and Amplitude refines them over time, so treat them as directional.

Topics and custom evaluators

Beyond the default signals, you define your own topic models and evaluators. The enrichment taxonomy is fully configurable: topic model names such as query_intent and product_area, and evaluator names such as task_completion, come from configuration and differ per project. To create and refine your own evaluators, go to Create and refine custom evaluators.

Deprecated events

[Agent] Topic Classification is deprecated. Topic classifications now land as [Agent] Evaluator Result events with an output type of classification. Rubric scores also moved off [Agent] Score onto [Agent] Evaluator Result with an output type of score. [Agent] Score now carries user feedback only.

Was this helpful?