Skip to Content
FeaturesObservability

Observability

A deeper, platform-admin view of call quality — how fast the agent responds, where errors happen, and which calls need a look. It’s a debugging and quality tool, separate from the everyday Call history.

Where: Observability (visible to super-admins).


Fleet overview

Across recent calls you see:

  • Volume — calls in the last period and how many are active now.
  • Error rate — share of calls with errors.
  • Response times — median (p50) and worst (p95) latency.
  • Calls that need attention — sorted by errors or slow turns, so you triage the worst first.
  • Top issues — the most common error/warning types.

The latency waterfall

Open any call to see a per-turn breakdown of where each response’s time went:

SegmentWhat it is
WaitEndpointing — how long the agent waited to be sure the caller finished speaking.
STTTurning speech into text.
LLMThe model thinking (time to first token) — usually the biggest piece.
ActionTime spent in a booking/API/SMS action — shown as a concurrent bar (it overlaps the reply, so it isn’t double-counted).
TTSTurning the reply into speech.

This makes it obvious why a turn was slow — e.g. a slow model turn vs. a slow calendar lookup.

Errors & recordings

Each call also surfaces any errors or transport issues, the transcript, and the recording (if enabled) — everything you need to understand a bad call without guesswork.

Use Observability to answer “why did that call feel slow / go wrong?” — it’s for tuning and debugging, not day-to-day operation.

Last updated on