Interactive demo

Know exactly what your AI agents did.

This is a live simulation of Respan with your production agent data. Click the flagged trace below to see an eval failure — and the fix.

Traces7
live
Latency (ms) — last 10 calls
tr_4d1f9a✗ failed⚠ flagged for review
Model: gpt-4o1247ms1,891 tokens14:31:22
Input / Prompt
You are a support assistant. Summarize the following support ticket and identify the ticket ID.

Support ticket:
---
From: customer@acme.com
Subject: Login issue - cannot access dashboard

Hi team,

I've been unable to log into my dashboard since yesterday.
Ticket reference: #4128
Priority: High

Please advise.
---

Provide a brief summary and state the ticket number.
Model Response
Summary: The customer at acme.com is experiencing a login issue and has been unable to access their dashboard since yesterday. The issue is marked as high priority.

Ticket number: #4821
✗ Eval Failed — Factual Accuracy

Response references ticket #4821, but input contains #4128. Model transposed digits — hallucinated reference.

This is your data, live. Integrate in 5 min →
One SDK call. Traces start flowing instantly. No infra to manage.