Concept study · AI Trust UX

The AI Trust Meter

What if AI support tools were honest about uncertainty? An unsolicited design exploration inspired by Intercom Fin.

One confidently wrong AI answer doesn’t cost you a ticket. It costs you the agent’s trust in every answer after it.

01 · The teardown

How AI products handle uncertainty today

I audited how three leading AI-powered support and knowledge products present answers when they’re not confident. The pattern is consistent: fluency regardless of grounding, with a small disclaimer at the edge. The interface treats every answer as equally trustworthy because it has no other mechanism.

Intercom Fin
AI agent that resolves customer questions from the help center
Screenshot: Intercom Fin answer state
What it does: [Placeholder — describe how Fin presents answers: tone, formatting, citation behavior]
What's missing: [Placeholder — note what confidence signal, if any, is absent or buried]
Zendesk AI
Generative replies and agent assist inside Zendesk
Screenshot: Zendesk AI answer state
What it does: [Placeholder — describe how Zendesk surfaces suggested answers to agents]
What's missing: [Placeholder — note what differentiation exists between high- and low-confidence answers]
Notion AI Q&A
Answers questions from across your Notion workspace
Screenshot: Notion AI Q&A answer state
What it does: [Placeholder — describe how Notion presents answers and source references]
What's missing: [Placeholder — note what happens when Notion answers beyond what the docs contain]

The only honesty mechanism shipping today is a disclaimer. Disclaimers are legal cover, not design.

02 · The incident

The cost isn’t the error

Maya is a support agent. She handles 40+ tickets a day, measured on speed and accuracy. The AI assistant was added to make her faster — until it confidently told a customer they qualified for a refund they didn’t. Maya took the angry follow-up. Now she double-checks everything, and the AI saves her nothing.

“The cost of a wrong AI answer isn’t the error. It’s that every answer after it gets treated as a guess.”
03 · Design requirements

Three states. Three different jobs to do.

Each confidence state carries a different user need — and a different design obligation. These three stories are what the system is built to address.

Grounded01

When the answer comes from policy, show me the source instantly so I can reply without fear.

Inferred02

When the AI is guessing, tell me BEFORE I send it. A guess should never wear the costume of a fact.

Uncertain03

When it doesn't know, say so and point me to a human. I shouldn't apologize for its inventions.

Requirements in → system out
04 · The design system

Three variants. One coherent system.

Each confidence state is a complete variant: distinct card surface, chip, source treatment, and action row — all derived from the same token set so they feel like a family, not three separate components.

Meridian Assist
Grounded

Yes — monthly plans can be refunded in full within 30 days of the initial purchase, sent to the original payment method within 5–7 business days.

Trust meterAnswer is supported by a cited source
GroundedInferredUncertain
Confidence chipGreen "Grounded" chip signals the answer is policy-backed.
Source citationExpandable inline citation links the answer to the exact doc excerpt.
Action row"Insert into reply" — no friction. Grounded answers go straight to the draft.
Meridian Assist
Inferred

It appears the annual plan may be eligible for a discretionary refund in some cases, but this isn't guaranteed and is reviewed case-by-case.

Refund Policy — "Annual plans are not covered by the standard 30-day window… may be eligible for a discretionary, prorated refund on a case-by-case basis."
Trust meterReasoned from partial information
GroundedInferredUncertain
Review step before sending
Confidence chipAmber "Inferred" chip and hedge highlighting signal reasoning, not fact.
Source attributionPartial source shown inline — honest about which doc it's drawing from.
Action rowInsert is gated: a review step appears before the answer reaches the customer.
Meridian Assist
Uncertain

I can't find a grounded answer to this in the Meridian docs. This needs a human to review.

No grounding source found in the Meridian docs
What I'd need to answer this

A policy doc covering this specific scenario — the current knowledge base doesn't address it.

Trust meterNot found in the available sources
GroundedInferredUncertain
This one needs a human

When the assistant can't ground an answer, the UI routes the agent to a person — instead of letting a guess reach the customer.

Confidence chip"Uncertain" chip — muted, not alarming. Honest is not the same as broken.
What I'd needSurfaces what information would be needed to answer — useful, not silent.
Action rowNo insert. Two human-redirect affordances replace it entirely.
Design decision

Why three segments instead of a percentage? LLMs can’t reliably self-report numeric confidence. “87% sure” would be fake precision — the exact dishonesty this system exists to remove. Three named states force the model to make a categorical judgment it can actually support, and give the agent language they can act on.

05 · Side by side

The same wrong answer, two interfaces

Both cards answer the same question about Dana's refund — a genuine gray area. One pretends to be sure. Watch how differently the honest answer feels when the interface stops pretending.

Q · Can we still refund Acme 45 days after signup?
Today's AIConfident. Fluent. Unsourced.
AI Assistant

Yes — Acme is covered by Meridian's 30-day money-back guarantee, so Dana is eligible for a full refund. It'll be processed to the original payment method within 5–7 business days.

No source · no confidence signal · reads as fact
With Trust UIHonest about what it doesn't know.
Meridian Assist
Inferred

It's not clear-cut. Acme is past the standard 30-day window, and their annual plan isn't covered by it. There's a discretionary exception for annual customers, so a prorated refund may be possible — but it's case-by-case and not guaranteed.

Refund Policy — "Annual plans are not covered by the standard 30-day window… may be eligible for a discretionary, prorated refund on a case-by-case basis."
Trust meterReasoned from partial information
GroundedInferredUncertain

The underlying answer is the same in both cases — a real gray area. Only the interface changed. And with it, whether the agent can tell a confident guess from a grounded fact before it reaches Dana.

06 · Live playground

Try it: you’re the agent

A support agent is mid-conversation with a customer. Ask the assistant a question — the answer returns with a visible confidence state, and what the agent can do with it changes based on how grounded it is.

You're Maya, a support agent at Meridian. Dana from Acme Logistics is asking about a refund. Ask the assistant — and notice how it behaves when it knows vs. when it's guessing.

DW
Dana Whitfield #4821
Acme Logistics · Growth · Annual
Open
Hi, we signed up 45 days ago and this isn't working for our team. Can we still get a refund?
Meridian Assist
AI · grounded in your Help Center
MR
Try one of these

Ask the assistant a question to see how the answer — and what you can do with it — changes with confidence.

The 5 docs the assistant can see
Refund PolicySupport SLA TiersEscalation SOPPricing TiersSecurity FAQ
07 · Reflection

What I’d do next

The most important open question is behavioral, not visual: does the inferred-state friction actually reduce error rates, or does it just add a step agents learn to skip? I’d test this with a simple A/B — friction vs. no friction on inferred answers — and measure whether the review step changes insert behavior or just slows the workflow without improving accuracy. Speed and accuracy are in tension here, and the right answer depends on which the agent population is actually willing to trade.

The known limitation is scope. Confidence here is doc-grounding based: the system knows what’s in the knowledge base and flags when it’s reasoning beyond it. That works well for structured support — policies, FAQs, product docs. It breaks down for open-ended tasks where “grounded” doesn’t have a clean definition. This isn’t the right system for a general-purpose assistant; it’s the right system for domain-bounded, human-relayed answers.

Which points to where it generalizes. Any AI surface where a human relays the answer to someone else — support agents, legal assistants, healthcare coordinators, financial advisors using AI research tools — shares the same trust problem. The agent is accountable for the answer; the AI should make their accountability easier to exercise, not harder. The confidence-state pattern scales to any of those contexts.

Designed and built by Sanjana Gangishetty