AI & HUMAN INTERFACES

Designing for AI uncertainty

Last updated: June 2026

Designing for AI uncertainty means explicitly communicating the probabilistic and fallible nature of AI outputs so users can calibrate their trust, verify when needed, and make informed decisions rather than blindly accepting or rejecting results.

The Principle

Large language models and generative AI systems are fundamentally stochastic. They predict tokens based on probabilities learned from training data, not from grounded understanding or deterministic logic. The same prompt can produce meaningfully different outputs depending on temperature settings, context window, and random sampling. Hallucinations — confident-sounding but incorrect information — are not occasional bugs but an inherent feature of how these models work.

Good AI UX acknowledges this reality instead of hiding it. Users need visible signals about confidence, sources, limitations, and variability so they can treat AI as a powerful but imperfect collaborator rather than an oracle. This includes confidence indicators, citation of sources, edit/regenerate/dismiss patterns, and clear language about what the model can and cannot do reliably.

In my own work building with AI, this was a painful but necessary lesson. Early prototypes presented outputs as clean, authoritative answers. Users either trusted them too much (leading to errors) or dismissed the tool entirely after one hallucination. Shifting to honest uncertainty design — showing confidence ranges, surfacing sources, and making verification easy — dramatically improved both usability and trust. The interface stopped pretending the AI was perfect and started helping users work with its actual nature.

Why It Matters for Design & Building

Uncertainty is the central challenge of AI interfaces. Without thoughtful design, users swing between dangerous over-trust and unproductive skepticism. Neither serves them. Designing for uncertainty turns a fundamental limitation into a feature: it builds calibrated trust, reduces costly mistakes, and helps users develop healthy mental models of what AI can and cannot do.

As a Design Engineer, this has become my most important lens when working with generative systems. In one writing assistance tool, we moved from presenting AI suggestions as polished final text to showing them with visible confidence indicators, inline source highlights, and easy “edit this part” affordances. Error rates dropped and user satisfaction increased because people could use the AI more effectively without constant second-guessing.

This principle is especially critical for calm technology and human-centered AI. Interfaces that obscure uncertainty create stress and erode agency. Honest ones reduce cognitive load by making the AI’s limitations legible, allowing users to stay in control. The deeper practice is humility: we are not designing perfect intelligence; we are designing collaboration between fallible humans and probabilistic machines.

Real-World Examples

Perplexity.ai demonstrates strong handling of uncertainty. It presents answers with inline citations, clearly marks when information might be uncertain, and makes source verification trivial. Users learn to trust the system where appropriate and verify where needed, creating a healthy collaborative dynamic.

Many early chatbot implementations illustrate the opposite. They present every response with equal confidence and polished language, leading users to treat hallucinations as facts. When errors inevitably appear, trust collapses and users abandon the tool.

GitHub Copilot offers a mixed but evolving case. Its inline suggestions include acceptance rates and context awareness, but early versions sometimes presented incorrect code with the same visual weight as correct suggestions. Later improvements with better confidence signaling and “explain this” options have helped developers calibrate their reliance more effectively.

References

Bubeck, S., et al. (2023). "Sparks of Artificial General Intelligence: Early experiments with GPT-4." arXiv.
Weidinger, L., et al. (2022). "Taxonomy of Risks posed by Language Models." ACM FAccT.
Jacovi, A., et al. (2021). "Formalizing Trust in Artificial Intelligence: Prerequisites, Causes and Goals of Human Trust in AI." ACM FAccT.
Bai, Y., et al. (2022). "Constitutional AI: Harmlessness from AI Feedback." arXiv.
Budiu, R. (2023). "Explainable AI in Chat Interfaces." Nielsen Norman Group. nngroup.com

The Principle

Why It Matters for Design & Building

Real-World Examples

Related entries

References