Human-adaptive AI.
Trustworthy by design.
Hopperlace builds the infrastructure AI systems need to know when to act, when to defer, and how to coordinate with humans on high-stakes decisions — across evidence synthesis, clinical decision support, legal reasoning, and the other domains where confident errors are most costly.
The architecture
Three layers. One architecture.
Layer 1
Knowledge & Evaluation Layer
Evidence Synthesis
AI systems need to know what they don’t know. This layer builds the evaluation infrastructure for appropriate deferral — measuring not just accuracy but when the AI should stop and hand off. Applicable to any domain where AI confidence is decision-consequential: clinical triage, legal discovery, content moderation, scientific review. Current live application: systematic review screening.
Layer 2
Routing & Orchestration Layer
LetsBegin
Getting the human in the loop isn’t enough — it matters when, how, and in what form. This layer manages the handoff: sequencing decisions, surfacing one thing at a time, and routing based on confidence and complexity. Designed around human attention and cognitive capacity, so the human who receives the task can actually do it well. Applicable to any human-AI workflow where attention is the bottleneck.
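The sequencing idea above — surface one thing at a time, prioritised by confidence and complexity — can be sketched as a small priority queue. This is a minimal illustration, not Hopperlace's implementation; the class, scores, and priority formula are all hypothetical.

```python
import heapq
from dataclasses import dataclass, field

@dataclass(order=True)
class HandoffTask:
    # Lower priority value = surfaced sooner. Combines model
    # uncertainty and task complexity (both hypothetical 0-1 scores).
    priority: float
    description: str = field(compare=False)

class HandoffQueue:
    """Surfaces one deferred task at a time, hardest-to-automate first."""

    def __init__(self):
        self._heap = []

    def defer(self, description, confidence, complexity):
        # Less confident, more complex tasks jump the queue
        # (illustrative formula, not a production heuristic).
        heapq.heappush(self._heap, HandoffTask(confidence - complexity, description))

    def next_for_human(self):
        # One item at a time, so the reviewer's attention is never split.
        return heapq.heappop(self._heap).description if self._heap else None

q = HandoffQueue()
q.defer("Ambiguous study abstract", confidence=0.55, complexity=0.9)
q.defer("Borderline exclusion", confidence=0.70, complexity=0.3)
print(q.next_for_human())  # the low-confidence, high-complexity task comes first
```

The design choice worth noting: routing is a property of the queue, not of the model, so the same handoff logic works for any human-AI workflow where attention is the bottleneck.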
Layer 3
Trust & Governance Layer
Value Compass
Trust in AI systems has to be earned, not assumed. This layer makes AI behavior legible — measuring alignment between how a system acts and the values of the people and organisations relying on it, so the right tool gets used in the right situation, with the right expectations. Applicable wherever AI tool selection and trust calibration matter.
Our first application
Deference-aware evaluation for systematic review
We chose screening deliberately. Within the evidence synthesis workflow, screening is the sub-task where deference-aware evaluation creates the most value — an overconfident AI screener corrupts every downstream step of the review, while an overly cautious one wipes out the time savings that justify using AI at all. Screening is the test case that proves the evaluation layer works in production.
Evidence Synthesis AI handles the confident decisions autonomously — the clear includes, the clear excludes — and surfaces only the genuinely ambiguous studies for human review. The time savings come from the AI acting decisively where it’s well-calibrated to be right; the evidence quality is protected by the system knowing when it isn’t. Every decision comes with the reasoning behind it. Reviewers can override at any point. Every action is logged.
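The act-or-defer behaviour described above amounts to confidence-banded routing. A minimal sketch, assuming a calibrated inclusion probability and placeholder thresholds (the function name and cutoffs are illustrative, not Hopperlace's production values):

```python
def route_screening_decision(include_prob, act_high=0.95, act_low=0.05):
    """Route one screening decision by calibrated confidence.

    include_prob: the model's calibrated probability that the study
    meets the inclusion criteria. Thresholds are hypothetical.
    """
    if include_prob >= act_high:
        return "include"          # clear include: act autonomously
    if include_prob <= act_low:
        return "exclude"          # clear exclude: act autonomously
    return "defer_to_human"       # genuinely ambiguous: surface for review

for p in (0.99, 0.02, 0.60):
    print(p, "->", route_screening_decision(p))
```

In practice each returned decision would also carry its reasoning and be written to an audit log, per the guarantees described above.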
Research
White Paper · March 2026 · Hopperlace Research · DOI: 10.17605/OSF.IO/A69YH
Deference-Aware Evaluation for Human-in-the-Loop AI Systems
A framework for evaluating AI systems not just on accuracy, but also on their capacity to recognise the limits of their own competence and defer to human judgement when appropriate. The paper identifies two failure modes that standard accuracy metrics conflate — penalised conservatism and genuine confident errors — and introduces deference-aware metrics that distinguish them. Validated across nine frontier models and 258 systematic review studies. The methodology is domain-general; systematic review is the first application.
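The two failure modes can be made concrete with a toy metric. The sketch below is illustrative only — the metric names and formulas are not the paper's definitions — but it shows why plain accuracy conflates them: given per-item records of (prediction, deferred?, gold label), it separately scores confident errors (acted and was wrong) and penalised conservatism (deferred on items the model would have got right).

```python
def deference_metrics(records):
    """records: list of (predicted_label, deferred, gold_label) tuples.

    Separates two failures that a single accuracy number conflates:
      - confident_error_rate: share of autonomous decisions that were wrong
      - over_deferral_rate: share of deferrals where the model's own
        answer was actually correct (penalised conservatism)
    Names and formulas are illustrative, not the paper's exact metrics.
    """
    acted = [(p, g) for p, d, g in records if not d]
    deferred = [(p, g) for p, d, g in records if d]
    return {
        "confident_error_rate":
            sum(p != g for p, g in acted) / len(acted) if acted else 0.0,
        "over_deferral_rate":
            sum(p == g for p, g in deferred) / len(deferred) if deferred else 0.0,
    }

demo = [("include", False, "include"),   # acted, correct
        ("exclude", False, "include"),   # acted, wrong: confident error
        ("include", True,  "include"),   # deferred, would have been right
        ("exclude", True,  "include")]   # deferred, and rightly so
print(deference_metrics(demo))
```

Two systems with identical accuracy can score very differently here, which is the point: one fails by overconfidence, the other by wiping out its own time savings.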
Team
Who we are
Yuyu Shen
Founder
AI product leader with a decade building production AI systems across fintech, enterprise software, and consumer technology — at Meta, Walmart, Beamery, Cleo, and others. Founded Hopperlace to close a gap that kept surfacing: we’re deploying AI in high-stakes contexts without the evaluation infrastructure to know when it’s actually safe to trust. That changes everything about how those systems should be designed.
Martin Walker, MPH
Co-founder, Evidence Synthesis
Background in evidence-based health improvement and systematic review evidence synthesis. Brings a passion for better public outcomes and deep domain experience, ensuring Evidence Synthesis AI is built with the right rigour and toward the right goals.