Decagon Launched Duet Autopilot to Bring Self-Improving AI to Customer Experience Operations

NEWS 10 June 2026

The platform enables organizations to automate customer engagement while continuously refining performance through verified feedback loops.

Decagon Launched Duet Autopilot to Bring Self-Improving AI to Customer Experience Operations

Image source: Public Domain

Decagon, the leader in conversational AI agents for concierge customer experiences, announced Duet Autopilot, the first agent to deliver automatic and verifiable self-improvement for CX agents.

To measure Autopilot’s efficacy, Decagon also built DuetBench, the industry's first benchmark for evaluating agent self-improvement end-to-end. Against it, Duet Autopilot passed 93% of diagnostic tasks, exceeding the average human score.

"Autopilot is a shift from building agents by hand to managing agents that improve themselves," said Alan Yiu, VP of Product at Decagon. "Teams set the direction and review the work; Autopilot handles the diagnosing, testing, and editing that used to consume their week. Every fix compounds, which ultimately empowers businesses to provide their customers with a 24/7 AI concierge that gets measurably better with every interaction."

Closing the loop on agent improvement

Until now, improving an AI agent has been bottlenecked by manual work. As customer signals accumulate, teams must interpret feedback, decide on changes, test them, and ship improvements by hand. Too many cycles go into identifying and prioritizing high-impact updates, and even then, manual effort caps how much gets done. Duet Autopilot removes that constraint by acting on the full breadth of production signals.

Duet Autopilot delivers three core capabilities that work together as a continuous loop:

Automated agent improvement: Autopilot continuously translates production signals into proposed updates, acting on opportunities ranging from highest priority to small adjustments.
Self-validation: Every proposed change is tested against the original conversation that surfaced the issue, regression tests, and a curated golden set representing real customer personas and intents. If a change doesn’t pass those tests, Autopilot keeps iterating until it does.
Enterprise governance: Teams set guidance up front using brand voice, writing standards, policy preferences, and off-limits rules. Every change surfaces as a versioned update with the issues found, validation results, and exact diffs, requiring human approval before going live.

Because Autopilot is itself a Decagon agent, it is subject to its own improvement loop. Every reviewer correction and successful outcome feeds back into how it operates, so each cycle produces higher-quality updates than the last. This way, agent performance improves not at a fixed rate, but exponentially.

Proven in the field, formalized in the benchmark

Duet Autopilot is being validated with a cohort of enterprise customers and design partners across financial services, retail, and consumer technology, who are measuring its impact on resolution rates, escalation rates, and coverage.

“At our scale, manually reviewing conversations for errors isn't an option,” said Matt McCollum, senior manager of customer experience at Opendoor. “Decagon Autopilot frees our team to focus on decisions rather than digging through logs. It surfaces what changed, what was considered, and why. That transparency is what makes AI actually trustworthy in production.”

Furthermore, DuetBench fills a gap in how conversational AI agents are evaluated. Existing benchmarks measure whether an agent can resolve a fixed set of issues, but they don’t yet measure the improvement loop. By contrast, DuetBench measures whether Autopilot can make verifiable agent improvements, rather than producing plausible-looking changes.

Decagon Launched Duet Autopilot to Bring Self-Improving AI to Customer Experience Operations

The platform enables organizations to automate customer engagement while continuously refining performance through verified feedback loops.

United States of AMERICA

West Monroe Introduced WestMonroe.ai to Deliver Public Access to AI Business Strategy Agents

Kong Rolled Out Ascent to Help Enterprises Transition From Legacy APIs to AI-Ready Architectures

New Relic Launched AI Coding Observability to Enhance Oversight of AI-Powered Development Tool

Expensify Introduced MCP to Advance AI-Driven Expense Management and Automation

SPACInsider Strengthened Financial Data Platform With AI-Powered SPAC Database Access

Cardo AI Strengthened Asset Finance Intelligence Platform With Launch of Cash Flow Modeling Capability

Pega Introduced AI-Powered Modernization Capability on AWS for Legacy Application Transformation

EUROPE

Tempus Introduced Enhanced Lens Platform to Advance AI-Powered Oncology Development

Autobrains and Uber Launched Strategic Robotaxi Initiative Powered by NVIDIA DRIVE Hyperion

Nucs AI Partnered to Advance AI-Based Response Prediction for Therapeutic Radioconjugates

AppClose® Expanded Co-Parenting Support With Spanish Language Access and AI Communication Assistance

bizZone Introduced Autrinity, an AI-Powered Association Management System Built for the Future

Appriss Retail Expanded Retail Protection Capabilities With Launch of Agentic AI Platform Sidekick

ASIA

SPACInsider Strengthened Financial Data Platform With AI-Powered SPAC Database Access

Hitachi Strengthened AI Transformation Efforts Through Strategic Collaboration With Intel

OpsGuru Debuted Agentic Delivery to Modernize Enterprise AI Adoption and Service Delivery

Outreach Strengthened Revenue Intelligence Platform With Agentic AI and MCP Integration

Utopai Studios Announced PAI 2.0 to Transform AI-Powered Storytelling and Media Production

Decagon Launched Duet Autopilot to Bring Self-Improving AI to Customer Experience Operations

United States of AMERICA

EUROPE

ASIA

Keep Up to Date with the Latest Artificial Intelligence Industry NEWS & Insights